Constantin Fürst
|
e99bf619c2
|
handle memory allocation outside of the cache, pre-allocate in benchmark and memset to hopefully guarantee no pagefaults will be encountered
|
11 months ago |
Constantin Fürst
|
19ef2df856
|
update perf profile with manually disabled huge pages
|
11 months ago |
Constantin Fürst
|
d4677b3c59
|
measure performance without huge pages on
|
11 months ago |
Constantin Fürst
|
7afcffbefa
|
set correct node in perf recording script
|
11 months ago |
Constantin Fürst
|
87b71e6300
|
remove warings for weak access failure
|
11 months ago |
Constantin Fürst
|
c86d517444
|
fix the published results for prefetching
|
11 months ago |
Constantin Fürst
|
94b3576d5a
|
publish measurements from benchmark
|
11 months ago |
Constantin Fürst
|
8999fe4ca3
|
share current config for qdp bench from vampir
|
11 months ago |
Constantin Fürst
|
79a7e9637c
|
fix benchmark by waiting and not dropping barrier in aggrj
|
11 months ago |
Constantin Fürst
|
01ecdce88f
|
set execution engine to hardware
|
11 months ago |
Constantin Fürst
|
bc1c3d0096
|
fix block size for access by cacher in scanb
|
11 months ago |
Constantin Fürst
|
f4159d8149
|
fix old flag name from FLAG_TRIGGER_PAGES to FLAG_FORCE_MAP_PAGES
|
11 months ago |
Constantin Fürst
|
99552b3de4
|
add option for forcing map of pages by touching each one with a write at its begin, required as somehow behaviour changed, cache was experiencing page fault errors and handling by dsa is simply too slow
|
11 months ago |
Constantin Fürst
|
5044b4419c
|
make load balancing thread-local to reduce atomic cost
|
11 months ago |
Constantin Fürst
|
f9d47d3a45
|
add scanb back to the barrier, now other threads will wait for finish of work submission
|
11 months ago |
Constantin Fürst
|
006b856c44
|
resolve issues from the recent reset of qdp benchmark
|
11 months ago |
Constantin Fürst
|
de1de9134b
|
reset benchmark
|
11 months ago |
Constantin Fürst
|
c7b91e400f
|
reset to using hardware path, sw was only temporarily used for debugging
|
11 months ago |
Constantin Fürst
|
7abd83ca45
|
fix offloading cache bug, introduced by not using maxptr for secondary invalid value, more care needed to be taken on when the secondary invalid would be set and how, now we set in in cachedata:init for both the local task in access and the one in cache state
|
11 months ago |
Constantin Fürst
|
4a587a36e2
|
remove overlap-execution barriers and run for the entire block
|
11 months ago |
Constantin Fürst
|
73914d4cdd
|
Merge branch 'master' of https://git.constantin-fuerst.com/constantin/bachelor-thesis
|
11 months ago |
Constantin Fürst
|
8dfefe665e
|
add warnings to the offloading cacher
|
11 months ago |
Constantin Fürst
|
c393b8eb88
|
improve load balancing node assignment
|
11 months ago |
Constantin Fürst
|
b9cfb818fd
|
Merge branch 'master' of https://git.constantin-fuerst.com/constantin/bachelor-thesis
|
11 months ago |
Constantin Fürst
|
21702d5309
|
remove sub and overchunking for scanb caching, use the per-iteration barriers again
|
11 months ago |
Constantin Fürst
|
7d614769db
|
remove forgotten access to load timer
|
11 months ago |
Constantin Fürst
|
6a4eec37ca
|
remove vector-load timing as its too expensive
|
11 months ago |
Constantin Fürst
|
93a281fa26
|
improve debug output for relwithdebinfo in qdp, fix filename for record perf script, add perf.svg with better debug info
|
11 months ago |
Constantin Fürst
|
624e8b55ea
|
add script to record perf and make the flame graph
|
11 months ago |
Constantin Fürst
|
942d7be7e9
|
redo benchmarks for qdp
|
11 months ago |
Constantin Fürst
|
a83f208cd2
|
fix time evaluation for qdp bench
|
11 months ago |
Constantin Fürst
|
cc35131df8
|
Merge branch 'master' of https://git.constantin-fuerst.com/constantin/bachelor-thesis
|
11 months ago |
Constantin Fürst
|
cc8d203771
|
redo benchmarks for qdp, move previous results to old (folder)
|
11 months ago |
Constantin Fürst
|
94669924c8
|
implement cache in aggrj for qdp
|
11 months ago |
Constantin Fürst
|
b7559a18a6
|
remove cerr warning
|
11 months ago |
Constantin Fürst
|
c7877ecdf6
|
remove skeleton of now defunct function in qdp
|
11 months ago |
Constantin Fürst
|
20c6e54df7
|
remove broken implementation for non-divisible chunk-group-thread-counts
|
11 months ago |
Constantin Fürst
|
a3a8dff1aa
|
reset some changes to the aggregation and filter functions not quite needed
|
11 months ago |
Constantin Fürst
|
69aec6fa48
|
add plotter for the results of qdp which turns them into a donut-graph
|
11 months ago |
Constantin Fürst
|
122eab35b7
|
modify benchmarking code to measure time spent loading vectors too
|
11 months ago |
Constantin Fürst
|
3fc9bc6145
|
finish the section on using cache with qdp, also add a section on possible usage of dwcas for aba problem solution
|
11 months ago |
Constantin Fürst
|
98835799d4
|
also copy the invalid handlers pointer (see last commit) on CacheData copy constructor
|
11 months ago |
Constantin Fürst
|
75d72ce556
|
dont use uint64::max as invalid pointer but allocate a valid but empty vector, using its address as invalid
|
11 months ago |
Constantin Fürst
|
43254d0f3c
|
note that the cost observed for swq submission is lower than what reese kuper saw
|
11 months ago |
Constantin Fürst
|
8dcc248620
|
first pass of modifications to abstract and intro
|
11 months ago |
Constantin Fürst
|
b55196a7a7
|
redo reference types, using misc with howpublished for content that is not scientific work
|
11 months ago |
Constantin Fürst
|
94ee998c2e
|
finalize draft of abstract and intro
|
11 months ago |
Constantin Fürst
|
9bb68a7244
|
remove unnecessary cite by andres advice
|
11 months ago |
Constantin Fürst
|
8b635f7e87
|
update bachelor.pdf with recent changes
|
11 months ago |
Constantin Fürst
|
58f36279ec
|
reformulate implementation chapter - our chocie has changed and we now use push-pull and not smart-copy anymore for load balancer
|
11 months ago |