Constantin Fürst
|
a4dac61730
|
change config to allow 64 threads for stage 1 and 32 for stage 2 in all benchmarks
|
11 months ago |
Constantin Fürst
|
567a24f8c0
|
add hbm baseline and reorganize folders again
|
11 months ago |
Constantin Fürst
|
f6c43a6659
|
restructure evaluation results folder again
|
11 months ago |
Constantin Fürst
|
6070f320f5
|
add results for distributed locations prefetching
|
11 months ago |
Constantin Fürst
|
3c7c7852a5
|
remeassure performance for out of cache allocation
|
11 months ago |
Constantin Fürst
|
b710aec5fe
|
restructure evaluation results, add new results with out of cache allocation
|
11 months ago |
Constantin Fürst
|
c5022105cb
|
publish current configuration for testing
|
11 months ago |
Constantin Fürst
|
16e47a862f
|
allocate the correct amount of chunks for caching (missing was the run count) and add them to the queue for each run
|
11 months ago |
Constantin Fürst
|
05ffea0bdb
|
use the correct free-function in cache deallocation on error in waitoncompletion
|
11 months ago |
Constantin Fürst
|
e99bf619c2
|
handle memory allocation outside of the cache, pre-allocate in benchmark and memset to hopefully guarantee no pagefaults will be encountered
|
11 months ago |
Constantin Fürst
|
19ef2df856
|
update perf profile with manually disabled huge pages
|
11 months ago |
Constantin Fürst
|
d4677b3c59
|
measure performance without huge pages on
|
11 months ago |
Constantin Fürst
|
7afcffbefa
|
set correct node in perf recording script
|
11 months ago |
Constantin Fürst
|
87b71e6300
|
remove warings for weak access failure
|
11 months ago |
Constantin Fürst
|
c86d517444
|
fix the published results for prefetching
|
11 months ago |
Constantin Fürst
|
94b3576d5a
|
publish measurements from benchmark
|
11 months ago |
Constantin Fürst
|
8999fe4ca3
|
share current config for qdp bench from vampir
|
11 months ago |
Constantin Fürst
|
79a7e9637c
|
fix benchmark by waiting and not dropping barrier in aggrj
|
11 months ago |
Constantin Fürst
|
01ecdce88f
|
set execution engine to hardware
|
11 months ago |
Constantin Fürst
|
bc1c3d0096
|
fix block size for access by cacher in scanb
|
11 months ago |
Constantin Fürst
|
f4159d8149
|
fix old flag name from FLAG_TRIGGER_PAGES to FLAG_FORCE_MAP_PAGES
|
11 months ago |
Constantin Fürst
|
99552b3de4
|
add option for forcing map of pages by touching each one with a write at its begin, required as somehow behaviour changed, cache was experiencing page fault errors and handling by dsa is simply too slow
|
11 months ago |
Constantin Fürst
|
5044b4419c
|
make load balancing thread-local to reduce atomic cost
|
11 months ago |
Constantin Fürst
|
f9d47d3a45
|
add scanb back to the barrier, now other threads will wait for finish of work submission
|
11 months ago |
Constantin Fürst
|
006b856c44
|
resolve issues from the recent reset of qdp benchmark
|
11 months ago |
Constantin Fürst
|
de1de9134b
|
reset benchmark
|
11 months ago |
Constantin Fürst
|
c7b91e400f
|
reset to using hardware path, sw was only temporarily used for debugging
|
11 months ago |
Constantin Fürst
|
7abd83ca45
|
fix offloading cache bug, introduced by not using maxptr for secondary invalid value, more care needed to be taken on when the secondary invalid would be set and how, now we set in in cachedata:init for both the local task in access and the one in cache state
|
11 months ago |
Constantin Fürst
|
4a587a36e2
|
remove overlap-execution barriers and run for the entire block
|
11 months ago |
Constantin Fürst
|
73914d4cdd
|
Merge branch 'master' of https://git.constantin-fuerst.com/constantin/bachelor-thesis
|
11 months ago |
Constantin Fürst
|
8dfefe665e
|
add warnings to the offloading cacher
|
11 months ago |
Constantin Fürst
|
c393b8eb88
|
improve load balancing node assignment
|
11 months ago |
Constantin Fürst
|
b9cfb818fd
|
Merge branch 'master' of https://git.constantin-fuerst.com/constantin/bachelor-thesis
|
11 months ago |
Constantin Fürst
|
21702d5309
|
remove sub and overchunking for scanb caching, use the per-iteration barriers again
|
11 months ago |
Constantin Fürst
|
7d614769db
|
remove forgotten access to load timer
|
11 months ago |
Constantin Fürst
|
6a4eec37ca
|
remove vector-load timing as its too expensive
|
11 months ago |
Constantin Fürst
|
93a281fa26
|
improve debug output for relwithdebinfo in qdp, fix filename for record perf script, add perf.svg with better debug info
|
11 months ago |
Constantin Fürst
|
624e8b55ea
|
add script to record perf and make the flame graph
|
11 months ago |
Constantin Fürst
|
942d7be7e9
|
redo benchmarks for qdp
|
11 months ago |
Constantin Fürst
|
a83f208cd2
|
fix time evaluation for qdp bench
|
11 months ago |
Constantin Fürst
|
cc35131df8
|
Merge branch 'master' of https://git.constantin-fuerst.com/constantin/bachelor-thesis
|
11 months ago |
Constantin Fürst
|
cc8d203771
|
redo benchmarks for qdp, move previous results to old (folder)
|
11 months ago |
Constantin Fürst
|
94669924c8
|
implement cache in aggrj for qdp
|
11 months ago |
Constantin Fürst
|
b7559a18a6
|
remove cerr warning
|
11 months ago |
Constantin Fürst
|
c7877ecdf6
|
remove skeleton of now defunct function in qdp
|
11 months ago |
Constantin Fürst
|
20c6e54df7
|
remove broken implementation for non-divisible chunk-group-thread-counts
|
11 months ago |
Constantin Fürst
|
a3a8dff1aa
|
reset some changes to the aggregation and filter functions not quite needed
|
11 months ago |
Constantin Fürst
|
69aec6fa48
|
add plotter for the results of qdp which turns them into a donut-graph
|
11 months ago |
Constantin Fürst
|
122eab35b7
|
modify benchmarking code to measure time spent loading vectors too
|
11 months ago |
Constantin Fürst
|
3fc9bc6145
|
finish the section on using cache with qdp, also add a section on possible usage of dwcas for aba problem solution
|
11 months ago |