Constantin Fürst
|
4a6529b111
|
revamp chunk calculation
|
11 months ago |
Constantin Fürst
|
6502f95bb2
|
correct mode naming of the execution modes
|
11 months ago |
Constantin Fürst
|
159a8215a2
|
update results to latest
|
11 months ago |
Constantin Fürst
|
2f9d059252
|
add preliminary results, modify the launch script and provide macro-based selection of parameters for the three modes {dram,hbm,prefetch}
|
11 months ago |
Constantin Fürst
|
1820166e6f
|
slight refactoring of qdp benchmark and add mandatory wait to cachedata in aggrj
|
11 months ago |
Constantin Fürst
|
52132522a3
|
fix scanb not working with less than aggrj threads
|
11 months ago |
Constantin Fürst
|
e34b4df7e6
|
fix wrong linebreak in result writer
|
11 months ago |
Constantin Fürst
|
f1ac2a07b2
|
write result to ofile in qdp bench
|
11 months ago |
Constantin Fürst
|
457a3b520a
|
add barriers to the qdp benchmark
|
11 months ago |
Constantin Fürst
|
aa0867aa3a
|
reworking the qdp benchmark
|
11 months ago |
Constantin Fürst
|
ab217cb080
|
remove bad opt mode from last commit and instead try to improve default prefetching
|
11 months ago |
Constantin Fürst
|
2e0f637363
|
add optimal dsa caching mode
|
11 months ago |
Constantin Fürst
|
f16d67f67e
|
wait per iteration for caching mode
|
11 months ago |
Constantin Fürst
|
5c896dbf04
|
publish best run config for dsa-prefetch
|
11 months ago |
Constantin Fürst
|
635fb01c14
|
Merge branch 'master' of https://git.constantin-fuerst.com/constantin/bachelor-thesis
|
11 months ago |
Constantin Fürst
|
0f843d9282
|
potential fix for lifetime of cache data take two
|
11 months ago |
Constantin Fürst
|
85228f3997
|
Merge branch 'master' of https://git.constantin-fuerst.com/constantin/bachelor-thesis
|
11 months ago |
Constantin Fürst
|
74f659e5a4
|
potential fix for lifetime of cache data
|
11 months ago |
Constantin Fürst
|
2c042c7aa0
|
remove manually set build dir and unused build parameters
|
11 months ago |
Constantin Fürst
|
af4e3de80c
|
enable qdp testing for dram as baseline, pre-allocated hbm as peak and the already existing dsa-hbm-prefetch
|
11 months ago |
Constantin Fürst
|
40b3dcff57
|
commit first benchmark results for cacher in qdp
|
11 months ago |
Constantin Fürst
|
0d1b575bcd
|
increase thread count and correct mode printed to outfile
|
11 months ago |
Constantin Fürst
|
2c5425577e
|
correct missing mode-parameter
|
11 months ago |
Constantin Fürst
|
6a90fd6c5e
|
try to fix the barriers causing lock
|
11 months ago |
Constantin Fürst
|
25187e2995
|
add checks for output file success and modify their location
|
11 months ago |
Constantin Fürst
|
26d584eaa3
|
add benchmarking loop to qdp project
|
11 months ago |
Constantin Fürst
|
5026e1ae99
|
prepare qdp project for test run
|
11 months ago |
Constantin Fürst
|
d06f0d0c6c
|
re-evaluate peak perf benchs
|
11 months ago |
Constantin Fürst
|
e7d10fc2d2
|
use hardware path instead of automatic for cache
|
11 months ago |
Constantin Fürst
|
0cf9e91204
|
merge diverging branches from remote and local on crobat
|
11 months ago |
Constantin Fürst
|
1af027417b
|
re-run benchmark peakperf from n0
|
11 months ago |
Constantin Fürst
|
cf14cf34ac
|
subdivide the descriptors for peak perf allnodes into fromn0 and fromn1to15
|
11 months ago |
Constantin Fürst
|
fdc72df6de
|
add results for cpu performance and analysis thereof to chapter 3
|
11 months ago |
Constantin Fürst
|
9cd6a41205
|
add benchmark evaluations for software and new evaluations in pdf format
|
11 months ago |
Constantin Fürst
|
d73b1cea68
|
add brute sw path results
|
11 months ago |
Constantin Fürst
|
f2059d4d47
|
use 12 threads for each brute benchmark
|
11 months ago |
Constantin Fürst
|
439ec97c8e
|
remove smart sw copy, add brute sw copy bench
|
11 months ago |
Constantin Fürst
|
24f96247fc
|
re-run cpu peak perf for allnodes with software path correctly set
|
11 months ago |
Constantin Fürst
|
e9090fb482
|
redo benchmarks for allnodes cpu due to mistake in modifying the json files
|
11 months ago |
Constantin Fürst
|
317a74d164
|
benchmark software peak performance for smart and brute force from node 0 for thesis
|
11 months ago |
Constantin Fürst
|
3d83dbba80
|
redo all possible figures in scalable pdf and include these, provide longer image captions and apply some misc recommendations from andre
|
11 months ago |
Constantin Fürst
|
ba9dae8bde
|
export plots from the benchmarks as pdf for scalability and integration with thesis
|
11 months ago |
Constantin Fürst
|
56805a6ad3
|
review of everything written today, some rewording and mostly adding todos to anything that sticks out as less than optimal
|
11 months ago |
Constantin Fürst
|
b8ce6b3add
|
shorten one statement in the waitoncompletion nsd to make it slimmer and therefore easier to read
|
11 months ago |
Constantin Fürst
|
ed019a672e
|
remove inner-loop timings from the benchmark pseudocode structo, these are not evaluated and pollute the diagram
|
11 months ago |
Constantin Fürst
|
8af4f46d0b
|
add unit to y-axis-label for peakthroughput plots
|
11 months ago |
Constantin Fürst
|
594a2e62cf
|
update modified references to chapter 3 which have turned from sections into subsections
|
11 months ago |
Constantin Fürst
|
ada6d8811a
|
continue writing chapter 3 microbenchmarks
|
11 months ago |
Constantin Fürst
|
718ce39693
|
plot the throughput benchmark for source node 0 and destination nodes {8,11,12,15} only and as a bar plot for use in thesis
|
11 months ago |
Constantin Fürst
|
1ec7d438b2
|
rework the multithread benchmark plot to be more compact for use in the thesis
|
11 months ago |