This contains my bachelors thesis and associated tex files, code snippets and maybe more.
Topic: Data Movement in Heterogeneous Memories with Intel Data Streaming Accelerator
You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
Constantin Fürst
840f150cc1
remove the cacheaccess timing plot which we are not going to use, for this a flamegraph is more appropriate but this analysis is too much anyways
11 months ago
..
evaluation-results
change config to allow 64 threads for stage 1 and 32 for stage 2 in all benchmarks
11 months ago
plots
add table output to qdp result plotter which displays speedup compared to dram as baseline, redo the timing plots with the latest test results, add the speedup table
11 months ago
src
change config to allow 64 threads for stage 1 and 32 for stage 2 in all benchmarks
11 months ago
.gitignore
add query driven prefetching code repository copy
11 months ago
CMakeLists.txt
improve debug output for relwithdebinfo in qdp, fix filename for record perf script, add perf.svg with better debug info
11 months ago
README.md
prettify credit in the readme
11 months ago
bench_max.sh
add prelimianry results, modify the launch script and provide macro-based selection of parameters for the three modes {dram,hbm,prefetch}
11 months ago
plotter.py
add table output to qdp result plotter which displays speedup compared to dram as baseline, redo the timing plots with the latest test results, add the speedup table
11 months ago
record-perf.sh
set correct node in perf recording script
11 months ago