This contains my bachelors thesis and associated tex files, code snippets and maybe more.
Topic: Data Movement in Heterogeneous Memories with Intel Data Streaming Accelerator
You can not select more than 25 topics
Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
Constantin Fürst
1021197009
do not use the float-force parameter H but move to a more dynmic one (h!tb), also dont use Subsection anywhere instead use Section, add a small paragraph to the implementation chapter stating how we used the cache in qdp
11 months ago
..
evaluation-results
again, redo the perf-eval with reduced data size and load to prevent missing frames, the second
11 months ago
src
modification to qdp benchmark, returns to per-chunk barrier wait, uses userspace semaphore for one-way barrier from scan_b to aggr_j as scan_b should submit asap but aggr_j should wait on submission from scan_b, contains TODO for modifying code to support chunkcount not divisible by 2
11 months ago
.gitignore
add query driven prefetching code repository copy
11 months ago
CMakeLists.txt
reworking the qdp benchmark
11 months ago
README.md
prettify credit in the readme
11 months ago
bench_max.sh
add prelimianry results, modify the launch script and provide macro-based selection of parameters for the three modes {dram,hbm,prefetch}
11 months ago