This repository contains my bachelor's thesis and the associated TeX files, code snippets, and possibly more.
Topic: Data Movement in Heterogeneous Memories with Intel Data Streaming Accelerator
Constantin Fürst
d1cc3e3b0c
modification to qdp benchmark, returns to per-chunk barrier wait, uses userspace semaphore for one-way barrier from scan_b to aggr_j as scan_b should submit asap but aggr_j should wait on submission from scan_b, contains TODO for modifying code to support chunkcount not divisible by 2
11 months ago
evaluation-results
again, redo the perf-eval with reduced data size and load to prevent missing frames, the second
11 months ago
src
modification to qdp benchmark, returns to per-chunk barrier wait, uses userspace semaphore for one-way barrier from scan_b to aggr_j as scan_b should submit asap but aggr_j should wait on submission from scan_b, contains TODO for modifying code to support chunkcount not divisible by 2
11 months ago
.gitignore
add query driven prefetching code repository copy
11 months ago
CMakeLists.txt
reworking the qdp benchmark
11 months ago
README.md
prettify credit in the readme
11 months ago
bench_max.sh
add preliminary results, modify the launch script and provide macro-based selection of parameters for the three modes {dram,hbm,prefetch}
11 months ago