This contains my bachelors thesis and associated tex files, code snippets and maybe more. Topic: Data Movement in Heterogeneous Memories with Intel Data Streaming Accelerator
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
 
Constantin Fürst d1cc3e3b0c modification to qdp benchmark, returns to per-chunk barrier wait, uses userspace semaphore for one-way barrier from scan_b to aggr_j as scan_b should submit asap but aggr_j should wait on submission from scan_b, contains TODO for modifying code to support chunkcount not divisible by 2 11 months ago
..
plot-allnodes-cpu-throughput.pdf add new benchmark plots from the rewritten microbench 11 months ago
plot-allnodes-throughput.pdf add new benchmark plots from the rewritten microbench 11 months ago
plot-brute-cpu-throughput.pdf add new benchmark plots from the rewritten microbench 11 months ago
plot-mtsubmit.pdf add new benchmark plots from the rewritten microbench 11 months ago
plot-smart-throughput.pdf add new benchmark plots from the rewritten microbench 11 months ago
plot-submitmethod.pdf add new benchmark plots from the rewritten microbench 11 months ago