This repository contains my bachelor's thesis and the associated TeX files, code snippets, and possibly more.
Topic: Data Movement in Heterogeneous Memories with Intel Data Streaming Accelerator
Constantin Fürst
94669924c8
implement cache in aggrj for qdp
11 months ago
mtsubmit-bench
add option for internal repetitions to benchmarks which allows the small copies of 1kib to run long enough for the timings to become usable (goal is about 1s runtime for each iteration)
11 months ago
peak-perf-1dsa
benchmark copy throughput for using 1,2,4,8 dsas and remove brute cpu bench (we steal it from andre)
11 months ago
peak-perf-2dsa
benchmark copy throughput for using 1,2,4,8 dsas and remove brute cpu bench (we steal it from andre)
11 months ago
peak-perf-4dsa
benchmark copy throughput for using 1,2,4,8 dsas and remove brute cpu bench (we steal it from andre)
11 months ago
peak-perf-8cpu
benchmark copy throughput for using 1,2,4,8 dsas and remove brute cpu bench (we steal it from andre)
11 months ago
peak-perf-8dsa
benchmark copy throughput for using 1,2,4,8 dsas and remove brute cpu bench (we steal it from andre)
11 months ago
submit-bench
modify submission benchmark descriptors to have x10 internal repetitions
11 months ago
copy-debug-n0ton0-cpu.json
add option for internal repetitions to benchmarks which allows the small copies of 1kib to run long enough for the timings to become usable (goal is about 1s runtime for each iteration)
11 months ago
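
The internal-repetition option described in the commit message for mtsubmit-bench and copy-debug-n0ton0-cpu.json can be sketched roughly as follows. This is a minimal Python illustration of the idea, not the repository's benchmark code: the function names `timed_copy`, `repetitions_for_target`, and `calibrate_repetitions`, the probe count, and the use of a plain in-memory buffer copy in place of a DSA submission are all assumptions made for the example.

```python
import time


def timed_copy(size_bytes: int, repetitions: int) -> float:
    """Copy a buffer `repetitions` times and return the mean seconds per copy."""
    src = bytearray(size_bytes)
    dst = bytearray(size_bytes)
    start = time.perf_counter()
    for _ in range(repetitions):
        # A single small copy finishes too quickly to time reliably,
        # so the whole loop is timed and the result is averaged.
        dst[:] = src
    elapsed = time.perf_counter() - start
    return elapsed / repetitions


def repetitions_for_target(per_copy_seconds: float, target_seconds: float = 1.0) -> int:
    """Number of internal repetitions so one iteration runs for about `target_seconds`."""
    # Guard against a zero measurement from a coarse timer.
    per_copy_seconds = max(per_copy_seconds, 1e-9)
    return max(1, round(target_seconds / per_copy_seconds))


def calibrate_repetitions(size_bytes: int, target_seconds: float = 1.0,
                          probe_reps: int = 1000) -> int:
    """Probe the copy cost briefly, then derive the repetition count (hypothetical helper)."""
    per_copy = timed_copy(size_bytes, probe_reps)
    return repetitions_for_target(per_copy, target_seconds)


if __name__ == "__main__":
    reps = calibrate_repetitions(1024, target_seconds=1.0)
    print(f"1 KiB copy: about {reps} internal repetitions for a 1 s iteration")
```

Averaging over the timed loop keeps the reported per-copy time comparable across copy sizes, while the repetition count only controls how long each iteration runs.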