This repository contains my bachelor's thesis and the associated TeX files, code snippets, and possibly more.
Topic: Data Movement in Heterogeneous Memories with Intel Data Streaming Accelerator
Constantin Fürst
1b6c60c49b
benchmark copy throughput for using 1,2,4,8 dsas and remove brute cpu bench (we steal it from andre)
11 months ago
benchmark-descriptors
benchmark copy throughput for using 1,2,4,8 dsas and remove brute cpu bench (we steal it from andre)
11 months ago
benchmark-plots
manually add data from communication with andre to the cpu benchmark plotters
11 months ago
benchmark-plotters
benchmark copy throughput for using 1,2,4,8 dsas and remove brute cpu bench (we steal it from andre)
11 months ago
benchmark-results
redo benchmarks for pushpull
11 months ago
cmake/modules
correctly link the numactl library and provide find-module for it
1 year ago
configuration-files
add script to disable dsa and then load a new config
1 year ago
util
add option for internal repetitions to benchmarks which allows the small copies of 1kib to run long enough for the timings to become usable (goal is about 1s runtime for each iteration)
11 months ago
CMakeLists.txt
set O3 for release and g3 for debug as build flags in cmake
11 months ago
benchmark.cpp
add option for internal repetitions to benchmarks which allows the small copies of 1kib to run long enough for the timings to become usable (goal is about 1s runtime for each iteration)
11 months ago
benchmarker.sh
pretty-format the benchmarker script which got mangled from editing on vampir
11 months ago
main.cpp
rewrite the benchmarker to not allocate the memory regions each iteration but before the test runs, also flush cache each iteration using dml-operation, also set dsa-device using the parameter to submit and not using libnuma assignment
11 months ago