This contains my bachelors thesis and associated tex files, code snippets and maybe more. Topic: Data Movement in Heterogeneous Memories with Intel Data Streaming Accelerator
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
 
Constantin Fürst a963406f7c move mode selection to Configuration.hpp, adapt the CopyMethodPolicy-Function to return only src_node for task sizes under 16MiB which is now required to not cause high submission count which slows down small copies 11 months ago
..
benchmark-descriptors slightly modify the debug benchmark descriptor to contain multiple threads and also a batch 11 months ago
benchmark-plots re-evaluate peak perf benchs 11 months ago
benchmark-plotters re-evaluate peak perf benchs 11 months ago
cmake/modules correctly link the numactl library and provide find-module for it 1 year ago
configuration-files add script to disable dsa and then load a new config 1 year ago
util wait less for task launch and dont write iterations complete out 11 months ago
CMakeLists.txt set O3 for release and g3 for debug as build flags in cmake 11 months ago
benchmark.cpp resize source and destination pointer holders properly before use and use path from template and not dml::software for cache flush in benchmark loop 11 months ago
benchmarker.sh pretty-format the benchmarker script which got mangled from editing on vampir 11 months ago
main.cpp rewrite the benchmarker to not allocate the memory regions each iteration but before the test runs, also flush cache each iteration using dml-operation, also set dsa-device using the parameter to submit and not using libnuma assignment 11 months ago