This contains my bachelors thesis and associated tex files, code snippets and maybe more. Topic: Data Movement in Heterogeneous Memories with Intel Data Streaming Accelerator
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.
 
 
 
 
 
 
Constantin Fürst e2c6fd8587 set iteration count for submission bench to 10 for the large and 100 for the small, also test 128mib which is over the size of the cache available on xeonmax (instead of 32mib) 11 months ago
..
benchmark-descriptors set iteration count for submission bench to 10 for the large and 100 for the small, also test 128mib which is over the size of the cache available on xeonmax (instead of 32mib) 11 months ago
benchmark-plots re-evaluate peak perf benchs 11 months ago
benchmark-plotters re-evaluate peak perf benchs 11 months ago
benchmark-results modify benchmarker script to require less parameters and sit in root dir 11 months ago
cmake/modules correctly link the numactl library and provide find-module for it 1 year ago
configuration-files add script to disable dsa and then load a new config 1 year ago
util wait less for task launch and dont write iterations complete out 11 months ago
CMakeLists.txt set O3 for release and g3 for debug as build flags in cmake 11 months ago
benchmark.cpp make benchmark.hpp a cpp file to make it clear that it will have global variables 11 months ago
benchmarker.sh modify benchmarker script to require less parameters and sit in root dir 11 months ago
main.cpp rewrite the benchmark to meassure timings for the entire run of all threads, doing multiple sync-steps with the launch barrier as done in the qdp bench 11 months ago