This repository contains my bachelor's thesis and the associated TeX files, code snippets, and possibly more. Topic: Data Movement in Heterogeneous Memories with Intel Data Streaming Accelerator
Constantin Fürst a963406f7c move mode selection to Configuration.hpp; adapt the CopyMethodPolicy function to return only src_node for task sizes under 16 MiB, which is now required to avoid the high submission count that slows down small copies 11 months ago
..
engine-location-bench rewrite descriptors to match the format of the rewrite of bench from previous commit 11 months ago
mtsubmit-bench rewrite descriptors to match the format of the rewrite of bench from previous commit 11 months ago
peak-perf-allnodes rewrite descriptors to match the format of the rewrite of bench from previous commit 11 months ago
peak-perf-allnodes-cpu rewrite descriptors to match the format of the rewrite of bench from previous commit 11 months ago
peak-perf-brute-cpu give unique name to brute cpu copy benchmark descriptors for identification in results 11 months ago
peak-perf-smart rewrite descriptors to match the format of the rewrite of bench from previous commit 11 months ago
submit-bench set iteration count for submission bench to 10 for the large and 100 for the small; also test 128 MiB, which exceeds the cache available on Xeon Max (instead of 32 MiB) 11 months ago
copy-debug-n0ton0-cpu.json slightly modify the debug benchmark descriptor to contain multiple threads and also a batch 11 months ago
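
The latest commit message above describes a size-dependent copy policy: task sizes under 16 MiB are handled on the source node only, so that small copies are not slowed down by a high submission count. A minimal sketch of what such a policy could look like follows. The signature, the constant name `kSmallCopyThreshold`, and the behaviour for larger copies (splitting across source and destination node) are assumptions made for illustration; the actual function in Configuration.hpp may differ.

```cpp
#include <cstddef>
#include <vector>

// Threshold taken from the commit message: 16 MiB.
// The constant name is hypothetical.
constexpr std::size_t kSmallCopyThreshold = 16ull * 1024 * 1024;

// Hypothetical signature, not the repository's actual one.
// Returns the NUMA nodes that should each receive a share of the copy.
inline std::vector<int> CopyMethodPolicy(int src_node, int dst_node,
                                         std::size_t size_bytes) {
    if (size_bytes < kSmallCopyThreshold) {
        // Small copy: one submission on the source node only,
        // which keeps the submission count low.
        return {src_node};
    }
    // Assumption: larger copies are split across both nodes.
    return {src_node, dst_node};
}
```

Under these assumptions, `CopyMethodPolicy(0, 1, 4096)` would yield a single node, while a copy of exactly 16 MiB or more would yield both nodes.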