This repository contains my bachelor's thesis and the associated TeX files, code snippets, and possibly more. Topic: Data Movement in Heterogeneous Memories with Intel Data Streaming Accelerator
 
Constantin Fürst c01eafedae refactor the cacher to reduce complexity: removes the access guarantees (relaxed, immediate, ...); uses the fact that other tasks will wait on an atomic value change of the cache pointer while it is nullptr to add the entry to the cache structure earlier, reducing the cost of two threads accessing a new entry at the same time; splits the offloading-cache.hpp file into two, one containing the data class (represents a cache entry and task) and the other containing the cacher itself 12 months ago
engine-location-bench run the engine location benchmarks with a size of 1 GiB only 10 times 1 year ago
mtsubmit-bench restructure mtsubmit benchmarks 1 year ago
peak-perf-allnodes use a transfer size of 512 MiB for HBM intranode copy and modify the plotter accordingly 1 year ago
peak-perf-smart fix mistake in benchmark descriptors for smart peak performance 1 year ago
submit-bench set repetition count of tests to 1000 again and replace the 1 GiB tests with 32 MiB 1 year ago
benchmarker.sh fix the benchmarker script by testing for the correct amount of parameters and obtaining the test name correctly 1 year ago
modifier.py fix mistake in benchmark descriptors for smart peak performance 1 year ago