349 Commits (3bfbeca21f3b1d5e6e9610b2a89e54d365345991)
 

Author SHA1 Message Date
Constantin Fürst 3bfbeca21f resize source and destination pointer holders properly before use and use path from template and not dml::software for cache flush in benchmark loop 11 months ago
Constantin Fürst ef805244ac use 4gib as size and again 1 aggrj thread for qdp bench 11 months ago
Constantin Fürst 3c90e24bc1 revert the cacher to allow load balancing control through the copy placement policy function which now selects on how many nodes the task is split again, and not just which nodes the task MAY run on (which was done experimentally) 11 months ago
Constantin Fürst ea978423c0 slightly modify the debug benchmark descriptor to contain multiple threads and also a batch 11 months ago
Constantin Fürst cae0fbb56e remove benchmark descriptor handling helper scripts 11 months ago
Constantin Fürst 24bdccd1e3 rewrite the benchmarker to not allocate the memory regions each iteration but before the test runs, also flush cache each iteration using dml-operation, also set dsa-device using the parameter to submit and not using libnuma assignment 11 months ago
Constantin Fürst e2c6fd8587 set iteration count for submission bench to 10 for the large and 100 for the small, also test 128mib which is over the size of the cache available on xeonmax (instead of 32mib) 11 months ago
Constantin Fürst 4f9abc911f make benchmark.hpp a cpp file to make it clear that it will have global variables 11 months ago
Constantin Fürst f905ee77eb wait less for task launch and dont write iterations complete out 11 months ago
Constantin Fürst 72fb3764fc modify benchmarker script to require less parameters and sit in root dir 11 months ago
Constantin Fürst c8b4f3d624 fix issues with benchmark.hpp 11 months ago
Constantin Fürst 1446d1575d set O3 for release and g3 for debug as build flags in cmake 11 months ago
Constantin Fürst 6e26739eb9 give unique name to brute cpu copy benchmark descriptors for identification in results 11 months ago
Constantin Fürst a4131a6b33 remove previous benchmark results as these will be redone 11 months ago
Constantin Fürst 8068fc3437 rewrite descriptors to match the format of the rewrite of bench from previous commit 11 months ago
Constantin Fürst fccc255aae rewrite the benchmark to meassure timings for the entire run of all threads, doing multiple sync-steps with the launch barrier as done in the qdp bench 11 months ago
Constantin Fürst d20b6dad93 remove all unused descriptors and begin rewriting submission descriptors 11 months ago
Constantin Fürst 9d60b57ed7 add date '2023' to cppreference bib entries 11 months ago
Constantin Fürst 45d7003755 fix some compilation-blocking syntax mistakes, improve usage of glossary with short descriptions, use glossary for titles and in figure captions with explicit long/short entries 11 months ago
Constantin Fürst d9c3f5a616 escape underscores anywhere found 11 months ago
Constantin Fürst 28775abb03 add entry for ddr-sdram 11 months ago
Constantin Fürst 84c88b8d8e add command for roman numeral generation 11 months ago
Constantin Fürst 9ae2cbb500 rewrite chapter 2 with clearer style and more consistent grammar, using knowledge gained from the reviews of the subsequent chapters 11 months ago
Constantin Fürst da563e443e reorder some bib entries and add cppreference.com as author of the reference articles 11 months ago
Constantin Fürst cb7b35f8f3 apply first set of recommendations from andre for chapter 5, covering up to the first paragraph of 5.2.1 11 months ago
Constantin Fürst a121e7e1e6 add unpublished dimes prefetching paper and microsoft reference on numa memory allocation to bib 11 months ago
Constantin Fürst 745d8f0eac update the futurework list in the notes for writing chapter 7 11 months ago
Constantin Fürst f2f5dd7791 begin writing evaluation chapter 11 months ago
Constantin Fürst aed88029ec apply recommendations from andre for rewriting chapter 4 in regards to grammar, timeform and word usage 11 months ago
Constantin Fürst 0d66a89678 apply recommendations from andre for rewriting chapter 3 in regards to grammar, timeform and word usage 11 months ago
Constantin Fürst a24feefba2 reformulate the section on hbm in chapter 2 11 months ago
Constantin Fürst 81527fdb6b commit current vampir config 11 months ago
Constantin Fürst b35f9978ae again, redo the perf-eval with reduced data size and load to prevent missing frames, the second 11 months ago
Constantin Fürst 18d5e62b80 again, redo the perf-eval with reduced data size and load to prevent missing frames 11 months ago
Constantin Fürst d63d8ac547 add redone flame graph 11 months ago
Constantin Fürst 69a3d2cef4 experimental implementation for tc-scanb > tc-aggrj, the second 11 months ago
Constantin Fürst 07fba8a5f0 experimental implementation for tc-scanb > tc-aggrj 11 months ago
Constantin Fürst d4122ba25a add updated config for prefetch from vampir 11 months ago
Constantin Fürst e4a0030049 fix prefetching subchunk indexing and adapt the weak access flag for join 11 months ago
Constantin Fürst 322eb43cbc add flag for early abort in access if no entry is present 11 months ago
Constantin Fürst f978d6b9b4 redo tests for prefetching 11 months ago
Constantin Fürst 972440d19f repair flags implementation 11 months ago
Constantin Fürst b3607329a6 add a flags-concept to cacher, add the option to select whether to handle pagefaults or not 11 months ago
Constantin Fürst 4b0770fc8e add result for try with strong waiting 11 months ago
Constantin Fürst 6dd7f80500 again, redo the perf flame graph 11 months ago
Constantin Fürst 29c49ca5b4 redo flame graph with correct stack information 11 months ago
Constantin Fürst 4cbe649601 generate flame graph for runtime of prefetch 11 months ago
Constantin Fürst 57e696297c provide new results for simpleq 11 months ago
Constantin Fürst bb1d20924a fix index clash for thread-and-group unique indexing 11 months ago
Constantin Fürst 0eca180e53 fix destination indexing in aggrj for happly 11 months ago