464 Commits (faa5114ba849d4a3005276bca04f8695a90ba802)
 

Author SHA1 Message Date
Constantin Fürst c6495b8b02 write introductory paragraph to design section 11 months ago
Constantin Fürst d1cc3e3b0c modification to qdp benchmark, returns to per-chunk barrier wait, uses userspace semaphore for one-way barrier from scan_b to aggr_j as scan_b should submit asap but aggr_j should wait on submission from scan_b, contains TODO for modifying code to support chunkcount not divisible by 2 11 months ago
Constantin Fürst bf79435ff0 add new benchmark plots from the rewritten microbench 11 months ago
Constantin Fürst 8ab5eb4902 finish adapting plotters to new result style, add division by thread count to the throughput plotters, adjust figure sizes to be small (larger font when scaled up in latex) 11 months ago
Constantin Fürst f3e89405a5 publish benchmark results from vampir for the redone microbench 11 months ago
Constantin Fürst 067a31e560 modify submission benchmark descriptors to have x10 internal repetitions 11 months ago
Constantin Fürst 326cf92af3 update benchmark plotters for changes made to benchmark and result format 11 months ago
Constantin Fürst 875098b258 remove plotters which are not in use anymore 11 months ago
Constantin Fürst ed77e57e5f remove benchmark descriptors for unused engine location benchmark 11 months ago
Constantin Fürst a216d96003 remove old benchmark plots 11 months ago
Constantin Fürst 8ac601fc07 add option for internal repetitions to benchmarks which allows the small copies of 1kib to run long enough for the timings to become usable (goal is about 1s runtime for each iteration) 11 months ago
Constantin Fürst a963406f7c move mode selection to Configuration.hpp, adapt the CopyMethodPolicy-Function to return only src_node for task sizes under 16MiB which is now required to not cause high submission count which slows down small copies 11 months ago
Constantin Fürst ef8286da17 unset the weak-wait-flag on deallocation for CacheData which is where completion guarantee is required, also extend some comments in CacheData 11 months ago
Constantin Fürst 8dc3827676 do not check in the debug benchmark result 11 months ago
Constantin Fürst 25451fa26a pretty-format the benchmarker script which got mangled from editing on vampir 11 months ago
Constantin Fürst 3bfbeca21f resize source and destination pointer holders properly before use and use path from template and not dml::software for cache flush in benchmark loop 11 months ago
Constantin Fürst ef805244ac use 4gib as size and again 1 aggrj thread for qdp bench 11 months ago
Constantin Fürst 3c90e24bc1 revert the cacher to allow load balancing control through the copy placement policy function which now selects on how many nodes the task is split again, and not just which nodes the task MAY run on (which was done experimentally) 11 months ago
Constantin Fürst ea978423c0 slightly modify the debug benchmark descriptor to contain multiple threads and also a batch 11 months ago
Constantin Fürst cae0fbb56e remove benchmark descriptor handling helper scripts 11 months ago
Constantin Fürst 24bdccd1e3 rewrite the benchmarker to not allocate the memory regions each iteration but before the test runs, also flush cache each iteration using dml-operation, also set dsa-device using the parameter to submit and not using libnuma assignment 11 months ago
Constantin Fürst e2c6fd8587 set iteration count for submission bench to 10 for the large and 100 for the small, also test 128mib which is over the size of the cache available on xeonmax (instead of 32mib) 11 months ago
Constantin Fürst 4f9abc911f make benchmark.hpp a cpp file to make it clear that it will have global variables 11 months ago
Constantin Fürst f905ee77eb wait less for task launch and dont write iterations complete out 11 months ago
Constantin Fürst 72fb3764fc modify benchmarker script to require less parameters and sit in root dir 11 months ago
Constantin Fürst c8b4f3d624 fix issues with benchmark.hpp 11 months ago
Constantin Fürst 1446d1575d set O3 for release and g3 for debug as build flags in cmake 11 months ago
Constantin Fürst 6e26739eb9 give unique name to brute cpu copy benchmark descriptors for identification in results 11 months ago
Constantin Fürst a4131a6b33 remove previous benchmark results as these will be redone 11 months ago
Constantin Fürst 8068fc3437 rewrite descriptors to match the format of the rewrite of bench from previous commit 11 months ago
Constantin Fürst fccc255aae rewrite the benchmark to meassure timings for the entire run of all threads, doing multiple sync-steps with the launch barrier as done in the qdp bench 11 months ago
Constantin Fürst d20b6dad93 remove all unused descriptors and begin rewriting submission descriptors 11 months ago
Constantin Fürst 9d60b57ed7 add date '2023' to cppreference bib entries 11 months ago
Constantin Fürst 45d7003755 fix some compilation-blocking syntax mistakes, improve usage of glossary with short descriptions, use glossary for titles and in figure captions with explicit long/short entries 11 months ago
Constantin Fürst d9c3f5a616 escape underscores anywhere found 11 months ago
Constantin Fürst 28775abb03 add entry for ddr-sdram 11 months ago
Constantin Fürst 84c88b8d8e add command for roman numeral generation 11 months ago
Constantin Fürst 9ae2cbb500 rewrite chapter 2 with clearer style and more consistent grammar, using knowledge gained from the reviews of the subsequent chapters 11 months ago
Constantin Fürst da563e443e reorder some bib entries and add cppreference.com as author of the reference articles 11 months ago
Constantin Fürst cb7b35f8f3 apply first set of recommendations from andre for chapter 5, covering up to the first paragraph of 5.2.1 11 months ago
Constantin Fürst a121e7e1e6 add unpublished dimes prefetching paper and microsoft reference on numa memory allocation to bib 11 months ago
Constantin Fürst 745d8f0eac update the futurework list in the notes for writing chapter 7 11 months ago
Constantin Fürst f2f5dd7791 begin writing evaluation chapter 11 months ago
Constantin Fürst aed88029ec apply recommendations from andre for rewriting chapter 4 in regards to grammar, timeform and word usage 11 months ago
Constantin Fürst 0d66a89678 apply recommendations from andre for rewriting chapter 3 in regards to grammar, timeform and word usage 11 months ago
Constantin Fürst a24feefba2 reformulate the section on hbm in chapter 2 11 months ago
Constantin Fürst 81527fdb6b commit current vampir config 11 months ago
Constantin Fürst b35f9978ae again, redo the perf-eval with reduced data size and load to prevent missing frames, the second 11 months ago
Constantin Fürst 18d5e62b80 again, redo the perf-eval with reduced data size and load to prevent missing frames 11 months ago
Constantin Fürst d63d8ac547 add redone flame graph 11 months ago