Constantin Fürst
|
a963406f7c
|
move mode selection to Configuration.hpp, adapt the CopyMethodPolicy-Function to return only src_node for task sizes under 16MiB which is now required to not cause high submission count which slows down small copies
|
11 months ago |
Constantin Fürst
|
ef8286da17
|
unset the weak-wait-flag on deallocation for CacheData which is where completion guarantee is required, also extend some comments in CacheData
|
11 months ago |
Constantin Fürst
|
8dc3827676
|
do not check in the debug benchmark result
|
11 months ago |
Constantin Fürst
|
25451fa26a
|
pretty-format the benchmarker script which got mangled from editing on vampir
|
11 months ago |
Constantin Fürst
|
3bfbeca21f
|
resize source and destination pointer holders properly before use and use path from template and not dml::software for cache flush in benchmark loop
|
11 months ago |
Constantin Fürst
|
ef805244ac
|
use 4gib as size and again 1 aggrj thread for qdp bench
|
11 months ago |
Constantin Fürst
|
3c90e24bc1
|
revert the cacher to allow load balancing control through the copy placement policy function which now selects on how many nodes the task is split again, and not just which nodes the task MAY run on (which was done experimentally)
|
11 months ago |
Constantin Fürst
|
ea978423c0
|
slightly modify the debug benchmark descriptor to contain multiple threads and also a batch
|
11 months ago |
Constantin Fürst
|
cae0fbb56e
|
remove benchmark descriptor handling helper scripts
|
11 months ago |
Constantin Fürst
|
24bdccd1e3
|
rewrite the benchmarker to not allocate the memory regions each iteration but before the test runs, also flush cache each iteration using dml-operation, also set dsa-device using the parameter to submit and not using libnuma assignment
|
11 months ago |
Constantin Fürst
|
e2c6fd8587
|
set iteration count for submission bench to 10 for the large and 100 for the small, also test 128mib which is over the size of the cache available on xeonmax (instead of 32mib)
|
11 months ago |
Constantin Fürst
|
4f9abc911f
|
make benchmark.hpp a cpp file to make it clear that it will have global variables
|
11 months ago |
Constantin Fürst
|
f905ee77eb
|
wait less for task launch and dont write iterations complete out
|
11 months ago |
Constantin Fürst
|
72fb3764fc
|
modify benchmarker script to require less parameters and sit in root dir
|
11 months ago |
Constantin Fürst
|
c8b4f3d624
|
fix issues with benchmark.hpp
|
11 months ago |
Constantin Fürst
|
1446d1575d
|
set O3 for release and g3 for debug as build flags in cmake
|
11 months ago |
Constantin Fürst
|
6e26739eb9
|
give unique name to brute cpu copy benchmark descriptors for identification in results
|
11 months ago |
Constantin Fürst
|
a4131a6b33
|
remove previous benchmark results as these will be redone
|
11 months ago |
Constantin Fürst
|
8068fc3437
|
rewrite descriptors to match the format of the rewrite of bench from previous commit
|
11 months ago |
Constantin Fürst
|
fccc255aae
|
rewrite the benchmark to meassure timings for the entire run of all threads, doing multiple sync-steps with the launch barrier as done in the qdp bench
|
11 months ago |
Constantin Fürst
|
d20b6dad93
|
remove all unused descriptors and begin rewriting submission descriptors
|
11 months ago |
Constantin Fürst
|
9d60b57ed7
|
add date '2023' to cppreference bib entries
|
11 months ago |
Constantin Fürst
|
45d7003755
|
fix some compilation-blocking syntax mistakes, improve usage of glossary with short descriptions, use glossary for titles and in figure captions with explicit long/short entries
|
11 months ago |
Constantin Fürst
|
d9c3f5a616
|
escape underscores anywhere found
|
11 months ago |
Constantin Fürst
|
28775abb03
|
add entry for ddr-sdram
|
11 months ago |
Constantin Fürst
|
84c88b8d8e
|
add command for roman numeral generation
|
11 months ago |
Constantin Fürst
|
9ae2cbb500
|
rewrite chapter 2 with clearer style and more consistent grammar, using knowledge gained from the reviews of the subsequent chapters
|
11 months ago |
Constantin Fürst
|
da563e443e
|
reorder some bib entries and add cppreference.com as author of the reference articles
|
11 months ago |
Constantin Fürst
|
cb7b35f8f3
|
apply first set of recommendations from andre for chapter 5, covering up to the first paragraph of 5.2.1
|
11 months ago |
Constantin Fürst
|
a121e7e1e6
|
add unpublished dimes prefetching paper and microsoft reference on numa memory allocation to bib
|
11 months ago |
Constantin Fürst
|
745d8f0eac
|
update the futurework list in the notes for writing chapter 7
|
11 months ago |
Constantin Fürst
|
f2f5dd7791
|
begin writing evaluation chapter
|
11 months ago |
Constantin Fürst
|
aed88029ec
|
apply recommendations from andre for rewriting chapter 4 in regards to grammar, timeform and word usage
|
11 months ago |
Constantin Fürst
|
0d66a89678
|
apply recommendations from andre for rewriting chapter 3 in regards to grammar, timeform and word usage
|
11 months ago |
Constantin Fürst
|
a24feefba2
|
reformulate the section on hbm in chapter 2
|
11 months ago |
Constantin Fürst
|
81527fdb6b
|
commit current vampir config
|
11 months ago |
Constantin Fürst
|
b35f9978ae
|
again, redo the perf-eval with reduced data size and load to prevent missing frames, the second
|
11 months ago |
Constantin Fürst
|
18d5e62b80
|
again, redo the perf-eval with reduced data size and load to prevent missing frames
|
11 months ago |
Constantin Fürst
|
d63d8ac547
|
add redone flame graph
|
11 months ago |
Constantin Fürst
|
69a3d2cef4
|
experimental implementation for tc-scanb > tc-aggrj, the second
|
11 months ago |
Constantin Fürst
|
07fba8a5f0
|
experimental implementation for tc-scanb > tc-aggrj
|
11 months ago |
Constantin Fürst
|
d4122ba25a
|
add updated config for prefetch from vampir
|
11 months ago |
Constantin Fürst
|
e4a0030049
|
fix prefetching subchunk indexing and adapt the weak access flag for join
|
11 months ago |
Constantin Fürst
|
322eb43cbc
|
add flag for early abort in access if no entry is present
|
11 months ago |
Constantin Fürst
|
f978d6b9b4
|
redo tests for prefetching
|
11 months ago |
Constantin Fürst
|
972440d19f
|
repair flags implementation
|
11 months ago |
Constantin Fürst
|
b3607329a6
|
add a flags-concept to cacher, add the option to select whether to handle pagefaults or not
|
11 months ago |
Constantin Fürst
|
4b0770fc8e
|
add result for try with strong waiting
|
11 months ago |
Constantin Fürst
|
6dd7f80500
|
again, redo the perf flame graph
|
11 months ago |
Constantin Fürst
|
29c49ca5b4
|
redo flame graph with correct stack information
|
11 months ago |