Constantin Fürst
|
24bdccd1e3
|
rewrite the benchmarker to not allocate the memory regions each iteration but before the test runs, also flush cache each iteration using dml-operation, also set dsa-device using the parameter to submit and not using libnuma assignment
|
11 months ago |
Constantin Fürst
|
e2c6fd8587
|
set iteration count for submission bench to 10 for the large and 100 for the small, also test 128mib which is over the size of the cache available on xeonmax (instead of 32mib)
|
11 months ago |
Constantin Fürst
|
4f9abc911f
|
make benchmark.hpp a cpp file to make it clear that it will have global variables
|
11 months ago |
Constantin Fürst
|
f905ee77eb
|
wait less for task launch and dont write iterations complete out
|
11 months ago |
Constantin Fürst
|
72fb3764fc
|
modify benchmarker script to require less parameters and sit in root dir
|
11 months ago |
Constantin Fürst
|
c8b4f3d624
|
fix issues with benchmark.hpp
|
11 months ago |
Constantin Fürst
|
1446d1575d
|
set O3 for release and g3 for debug as build flags in cmake
|
11 months ago |
Constantin Fürst
|
6e26739eb9
|
give unique name to brute cpu copy benchmark descriptors for identification in results
|
11 months ago |
Constantin Fürst
|
a4131a6b33
|
remove previous benchmark results as these will be redone
|
11 months ago |
Constantin Fürst
|
8068fc3437
|
rewrite descriptors to match the format of the rewrite of bench from previous commit
|
11 months ago |
Constantin Fürst
|
fccc255aae
|
rewrite the benchmark to meassure timings for the entire run of all threads, doing multiple sync-steps with the launch barrier as done in the qdp bench
|
11 months ago |
Constantin Fürst
|
d20b6dad93
|
remove all unused descriptors and begin rewriting submission descriptors
|
11 months ago |
Constantin Fürst
|
9d60b57ed7
|
add date '2023' to cppreference bib entries
|
11 months ago |
Constantin Fürst
|
45d7003755
|
fix some compilation-blocking syntax mistakes, improve usage of glossary with short descriptions, use glossary for titles and in figure captions with explicit long/short entries
|
11 months ago |
Constantin Fürst
|
d9c3f5a616
|
escape underscores anywhere found
|
11 months ago |
Constantin Fürst
|
28775abb03
|
add entry for ddr-sdram
|
11 months ago |
Constantin Fürst
|
84c88b8d8e
|
add command for roman numeral generation
|
11 months ago |
Constantin Fürst
|
9ae2cbb500
|
rewrite chapter 2 with clearer style and more consistent grammar, using knowledge gained from the reviews of the subsequent chapters
|
11 months ago |
Constantin Fürst
|
da563e443e
|
reorder some bib entries and add cppreference.com as author of the reference articles
|
11 months ago |
Constantin Fürst
|
cb7b35f8f3
|
apply first set of recommendations from andre for chapter 5, covering up to the first paragraph of 5.2.1
|
11 months ago |
Constantin Fürst
|
a121e7e1e6
|
add unpublished dimes prefetching paper and microsoft reference on numa memory allocation to bib
|
11 months ago |
Constantin Fürst
|
745d8f0eac
|
update the futurework list in the notes for writing chapter 7
|
11 months ago |
Constantin Fürst
|
f2f5dd7791
|
begin writing evaluation chapter
|
11 months ago |
Constantin Fürst
|
aed88029ec
|
apply recommendations from andre for rewriting chapter 4 in regards to grammar, timeform and word usage
|
11 months ago |
Constantin Fürst
|
0d66a89678
|
apply recommendations from andre for rewriting chapter 3 in regards to grammar, timeform and word usage
|
11 months ago |
Constantin Fürst
|
a24feefba2
|
reformulate the section on hbm in chapter 2
|
11 months ago |
Constantin Fürst
|
81527fdb6b
|
commit current vampir config
|
11 months ago |
Constantin Fürst
|
b35f9978ae
|
again, redo the perf-eval with reduced data size and load to prevent missing frames, the second
|
11 months ago |
Constantin Fürst
|
18d5e62b80
|
again, redo the perf-eval with reduced data size and load to prevent missing frames
|
11 months ago |
Constantin Fürst
|
d63d8ac547
|
add redone flame graph
|
11 months ago |
Constantin Fürst
|
69a3d2cef4
|
experimental implementation for tc-scanb > tc-aggrj, the second
|
11 months ago |
Constantin Fürst
|
07fba8a5f0
|
experimental implementation for tc-scanb > tc-aggrj
|
11 months ago |
Constantin Fürst
|
d4122ba25a
|
add updated config for prefetch from vampir
|
11 months ago |
Constantin Fürst
|
e4a0030049
|
fix prefetching subchunk indexing and adapt the weak access flag for join
|
11 months ago |
Constantin Fürst
|
322eb43cbc
|
add flag for early abort in access if no entry is present
|
11 months ago |
Constantin Fürst
|
f978d6b9b4
|
redo tests for prefetching
|
11 months ago |
Constantin Fürst
|
972440d19f
|
repair flags implementation
|
11 months ago |
Constantin Fürst
|
b3607329a6
|
add a flags-concept to cacher, add the option to select whether to handle pagefaults or not
|
11 months ago |
Constantin Fürst
|
4b0770fc8e
|
add result for try with strong waiting
|
11 months ago |
Constantin Fürst
|
6dd7f80500
|
again, redo the perf flame graph
|
11 months ago |
Constantin Fürst
|
29c49ca5b4
|
redo flame graph with correct stack information
|
11 months ago |
Constantin Fürst
|
4cbe649601
|
generate flame graph for runtime of prefetch
|
11 months ago |
Constantin Fürst
|
57e696297c
|
provide new results for simpleq
|
11 months ago |
Constantin Fürst
|
bb1d20924a
|
fix index clash for thread-and-group unique indexing
|
11 months ago |
Constantin Fürst
|
0eca180e53
|
fix destination indexing in aggrj for happly
|
11 months ago |
Constantin Fürst
|
5e8f3e05e3
|
fix chunk indexing in scanb and refactor result calculation
|
11 months ago |
Constantin Fürst
|
c2b9e6656d
|
fix chunk selection in scanb, use the dataptr in aggrj complex mode, export some functions to src/utils/BenchmarkHelpers.cpp
|
11 months ago |
Constantin Fürst
|
845e812ca7
|
set the correct sum check which was inverted by querry type
|
11 months ago |
Constantin Fürst
|
abcb9a4b2e
|
extend modestring to contain query type
|
11 months ago |
Constantin Fürst
|
e4ed4ac5b9
|
correct and minimize subchunking implementation which now is only allowed in scanb
|
11 months ago |