Constantin Fürst
|
52a026805e
|
use a different notation for the glossary, which results in easier reading and a correct result
|
12 months ago |
Constantin Fürst
|
c0c75aa51b
|
use uniform colormap in the plots and use separate output folder for the plots
|
12 months ago |
Constantin Fürst
|
5f72404508
|
re-run the changed smart peak throughput benchmarks
|
1 year ago |
Constantin Fürst
|
1fca956a0a
|
update the peak throughput plotter to show the difference of smart and allnodes too
|
1 year ago |
Constantin Fürst
|
791184ff10
|
fix mistake in benchmark descriptors for smart peak performance
|
1 year ago |
Constantin Fürst
|
f809eb5847
|
rerun copy benchmark with smart assignment
|
1 year ago |
Constantin Fürst
|
6b9581b8d1
|
add results for the new 512mib internode hbm copy peak perf test
|
1 year ago |
Constantin Fürst
|
0748826fcd
|
use transfer size of 512mib for HBM intranode copy and modify plotter accordingly
|
1 year ago |
Constantin Fürst
|
60a5ba5120
|
refactor the benchmark plotters and submit newly plotted graphs
|
1 year ago |
Constantin Fürst
|
c0f2aa2b64
|
re-plot the benchmarks with the new data
|
1 year ago |
Constantin Fürst
|
1c7369f20e
|
add results for smart and allnodes peak throughput benchmark
|
1 year ago |
Constantin Fürst
|
1e55565072
|
remove previous benchmark results for peak performance
|
1 year ago |
Constantin Fürst
|
475b2d5b5a
|
provide two benchmarks for peak performance, one that is brute force and one that uses smart node assignment and therefore has lower utilization
|
1 year ago |
Constantin Fürst
|
9cef69c33f
|
add mdsa v3 benchmark results for peak performance
|
1 year ago |
Constantin Fürst
|
7ced0bce4c
|
fix an issue in the Python script that led to references being modified, which caused bad node settings
|
1 year ago |
Constantin Fürst
|
8787b441bc
|
add second type of multi dsa benchmark results
|
1 year ago |
Constantin Fürst
|
6d9002d1e7
|
use a different engine configuration depending on whether the copy type is intra-socket (all 4 engines on the socket) or inter-socket (src and destination engine, cross copy)
|
1 year ago |
Constantin Fürst
|
68a838f0d1
|
add final multi-dsa results
|
1 year ago |
Constantin Fürst
|
c92bb28d9a
|
add intermediate multi-dsa results
|
1 year ago |
Constantin Fürst
|
f11bb710ae
|
add intermediate multi-dsa results
|
1 year ago |
Constantin Fürst
|
7f7230197c
|
don't submit multiple copies - test takes too long and this has almost no effect at a work size of 1 GiB
|
1 year ago |
Constantin Fürst
|
3e102509a9
|
add intermediate multi-dsa results
|
1 year ago |
Constantin Fürst
|
584c5bdfc4
|
add intermediate multi-dsa results
|
1 year ago |
Constantin Fürst
|
e9807df09c
|
use all 8 engines for each copy task, as the engine location does not affect performance
|
1 year ago |
Constantin Fürst
|
5089936f30
|
prepare peak throughput plotter for multi-node results
|
1 year ago |
Constantin Fürst
|
db11eb60e6
|
add 4e results from copy peak perf bench
|
1 year ago |
Constantin Fürst
|
17264186a6
|
small changes to the plotter scripts for nicer display
|
1 year ago |
Constantin Fürst
|
1682b84fb4
|
use a batch size of 8 to check whether multiple engines can increase throughput
|
1 year ago |
Constantin Fürst
|
4598cedd40
|
re-run peak perf test
|
1 year ago |
Constantin Fürst
|
575ff8cf82
|
turn node -1 into node 7 again - this got messed up by a hastily written modification script
|
1 year ago |
Constantin Fürst
|
eb4ea5162d
|
fix bugs that were introduced by changes to the plotter scripts
|
1 year ago |
Constantin Fürst
|
cf675e37f5
|
don't pin the thread to hbm nodes but to the hbm src node minus 8 in peak perf benchmark
|
1 year ago |
Constantin Fürst
|
d3e8fec087
|
re-run engine location bench
|
1 year ago |
Constantin Fürst
|
5c3b008620
|
add results for peak, mt and submit benchmarks
|
1 year ago |
Constantin Fürst
|
788b2f25d3
|
add new results for engine location benchmark
|
1 year ago |
Constantin Fürst
|
808c8f3ae7
|
run the engine location benchmarks with a size of 1 GiB only 10 times
|
1 year ago |
Constantin Fürst
|
806f5f4f97
|
remove old benchmark results
|
1 year ago |
Constantin Fürst
|
099f454f19
|
streamline the plotters: all now use the file loop in main and have a function that processes one file into the dataset; also add the peakthroughput plotter and remove the defunct opt-submitmethod plotter
|
1 year ago |
Constantin Fürst
|
b37968dd3f
|
rename engine location benchmarks, modify plotter to support missing configuration files as the new cases are not universally applicable to all configurations
|
1 year ago |
Constantin Fürst
|
6cde7288e9
|
use total time in submitmethod benchmark too
|
1 year ago |
Constantin Fürst
|
a548d9afe5
|
restructure mtsubmit benchmarks
|
1 year ago |
Constantin Fürst
|
b7cae18b6d
|
restructure the engine location bench, correct and update the plotter to use new total time
|
1 year ago |
Constantin Fürst
|
405166cbe8
|
add peak perf benchmark descriptors
|
1 year ago |
Constantin Fürst
|
148c4c213a
|
re-run mtsubmit tests
|
1 year ago |
Constantin Fürst
|
fb9164ae89
|
add script to disable dsa and then load a new config
|
1 year ago |
Constantin Fürst
|
9886b20112
|
remove the multiple tests for mtsubmit and ensure that the wq will always receive the same elements to make the test fair
|
1 year ago |
Constantin Fürst
|
01850cf97b
|
add new mtsubmit test results
|
1 year ago |
Constantin Fürst
|
3964da0d7a
|
use ms10 instead of ms50, as ms50 with > 2 threads will overfill the wq of size 128 (max) and cause an error that the benchmark does not handle yet
|
1 year ago |
Constantin Fürst
|
f9b00a5b32
|
modify mtsubmit to measure both ssaw and ms50 submit methods
|
1 year ago |
Constantin Fürst
|
846fa9be43
|
re-run mtsubmit benchmarks for 4e
|
1 year ago |