538 Commits (master)
 

Author SHA1 Message Date
Constantin Fürst 52a026805e use different notation for glossary which results in easier reading and correct result 12 months ago
Constantin Fürst c0c75aa51b use uniform colormap in the plots and use separate output folder for the plots 12 months ago
Constantin Fürst 5f72404508 re-run changed smart peak throughput benchmarks 1 year ago
Constantin Fürst 1fca956a0a update the peak throughput plotter to show the difference of smart and allnodes too 1 year ago
Constantin Fürst 791184ff10 fix mistake in benchmark descriptors for smart peak performance 1 year ago
Constantin Fürst f809eb5847 rerun copy benchmark with smart assignment 1 year ago
Constantin Fürst 6b9581b8d1 add results for the new 512mib internode hbm copy peak perf test 1 year ago
Constantin Fürst 0748826fcd use transfer size of 512mib for HBM intranode copy and modify plotter accordingly 1 year ago
Constantin Fürst 60a5ba5120 refactor the benchmark plotters and submit newly plotted graphs 1 year ago
Constantin Fürst c0f2aa2b64 re-plot the benchmarks with the new data 1 year ago
Constantin Fürst 1c7369f20e add results for smart and allnodes peak throughput benchmark 1 year ago
Constantin Fürst 1e55565072 remove previous benchmark results for peak performance 1 year ago
Constantin Fürst 475b2d5b5a provide two benchmarks for peak performance, one that is brute force and one that uses smart node assignment and therefore has lower utilization 1 year ago
Constantin Fürst 9cef69c33f add mdsa v3 benchmark results for peak performance 1 year ago
Constantin Fürst 7ced0bce4c fix an issue in the python script that led to references being modified, which caused bad node settings 1 year ago
Constantin Fürst 8787b441bc add second type of multi dsa benchmark results 1 year ago
Constantin Fürst 6d9002d1e7 use a different engine configuration depending on whether the copy type is intra socket (all 4 engines on the socket) or inter socket (src and destination engine, cross copy) 1 year ago
Constantin Fürst 68a838f0d1 add final multi-dsa results 1 year ago
Constantin Fürst c92bb28d9a add intermediate multi-dsa results 1 year ago
Constantin Fürst f11bb710ae add intermediate multi-dsa results 1 year ago
Constantin Fürst 7f7230197c don't submit multiple copies - the test takes too long and this has almost no effect at a work size of 1gib 1 year ago
Constantin Fürst 3e102509a9 add intermediate multi-dsa results 1 year ago
Constantin Fürst 584c5bdfc4 add intermediate multi-dsa results 1 year ago
Constantin Fürst e9807df09c use all 8 engines for each copy task, as the engine location does not affect performance 1 year ago
Constantin Fürst 5089936f30 prepare peak throughput plotter for multi-node results 1 year ago
Constantin Fürst db11eb60e6 add 4e results from copy peak perf bench 1 year ago
Constantin Fürst 17264186a6 small changes to the plotter scripts for nicer display 1 year ago
Constantin Fürst 1682b84fb4 use a batch size of 8 to check whether multiple engines can increase throughput 1 year ago
Constantin Fürst 4598cedd40 re-run peak perf test 1 year ago
Constantin Fürst 575ff8cf82 turn node -1 into node 7 again - this got messed up by a hastily written modification script 1 year ago
Constantin Fürst eb4ea5162d fix bugs that were introduced by changes to the plotter scripts 1 year ago
Constantin Fürst cf675e37f5 don't pin the thread to hbm nodes but to the hbm src node minus 8 in peak perf benchmark 1 year ago
Constantin Fürst d3e8fec087 re-run engine location bench 1 year ago
Constantin Fürst 5c3b008620 add results for peak, mt and submit benchmarks 1 year ago
Constantin Fürst 788b2f25d3 add new results for engine location benchmark 1 year ago
Constantin Fürst 808c8f3ae7 run the engine location benchmarks with size of 1gib only 10 times 1 year ago
Constantin Fürst 806f5f4f97 remove old benchmark results 1 year ago
Constantin Fürst 099f454f19 streamline the plotters: all now use the file-loop in main and have a function that processes one file into the dataset; also add the peakthroughput plotter and remove the defunct opt-submitmethod plotter 1 year ago
Constantin Fürst b37968dd3f rename engine location benchmarks, modify plotter to support missing configuration files as the new cases are not universally applicable to all configurations 1 year ago
Constantin Fürst 6cde7288e9 use total time in submitmethod benchmark too 1 year ago
Constantin Fürst a548d9afe5 restructure mtsubmit benchmarks 1 year ago
Constantin Fürst b7cae18b6d restructure the engine location bench, correct and update the plotter to use new total time 1 year ago
Constantin Fürst 405166cbe8 add peak perf benchmark descriptors 1 year ago
Constantin Fürst 148c4c213a re-run mtsubmit tests 1 year ago
Constantin Fürst fb9164ae89 add script to disable dsa and then load a new config 1 year ago
Constantin Fürst 9886b20112 remove the multiple tests for mtsubmit and ensure that the wq will always receive the same elements to make the test fair 1 year ago
Constantin Fürst 01850cf97b add new mtsubmit test results 1 year ago
Constantin Fürst 3964da0d7a use ms10 instead of ms50, as ms50 with > 2 threads will overfill the wq of size 128 (max) and cause an error that the benchmark does not handle yet 1 year ago
Constantin Fürst f9b00a5b32 modify mtsubmit to measure both ssaw and ms50 submit methods 1 year ago
Constantin Fürst 846fa9be43 re-run mtsubmit benchmarks for 4e 1 year ago