336 Commits (a4131a6b3346c03e98d762510f699fc079af16ec)
 

Author SHA1 Message Date
Constantin Fürst 5f72404508 re run changed smart peak throughput benchmarks 1 year ago
Constantin Fürst 1fca956a0a update the peak throughput plotter to show the difference of smart and allnodes too 1 year ago
Constantin Fürst 791184ff10 fix mistake in benchmark descriptors for smart peak performance 1 year ago
Constantin Fürst f809eb5847 rerun copy benchmark with smart assignment 1 year ago
Constantin Fürst 6b9581b8d1 add results for the new 512mib internode hbm copy peak perf test 1 year ago
Constantin Fürst 0748826fcd use transfer size of 512mib for HBM intranode copy and modify plotter accordingly 1 year ago
Constantin Fürst 60a5ba5120 refactor the benchmark plotters and submit newly plotted graphs 1 year ago
Constantin Fürst c0f2aa2b64 re-plot the benchmarks with the new data 1 year ago
Constantin Fürst 1c7369f20e add results for smart and allnodes peak throughput benchmark 1 year ago
Constantin Fürst 1e55565072 remove previous benchmark results for peak performance 1 year ago
Constantin Fürst 475b2d5b5a provide two benchmarks for peak performance, one that is brute force and one that uses smart node assignment and therefore lower utilization 1 year ago
Constantin Fürst 9cef69c33f add mdsa v3 benchmark results for peak performance 1 year ago
Constantin Fürst 7ced0bce4c fix an issue in the python script that lead to references being modified which caused bad node settings 1 year ago
Constantin Fürst 8787b441bc add second type of multi dsa benchmark results 1 year ago
Constantin Fürst 6d9002d1e7 use different engine configuration depending on whether intra socket (all 4 engines on the socket) or inter socket (src and destination engine, cross copy) is the copy type 1 year ago
Constantin Fürst 68a838f0d1 add final multi-dsa results 1 year ago
Constantin Fürst c92bb28d9a add intermediate multi-dsa results 1 year ago
Constantin Fürst f11bb710ae add intermediate multi-dsa results 1 year ago
Constantin Fürst 7f7230197c dont submit multiple copies - test takes too long and this has almost no effect at work size of 1gib 1 year ago
Constantin Fürst 3e102509a9 add intermediate multi-dsa results 1 year ago
Constantin Fürst 584c5bdfc4 add intermediate multi-dsa results 1 year ago
Constantin Fürst e9807df09c use all 8 engines for each copy task, as the engine location does not affect performance 1 year ago
Constantin Fürst 5089936f30 prepare peak throughput plotter for multi-node results 1 year ago
Constantin Fürst db11eb60e6 add 4e results from copy peak perf bench 1 year ago
Constantin Fürst 17264186a6 small changes to the plotter scripts for nicer display 1 year ago
Constantin Fürst 1682b84fb4 use a batch size of 8 to check whether multiple engines can increase throughput 1 year ago
Constantin Fürst 4598cedd40 re-run peak perf test 1 year ago
Constantin Fürst 575ff8cf82 turn node -1 into node 7 again - this got messed up by a hastily written modification script 1 year ago
Constantin Fürst eb4ea5162d fix bugs that were introduced by changes to the plotter scripts 1 year ago
Constantin Fürst cf675e37f5 dont pin the thread to hbm nodes but to the hbm src node minus 8 in peak perf benchmark 1 year ago
Constantin Fürst d3e8fec087 re-run engine location bench 1 year ago
Constantin Fürst 5c3b008620 add results for peak, mt and submit benchmarks 1 year ago
Constantin Fürst 788b2f25d3 add new results for engine location benchmark 1 year ago
Constantin Fürst 808c8f3ae7 run the engine location benchmarks with size of 1gib only 10 times 1 year ago
Constantin Fürst 806f5f4f97 remove old benchmark results 1 year ago
Constantin Fürst 099f454f19 modify plotters to a more streamlined state, all now use the file-loop in main and have a function that processes one file into the dataset, also adds the peakthroughput plotter and removes the defunct opt-submitmethod plotter 1 year ago
Constantin Fürst b37968dd3f rename engine location benchmarks, modify plotter to support missing configuration files as the new cases are not universally applicable to all configurations 1 year ago
Constantin Fürst 6cde7288e9 use total time in submitmethod benchmark too 1 year ago
Constantin Fürst a548d9afe5 restructure mtsubmit benchmarks 1 year ago
Constantin Fürst b7cae18b6d restructure the engine location bench, correct and update the plotter to use new total time 1 year ago
Constantin Fürst 405166cbe8 add peak perf benchmark descriptors 1 year ago
Constantin Fürst 148c4c213a re-run mtsubmit tests 1 year ago
Constantin Fürst fb9164ae89 add script to disable dsa and then load a new config 1 year ago
Constantin Fürst 9886b20112 remove the multiple tests for mtsubmit and ensure that the wq will always receive the same elements to make the test fair 1 year ago
Constantin Fürst 01850cf97b add new mtsubmit test results 1 year ago
Constantin Fürst 3964da0d7a use ms10 instead of ms50 as ms50 with > 2 threads will overfill the wq of size 128 (max) and cause an error unhandled in the benchmark yet 1 year ago
Constantin Fürst f9b00a5b32 modify mtsubmit to meassure both ssaw and ms50 submit methods 1 year ago
Constantin Fürst 846fa9be43 re-run mtsubmit benchmarks for 4e 1 year ago
Constantin Fürst 6f620d6461 re-run mtsubmit benchmarks for 1e 1 year ago
Constantin Fürst 59853ffc92 remove the mtsubmit performance plotter which was based on a false assumption, modify the mtsubmit cost plotter for the new task set size 1 year ago