This website works better with JavaScript.
Home
Explore
Help
Sign In
constantin
/
bachelor-thesis
Watch
1
Star
0
Fork
0
Code
Issues
Pull Requests
Projects
Releases
Wiki
Activity
251
Commits
1
Branch
0
Tags
92 MiB
Tree:
f16d67f67e
master
Branches
Tags
${ item.name }
Create tag
${ searchTerm }
Create branch
${ searchTerm }
from 'f16d67f67e'
${ noResults }
Commit Graph
28 Commits (f16d67f67ec31a26238462a2e4bbf64cba9fb7aa)
Author
SHA1
Message
Date
Constantin Fürst
cf14cf34ac
:wqsubdivide the descriptors for peak perf allnodes into fromn0 and fromn1to15
11 months ago
Constantin Fürst
f2059d4d47
use 12 threads for each brute benchmark
11 months ago
Constantin Fürst
439ec97c8e
remove smart sw copy, add brute sw copy bench
11 months ago
Constantin Fürst
e9090fb482
redo benchmarks for allnodes cpu due to mistake in modifying the json files
11 months ago
Constantin Fürst
317a74d164
benchmark software peak performance for smart and brute force from node 0 for thesis
11 months ago
Constantin Fürst
791184ff10
fix mistake in benchmark descriptors for smart peak performance
1 year ago
Constantin Fürst
0748826fcd
use transfer size of 512mib for HBM intranode copy and modify plotter accordingly
1 year ago
Constantin Fürst
475b2d5b5a
provide two benchmarks for peak performance, one that is brute force and one that uses smart node assignment and therefore lower utilization
1 year ago
Constantin Fürst
7ced0bce4c
fix an issue in the python script that lead to references being modified which caused bad node settings
1 year ago
Constantin Fürst
6d9002d1e7
use different engine configuration depending on whether intra socket (all 4 engines on the socket) or inter socket (src and destination engine, cross copy) is the copy type
1 year ago
Constantin Fürst
7f7230197c
dont submit multiple copies - test takes too long and this has almost no effect at work size of 1gib
1 year ago
Constantin Fürst
e9807df09c
use all 8 engines for each copy task, as the engine location does not affect performance
1 year ago
Constantin Fürst
5089936f30
prepare peak throughput plotter for multi-node results
1 year ago
Constantin Fürst
1682b84fb4
use a batch size of 8 to check whether multiple engines can increase throughput
1 year ago
Constantin Fürst
575ff8cf82
turn node -1 into node 7 again - this got messed up by a hastily written modification script
1 year ago
Constantin Fürst
cf675e37f5
dont pin the thread to hbm nodes but to the hbm src node minus 8 in peak perf benchmark
1 year ago
Constantin Fürst
808c8f3ae7
run the engine location benchmarks with size of 1gib only 10 times
1 year ago
Constantin Fürst
b37968dd3f
rename engine location benchmarks, modify plotter to support missing configuration files as the new cases are not universally applicable to all configurations
1 year ago
Constantin Fürst
a548d9afe5
restructure mtsubmit benchmarks
1 year ago
Constantin Fürst
b7cae18b6d
restructure the engine location bench, correct and update the plotter to use new total time
1 year ago
Constantin Fürst
405166cbe8
add peak perf benchmark descriptors
1 year ago
Constantin Fürst
9886b20112
remove the multiple tests for mtsubmit and ensure that the wq will always receive the same elements to make the test fair
1 year ago
Constantin Fürst
3964da0d7a
use ms10 instead of ms50 as ms50 with > 2 threads will overfill the wq of size 128 (max) and cause an error unhandled in the benchmark yet
1 year ago
Constantin Fürst
f9b00a5b32
modify mtsubmit to meassure both ssaw and ms50 submit methods
1 year ago
Constantin Fürst
d72e28adb8
make mtsubmit a multisubmit
1 year ago
Constantin Fürst
151ddd79d5
set repetition count of tests to 1000 again and replace the 1 GiB tests with 32 MiB
1 year ago
Constantin Fürst
21194b7fde
fix the benchmarker script by testing for the correct amount of parameters and obtaining the test name correctly
1 year ago
Constantin Fürst
d3a09181ce
move the existing results into a new folder and modify them to be only benchmark descriptors
1 year ago