327 Commits (9ae2cbb5009915fa734e452f41e2540c22eb927b)
 

Author SHA1 Message Date
Constantin Fürst 3301b17497 write a chunk of implementation chapter 11 months ago
Constantin Fürst c19ffa975c reorder state chapter and add todos for the new sections, namely on hbm and query driven prefetching 11 months ago
Constantin Fürst e5b6230dc6 add todo to unfinished part of conclusion and add bullet points from implementation as futurework 11 months ago
Constantin Fürst 09a95f1e50 add todos to unfinished parts of the thesis 11 months ago
Constantin Fürst f910efd7e8 write section 4.2 on cache design 11 months ago
Constantin Fürst e4a681ac1e delete the copy-constructor for cache as copying it is undesired behaviour 11 months ago
Constantin Fürst e570a6fe69 reduce the line-count of the switch statement for dml::StatusToString by inlining return with case statements 11 months ago
Constantin Fürst 0fdf650fe4 improve the class-comments for Cache and CacheData, also free incomplete_cache_ if it has not been waited for (see comment on this) 11 months ago
Constantin Fürst 8ba716353a add Clear() function which forces flush on the entire Cache and remove non-warning/error status messages from the cacher 11 months ago
Constantin Fürst d7c5c55208 turn library into single-header mode 11 months ago
Constantin Fürst 4fa5ef6522 accept existing cache if the cached block is larger than the requested view 12 months ago
Constantin Fürst 6ab88595b7 add test for the cache-flush logic which was previously not tested 12 months ago
Constantin Fürst 7dfbed68fe handle allocation slightly different, introduce a separate function for cleaner code that does on-node memory allocation, first querry the available size and do not rely on numa_alloc_onnode to report nullptr if the size is not really available 12 months ago
Constantin Fürst 4ddd96adcb remove extra whitespace from output in main function 12 months ago
Constantin Fürst e3e17cec7b extend the main method of the small supplied test program to allow running on multiple threads 12 months ago
Constantin Fürst e5b96727cd add missing inline specifier to functions as this is header-only code 12 months ago
Constantin Fürst 9c06bd4fa9 add class-definition comments and clear some double-newlines 12 months ago
Constantin Fürst 53e05d096c add even more comments and remove an old code line that modified the reference counter of cache data from the outside 12 months ago
Constantin Fürst 52566fc13b print to cerr for bad states in the test-main 12 months ago
Constantin Fürst 46de3151a2 add a lot of comments to the code, also handle errors in the dml handlers gracefully 12 months ago
Constantin Fürst c01eafedae refactor the cacher to reduce complexity, removes the access guarantees (relaxed,immediate,...), uses the fact that other tasks will wait on atomic value change for the cache-pointer if it is nullptr to add the entry to cache structure earlier reducing cost of two threads accessing new entry at the same time, splits the offloading-cache.hpp file into two with one containing the data-class (represents a cache entry and task) and the other containing the cacher itself 12 months ago
Constantin Fürst 395d307310 fix an issue with the freeing of data in the cacher 12 months ago
Constantin Fürst f19c069b0f always perform copy from src and dst node, add another log output about the split 12 months ago
Constantin Fürst 5e30a370ce finish first stage of caching implementation and provide a rudimentary test function in the main 12 months ago
Constantin Fürst d396056230 provide first draft of implementations for the cachers functionality 12 months ago
Constantin Fürst 623366433b continue modifying the declarations for the cacher and providing some first definitions 12 months ago
Constantin Fürst f91cd3202f add project 'offloading cacher' and function declarations for the cacher in its header file 12 months ago
Constantin Fürst 3f5f5f267d note down bullet points for the content of chapter 3 (performance) 12 months ago
Constantin Fürst ef3058ed00 add missing tex files after restructure and rewrite the title page in english 12 months ago
Constantin Fürst f9e3e7e4df slight restructuring of the thesis, part of formulating the contents 12 months ago
Constantin Fürst 35ae279c3b remove todos and todo page from the tex files, remove test-texcode, add some structure points 12 months ago
Constantin Fürst f0273d526f extend the glossary entry for pcie dmr 12 months ago
Constantin Fürst 0c775fc0ce change citation style to ieee standard 12 months ago
Constantin Fürst b124d594bd improve positioning of figures for hw/sw architecture section of chapter 2 12 months ago
Constantin Fürst 1f055d84ed write section about dsa hw/sw architecture 12 months ago
Constantin Fürst e137137fbc add new glossary entries and references for chapter 2 12 months ago
Constantin Fürst f5e2a7df6b add checkibw makefile 12 months ago
Constantin Fürst cf2281f827 add dsa hw and sw block diagrams 12 months ago
Constantin Fürst 1ab7ceb955 import actual task description pdf 12 months ago
Constantin Fürst 52a026805e use different notation for glossary which results in easier reading and correct result 12 months ago
Constantin Fürst c0c75aa51b use uniform colormap in the plots and use separate output folder for the plots 12 months ago
Constantin Fürst 5f72404508 re run changed smart peak throughput benchmarks 1 year ago
Constantin Fürst 1fca956a0a update the peak throughput plotter to show the difference of smart and allnodes too 1 year ago
Constantin Fürst 791184ff10 fix mistake in benchmark descriptors for smart peak performance 1 year ago
Constantin Fürst f809eb5847 rerun copy benchmark with smart assignment 1 year ago
Constantin Fürst 6b9581b8d1 add results for the new 512mib internode hbm copy peak perf test 1 year ago
Constantin Fürst 0748826fcd use transfer size of 512mib for HBM intranode copy and modify plotter accordingly 1 year ago
Constantin Fürst 60a5ba5120 refactor the benchmark plotters and submit newly plotted graphs 1 year ago
Constantin Fürst c0f2aa2b64 re-plot the benchmarks with the new data 1 year ago
Constantin Fürst 1c7369f20e add results for smart and allnodes peak throughput benchmark 1 year ago