This contains my bachelors thesis and associated tex files, code snippets and maybe more. Topic: Data Movement in Heterogeneous Memories with Intel Data Streaming Accelerator
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

9 lines
391 B

  1. ## implemented
  2. - 1 to n engines per group
  3. - 1 to n threads running on one specific core / dsa engine
  4. - copy inside and across NUMA borders
  5. - cross-copy: 2 engines copying from their numa domain to the domain of the other
  6. - all with "packet sizes" of 1KiB, 2KiB, 4KiB, 8KiB, ..., 1GiB
  7. - all with both CPU and DSA for comparison
  8. ## missing
  9. - batch vs single submissions
  10. - effect of fence/drain