This contains my bachelors thesis and associated tex files, code snippets and maybe more. Topic: Data Movement in Heterogeneous Memories with Intel Data Streaming Accelerator
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

4 lines
237 B

  1. # cross-copy
  2. - for internode copy, the src-node-engine is only slightly faster than the dst
  3. - for intersocket copy, the src-node-engine is faster than dst-node-engine
  4. - using both in tandem can give about 1.4x speedup over only src-node