This repository contains my bachelor's thesis together with the associated TeX files, code snippets, and possibly more. Topic: Data Movement in Heterogeneous Memories with Intel Data Streaming Accelerator

cross-copy

  • for inter-node copies, the src-node-engine is only slightly faster than the dst-node-engine
  • for inter-socket copies, the src-node-engine is clearly faster than the dst-node-engine
  • using both engines in tandem can give about a 1.4x speedup over using the src-node-engine alone
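
To make the tandem case concrete, here is a minimal sketch of how one copy could be divided between the two engines. It is an idealised model and not the code used for the measurements: it assumes each engine sustains a fixed throughput and that the two do not slow each other down. The function names `split_copy` and `ideal_speedup`, and the throughput numbers in the usage note, are hypothetical.

```python
def split_copy(total_bytes: int, src_rate: int, dst_rate: int) -> tuple[int, int]:
    """Split a copy between the src-node and dst-node engine.

    Assumption: each engine gets a share proportional to its throughput,
    so that both finish at roughly the same time. Rates are integers in
    any common unit (for example MiB/s).
    """
    if total_bytes < 0 or src_rate <= 0 or dst_rate <= 0:
        raise ValueError("size must be non-negative and rates positive")
    # Integer arithmetic keeps the two shares summing exactly to total_bytes.
    src_bytes = total_bytes * src_rate // (src_rate + dst_rate)
    return src_bytes, total_bytes - src_bytes


def ideal_speedup(src_rate: float, dst_rate: float) -> float:
    """Speedup over the src-node engine alone in the contention-free model."""
    if src_rate <= 0 or dst_rate <= 0:
        raise ValueError("rates must be positive")
    return (src_rate + dst_rate) / src_rate
```

With made-up rates of 10 and 4, `ideal_speedup(10, 4)` evaluates to 1.4 and `split_copy(1400, 10, 4)` assigns 1000 bytes to the src-node-engine and 400 to the dst-node-engine. In this model two equally fast engines would reach 2x, so the model is an upper bound and real engines sharing a link will typically land below it.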