This contains my bachelors thesis and associated tex files, code snippets and maybe more. Topic: Data Movement in Heterogeneous Memories with Intel Data Streaming Accelerator
You can not select more than 25 topics Topics must start with a letter or number, can include dashes ('-') and can be up to 35 characters long.

43 lines
5.1 KiB

  1. \chapter{Introduction}
  2. \label{chap:intro}
  3. % Die Einleitung schreibt man zuletzt, wenn die Arbeit im Großen und
  4. % Ganzen schon fertig ist. (Wenn man mit der Einleitung beginnt - ein
  5. % häufiger Fehler - braucht man viel länger und wirft sie später doch
  6. % wieder weg). Sie hat als wesentliche Aufgabe, den Kontext für die
  7. % unterschiedlichen Klassen von Lesern herzustellen. Man muß hier die
  8. % Leser für sich gewinnen. Das Problem, mit dem sich die Arbeit befaßt,
  9. % sollte am Ende wenigsten in Grundzügen klar sein und dem Leser
  10. % interessant erscheinen. Das Kapitel schließt mit einer Übersicht über
  11. % den Rest der Arbeit. Meist braucht man mindestens 4 Seiten dafür, mehr
  12. % als 10 Seiten liest keiner.
  13. \begin{itemize}
  14. \item Provide an overview of the challenges posed by Non-Uniform Memory Architectures (NUMA) with diverse non-functional properties, such as varying throughput, in modern hardware design.
  15. \item Introduce the Intel Data Streaming Accelerator (Intel DSA) and its role in optimizing streaming data movement operations for various applications.
  16. \item
  17. \item Explain the concept of Query Driven Prefetching (\gls{qdp}) and its significance in optimizing database performance through intelligent prefetching.
  18. \item Clearly state the main objectives of thesis, emphasizing the analysis and characterization of the Intel DSA architecture, and application of dsa to qdp
  19. \item Outline the primary contributions of the work, including the implementation of a cache within \gls{qdp}, the performance and analysis of microbenchmarks
  20. \item Emphasize the significance of these contributions in addressing the challenges of prefetching in a heterogeneous memory system and providing practical insights for other researchers and developers.
  21. \item Provide an overview of the structure, outlining the main sections and their respective content.
  22. \end{itemize}
  23. Alongside Intel DSA, the concept of Query Driven Prefetching (\gls{qdp}) emerges as a strategic methodology for optimizing database performance through intelligent prefetching. This section elaborates on the principles of \gls{qdp}, emphasizing its significance in enhancing database query execution, particularly in heterogeneous memory systems. \par
  24. As a response to these challenges, this thesis navigates through the complex terrain of optimizing data movement operations. Central to this exploration is the introduction of the Intel Data Streaming Accelerator (Intel DSA). This groundbreaking technology plays a pivotal role in enhancing streaming data movement operations across various applications. Understanding the architecture and functionality of Intel DSA becomes paramount in the context of addressing the challenges presented by NUMA. \par
  25. The main objectives of this thesis are twofold. Firstly, a comprehensive analysis and characterization of the Intel DSA architecture is undertaken. This involves a deep dive into the intricacies of Intel DSA and its role in optimizing data movement operations. Secondly, the focus extends to the application of Intel DSA in the domain-specific context of \gls{qdp}. This thesis aims to explore how Intel DSA can be strategically utilized to address the challenges posed by heterogeneous memory systems, providing insights into the integration of data streaming acceleration with intelligent prefetching. \par
  26. This work introduces substantial contributions to the field. The implementation of a cache within the \gls{qdp} framework is a key highlight. Additionally, the thesis includes a detailed examination and analysis of microbenchmarks. These benchmarks serve as a practical guideline, offering insights for optimal application of Intel DSA in diverse scenarios. Aligned with the generic nature of caching, this implementation addresses the specific requirements of \gls{qdp}, offering a nuanced solution to the challenges of prefetching in heterogeneous memory systems. \par
  27. To facilitate a coherent exploration, this thesis is structured into distinct sections. The Technical Background section delves into essential elements, including High Bandwidth Memory (HBM), \gls{qdp}, Intel DSA, the programming interface, and the system setup and configuration. Following this, the Performance Microbenchmarks section meticulously examines benchmarking methodologies, specific benchmarks employed, and a detailed analysis of the results. \par
  28. The subsequent sections, Design and Implementation, unravel the practical aspects of the work. Detailed task descriptions, cache design intricacies within \gls{qdp}, and insights into locking mechanisms and accelerator usage are presented. The Evaluation section provides a comprehensive assessment of the implemented solutions and benchmarks. \par
  29. Finally, the Conclusion and Outlook section wraps up the thesis by summarizing findings, emphasizing conclusions, and offering a glimpse into potential future work and developments in the field. This structured approach ensures a logical flow, enabling readers to navigate seamlessly through the intricate exploration of Intel DSA, \gls{qdp}, and their applications in heterogeneous memory systems. \par
  30. %%% Local Variables:
  31. %%% TeX-master: "diplom"
  32. %%% End: