BEGIN:VCALENDAR
VERSION:2.0
PRODID:-//Northeastern University College of Engineering - ECPv6.16.2//NONSGML v1.0//EN
CALSCALE:GREGORIAN
METHOD:PUBLISH
X-WR-CALNAME:Northeastern University College of Engineering
X-ORIGINAL-URL:https://coe.northeastern.edu
X-WR-CALDESC:Events for Northeastern University College of Engineering
REFRESH-INTERVAL;VALUE=DURATION:PT1H
X-Robots-Tag:noindex
X-PUBLISHED-TTL:PT1H
BEGIN:VTIMEZONE
TZID:America/New_York
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20230312T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20231105T060000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20240310T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20241103T060000
END:STANDARD
BEGIN:DAYLIGHT
TZOFFSETFROM:-0500
TZOFFSETTO:-0400
TZNAME:EDT
DTSTART:20250309T070000
END:DAYLIGHT
BEGIN:STANDARD
TZOFFSETFROM:-0400
TZOFFSETTO:-0500
TZNAME:EST
DTSTART:20251102T060000
END:STANDARD
END:VTIMEZONE
BEGIN:VEVENT
DTSTART;TZID=America/New_York:20240903T100000
DTEND;TZID=America/New_York:20240903T110000
DTSTAMP:20260603T051538Z
CREATED:20240820T175639Z
LAST-MODIFIED:20240820T175639Z
UID:45129-1725357600-1725361200@coe.northeastern.edu
SUMMARY:Yuhui Bao PhD Dissertation Defense
DESCRIPTION:Name:\nYuhui Bao \nTitle:\nA Design Methodology for Producing Highly-Adaptable and High-Performance Simulation Frameworks \nDate:\n9/3/2024 \nTime:\n10:00:00 AM\nCommittee Members:\nProf. David Kaeli (Advisor)\nProf. Ningfang Mi\nProf. Yifan Sun (William and Mary) \nAbstract:\nComputer architecture simulators play an essential role in the development and optimization of computer hardware. A variety of simulators have been developed to explore the design space of CPUs\, GPUs\, and custom accelerators. As GPUs continue to grow in popularity for accelerating demanding applications\, such as high-performance computing and machine learning\, GPU architects have been pushing the envelope of GPU performance in every new GPU generation. GPU vendors (e.g.\, NVIDIA and AMD) have been introducing new generations of GPU architectures and products with updated instruction set architectures (ISAs) and new microarchitectural features every 2-3 years. Modeling the state-of-the-art architecture is a crucial feature of GPU simulators\, which are used to characterize and accelerate challenging workloads\, facilitating performance evaluation and design exploration. However\, the effort required to design and construct an accurate and performant simulator is huge. Due to the rapid rate of innovation in GPU technology\, any simulator that is over-customized to capture the design of a specific architecture will quickly become outdated. Thus\, we need to develop a design methodology for simulators that can guard against this trend\, embracing future architectures. \nIn this dissertation\, we propose a design methodology for producing highly-adaptable and high-performance simulation frameworks. We aim to design simulators that offer high adaptability (the ability to accommodate future alterations or extensions)\, high performance\, and high fidelity. We leverage the Akita simulator framework to enable the modular and extensible design of various GPU components. \nTo fulfill the goal of high fidelity\, we design a set of microbenchmarks to evaluate individual GPU subsystems. We demonstrate how we follow our design methodology to achieve a highly-adaptable and accurate simulator\, NaviSim\, which provides the flexibility to support simulation of three different ISAs. To demonstrate the full utility of the NaviSim simulator\, we conduct a performance study of the impact of individual architectural features\, revealing the high flexibility and configurability of NaviSim. In addition\, we showcase how NaviSim’s high adaptability contributes to design space exploration\, offering solutions to enhance the performance of real-world demanding applications. \nFast simulation speed is one of the key requirements of any simulator. NaviSim is designed to support multi-threaded execution\, leveraging the parallel capabilities of today’s multi-core CPUs to enable parallel simulation. In this thesis\, we identify key performance bottlenecks in both serial and parallel simulation execution modes and optimize simulation speed. We also present lessons learned about efficient simulator design and provide guidance for future simulator developers.
URL:https://coe.northeastern.edu/event/yuhui-bao-phd-dissertation-defense/
END:VEVENT
END:VCALENDAR