Abstract
In this talk, I will discuss important factors in designing a
high-performance processor core, giving a historical perspective
of where the industry has been and highlighting some of the
attributes of our latest Zen3 core.
Bio
Mike Clark is an AMD Corporate Fellow and is the Chief Architect
for x86 cores. He graduated from the University of Illinois in
1993 with a B.S. in computer engineering and started work on the
K5 processor at AMD. He served as chief architect of the Zen core
and has contributed to every x86 processor from K5 to the latest
Zen generation. A co-architect of the AMD64 ISA as well as the
AMD-V virtualization extension, he has 28 patents on these and
other technologies. He also received a master's degree in
computer engineering from the University of Texas in 2003 while
working at AMD.
Abstract
Critical sectors such as business, health, and economy, are
powered by data-driven decisions made using data exploration
tools. The efficiency of these tools, however, is mitigated by
increasing heterogeneity: On one hand, the diversity on data
formats and workloads requires either building task-specialized
systems or transforming workloads to match the expectations of a
single system, sacrificing expressiveness and structural
information. On the other hand, heterogeneity in hardware
platforms requires task specialization to the microarchitecture
of the device at hand, e.g., a CPU or a GPU. Relying on pre-set
workload expectations or microarchitectural parameters results in
long and convoluted critical execution paths. There is a tradeoff
between rigid, task- and device-optimized systems or inefficient,
general-purpose, hardware-oblivious systems.
To maintain optimal execution of diverse ad-hoc tasks on any hardware platform we need real-time intelligence, i.e., dynamic optimization of the critical code paths during execution, when all relevant information is available. Real-time intelligent systems learn and use information as the user's requests are executed to build optimized data access and filtering. I will show how a real-time intelligent database system specializes and self-optimizes itself to hardware, workload, and data at runtime, enabling next-generation database systems to exploit the capabilities of modern hardware advancements.
Bio
Anastasia Ailamaki is a Professor of Computer and Communication
Sciences at the École Polytechnique Fédérale
de Lausanne (EPFL) in Switzerland and the co-founder of RAW Labs
SA, a Swiss company developing real-time analytics
infrastructures for heterogeneous big data from multiple sources.
She earned a Ph.D. in Computer Science from the University of
Wisconsin–Madison in 2000. She received the 2019 ACM SIGMOD
Edgar F. Codd Innovations and the 2020 VLDB Women in Database
Research Award. She is also the recipient of an ERC Consolidator
Award (2013), the Finmeccanica endowed chair from the Computer
Science Department at Carnegie Mellon (2007), a European Young
Investigator Award from the European Science Foundation (2007),
an Alfred P. Sloan Research Fellowship (2005), an NSF CAREER
award (2002), and ten best-paper awards in database, storage, and
computer architecture conferences. She is an ACM fellow, an IEEE
fellow, the Laureate for the 2018 Nemitsas Prize in Computer
Science, and an elected member of the Swiss, the Belgian, the
Greek, and the Cypriot National Research Councils. She is a
member of the Academia Europaea and of the World Economic Forum
Expert Network.
The global online format of MICRO 2021 is bringing together researchers and practitioners from tens of counties and all continents. This provides us with a unique opportunity for a diverse panel of leading experts from the industry and academic across the globe to share thoughts about the present and future of microarchitecture research.
Sarita Adve
is the Richard T. Cheng Professor of Computer Science at the
University of Illinois at Urbana-Champaign. Her research
interests span the system stack, ranging from hardware to
applications. She co-developed the memory models for the C++ and
Java programming languages based on her early work on
data-race-free models. Recently, her group released ILLIXR
(Illinois Extended Reality testbed), the first open source
extended reality system. She is also known for her work on
heterogeneous systems and software-driven approaches for hardware
resiliency. She is a member of the American Academy of Arts and
Sciences, a fellow of the ACM and IEEE, and a recipient of the
ACM/IEEE-CS Ken Kennedy award, the Anita Borg Institute Women of
Vision award in innovation, the ACM SIGARCH Maurice Wilkes award,
and the University of Illinois campus award for excellence in
graduate student mentoring. As ACM SIGARCH chair, she co-founded
the CARES movement, winner of the CRA distinguished service
award, to address discrimination and harassment in Computer
Science research events. She received her Ph.D. from the
University of Wisconsin-Madison and her B.Tech. from the Indian
Institute of Technology, Bombay.
Koji Inoue
received the B.E. and M.E. degrees in computer science
from Kyushu Institute of Technology, Japan in 1994 and 1996,
respectively. He received the Ph.D. degree in Department of
Computer Science and Communication Engineering, Graduate School
of Information Science and Electrical Engineering, Kyushu
University, Japan in 2001. In 1999, he joined Halo LSI Design
& Technology, Inc., NY, as a circuit designer. He is
currently a professor of the Department of Advanced Information
Technology, Kyushu University. His research interests include
power-aware computing, high-performance computing, secure
computing, nano-photonic computing, and superconductor computing.
Onur Mutlu
is a Professor of Computer Science at ETH Zurich. He is also a
faculty member at Carnegie Mellon University, where he previously
held the Strecker Early Career Professorship. His current broader
research interests are in computer architecture, systems,
hardware security, and bioinformatics. A variety of techniques
he, along with his group and collaborators, has invented over the
years have influenced industry and have been employed in
commercial microprocessors and memory/storage systems. He
obtained his PhD and MS in ECE from the University of Texas at
Austin and BS degrees in Computer Engineering and Psychology from
the University of Michigan, Ann Arbor. He started the Computer
Architecture Group at Microsoft Research (2006-2009), and held
various product and research positions at Intel Corporation,
Advanced Micro Devices, VMware, and Google. He received the IEEE
Computer Society Edward J. McCluskey Technical Achievement Award,
the ACM SIGARCH Maurice Wilkes Award, the inaugural IEEE Computer
Society Young Computer Architect Award, the inaugural Intel Early
Career Faculty Award, US National Science Foundation CAREER
Award, Carnegie Mellon University Ladd Research Award, faculty
partnership awards from various companies, and a healthy number
of best paper or "Top Pick" paper recognitions at various
computer systems, architecture, and hardware security venues. He
is an ACM Fellow "for contributions to computer architecture
research, especially in memory systems", IEEE Fellow for
"contributions to computer architecture research and practice",
and an elected member of the Academy of Europe (Academia
Europaea).
Per Stenström
is a professor of computer engineering at Chalmers University of
Technology, Sweden. His research interests are in Computer
Architecture. He has authored or co-authored four textbooks and
about 200 publications and 20 patents in this area. He is known
for his contributions to high-performance memory systems which
has awarded him a Fellow of the ACM and the IEEE among other
awards. He has acted as editor-in-chief and program chair of
prestigious scientific journals and conferences. He has been
program chair/co-chair of the IEEE/ACM Symposium on Computer
Architecture, the IEEE High-Performance Computer Architecture
Symposium, the IEEE Parallel and Distributed Processing
Symposium, ACM International Conference on Supercomputing and
IEEE/ACM PACT. He is a member of the Royal Swedish Academy of
Engineering Sciences, Academia Europaea and the Royal Spanish
Academy of Engineering Science.
Sreenivas Subramoney
is currently a Senior Principal Engineer at Intel Corporation
where he heads the Processor Architecture Research (PAR) Lab at
Intel Labs. PAR Lab leads research into futuristic
high-performance and highly-secure CPUs to extend Intel's
general-purpose compute leadership, and into next-generation
processor architectures for AI and heterogeneous memory
architectures. In prior roles, Sreenivas led the performance
architecture of multiple generations of Intel's flagship
microprocessor client and server CPUs, co-developed Intel's
first Java Virtual Machine and was a key developer of the 64-bit
Linux kernel for Intel's Itanium product family. Sree received
his B.Tech. in Computer Science and Engineering from IIT Madras,
India in 1995, and his M.S. degree in Computer Engineering at
The Ohio State University, USA in 1996 and has worked for Intel
since then.
Steve Keckler
is the Vice President of Architecture
Research at NVIDIA and Adjunct Professor of Computer Science at
the University of Texas at Austin, where he served as a
full-time faculty member (full professor with tenure) from 1998
to 2012. At NVIDIA, Dr. Keckler focuses on parallel,
energy-efficient architectures that span mobile through
supercomputing platforms. He is a Fellow of the ACM, a Fellow of
the IEEE, an Alfred P. Sloan Research Fellow, and a recipient of
the NSF CAREER award, the ACM Grace Murray Hopper award, the
President's Associates Teaching Excellence Award at UT-Austin,
and the Edith and Peter O'Donnell award for Engineering. He
earned a B.S. in Electrical Engineering from Stanford University
and an M.S. and a Ph.D. in Computer Science from the
Massachusetts Institute of Technology.
Abstract
The compute and memory demands from state-of-the-art neural
networks have increased several orders of magnitude in just the
last couple of years, and there's no end in sight. Traditional
forms of scaling chip performance are necessary but far from
sufficient to run the ML models of the future. In addition to the
chip, end-to-end system and software co-design is the only way to
satisfy the performance demand. It requires vertical design
across the entire technology stack: from the chip architecture,
to the system and cluster design, through the compiler and
software, and even unlocking the flexibility to rethink the
neural network algorithms themselves.
In this talk, we will explore the fundamental properties of neural networks and why they are not well served by traditional architectures. We will examine how co-design can relax the traditional boundaries between technologies and enable designs specialized for neural networks with new architectural capabilities and performance. This co-design approach enables innovations such as wafer-scale chips, core sparse datapaths, specialized memories and interconnects, novel software mappings and execution models, and highly efficient sparse neural networks. We will explore this rich new design space using the Cerebras architecture as a case study, highlighting design principles and tradeoffs that enable the ML models of the future.
Bio
Sean Lie is co-founder and Chief Hardware Architect at Cerebras
Systems, which builds high performance ML accelerators. Prior to
Cerebras, Sean was a Fellow and Chief Data Center Architect at
AMD where he was responsible for the architecture of the SeaMicro
line of distributed servers. He holds a BS and MEng in Electrical
Engineering and Computer Science from MIT. Sean's primary
interests are in high performance computer architecture and
hardware/software codesign in areas including transactional
memory, networking, storage, and ML accelerators.
Best Paper Award
Student Research Competition Winners
MICRO Hall of Fame
MICRO Test of Time Award
B. Ramakrishna Rau Award