3rd natESM Training Workshop
from
Tuesday, 5 November 2024 (00:00)
to
Wednesday, 6 November 2024 (16:00)
Monday, 4 November 2024
Tuesday, 5 November 2024
08:15
Bus shuttle
Bus shuttle
08:15 - 08:30
Room: Rotunde
08:30
Arrival, registration and welcome coffee
Arrival, registration and welcome coffee
08:30 - 09:00
Room: Rotunde
09:00
Welcome!
-
Sabine Grießbach
Welcome!
Sabine Grießbach
09:00 - 09:15
Room: Rotunde
Welcome, agenda, house keeping
09:15
GPUs for Exascale: Introduction to the JUPITER System and its GPUs
-
Andreas Herten
(
+Team
)
GPUs for Exascale: Introduction to the JUPITER System and its GPUs
Andreas Herten
(
+Team
)
09:15 - 10:45
Room: Rotunde
With JUPITER, Europe's first exascale system is right on the doorstep. The system features two modules, a CPU-centric JUPITER Cluster and a highly-scalable JUPITER Booster, using nearly 24 000 GPUs for 1 EFLOP/s of sustained HPL performance. The talk will introduce the JUPITER system design, the current status, and key defining features of the GPU technology selected to enable this computational milestone in Europe.
10:45
Coffee break
Coffee break
10:45 - 11:15
Room: Rotunde
11:15
Parallel Performance Analysis and Score-P Basics
-
Markus Geimer
Parallel Performance Analysis and Score-P Basics
Markus Geimer
11:15 - 12:15
Room: Rotunde
To effectively harness the computing capabilities of todays and future supercomputing systems, performance analysis and optimization should be a regular activity during scientific software development. Instead of using do-it-yourself solutions usually based on coarse-grained timers (e.g., time per timestep or solver iteration), developers of scientific code bases can resort to a variety of spezialized tools that have been specifically developed to assist them with this task. In this part of the workshop, we will introduce the open-source tools Score-P and Cube, and explore their usage and capabilities with a number of hands-on exercises.
12:15
Lunch break
Lunch break
12:15 - 13:30
13:30
Tutorial: Parallel Performance Analysis and Score-P Basics
Tutorial: Parallel Performance Analysis and Score-P Basics
13:30 - 15:30
Room: Rotunde
In this first session, we will introduce the key concepts of parallel performance analysis using Score-P. After introducing some basic terminology and the tools ecosystem centered around the Score-P instrumentation and measurement system, the general workflow of using Score-P to collect and the Cube graphical user interface to examine application profiles will be explained via hands-on exercises with a smaller-sized benchmark code.
15:30
Coffee break
Coffee break
15:30 - 16:00
Room: Rotunde
16:00
Tutorial: Applying Score-P to an ESM Code
Tutorial: Applying Score-P to an ESM Code
16:00 - 17:15
Room: Rotunde
The goal of this second session is to intensify the knowledge gained during the first session using a series of hands-on exercises with a production ESM application. These cover basic performance analysis using collected profiles, as well as a cross-experiment scalability analysis. If time permits, the challenges in examining coupled MPMD simulations will also be addressed.
17:15
Profiling ICON: Facing the Complex Realities
-
Markus Geimer
Profiling ICON: Facing the Complex Realities
Markus Geimer
17:15 - 17:45
Room: Rotunde
To wrap up, we will present an experience report on applying Score-P to the ICON weather and climate model. We will summarize the steps taken, the challenges we encountered, and how they have been addressed.
17:45
Wrap-up Performance Analysis
Wrap-up Performance Analysis
17:45 - 18:00
Room: Rotunde
18:15
Transfer
Transfer
18:15 - 18:55
19:00
Dinner
Dinner
19:00 - 21:30
Wednesday, 6 November 2024
08:15
Bus shuttle
Bus shuttle
08:15 - 08:35
Room: Rotunde
08:35
Arrival
Arrival
08:35 - 09:00
Room: Rotunde
09:00
Taming the Beasts: A Practical Overview of GPU Programming Models
-
Andreas Herten
(
Forschungszentrum Jülich
)
Taming the Beasts: A Practical Overview of GPU Programming Models
Andreas Herten
(
Forschungszentrum Jülich
)
09:00 - 09:45
Room: Rotunde
JUPITER will utilize nearly 24 000 NVIDIA GPUs to enter the Exascale Era. While CUDA is the native programming model for NVIDIA GPUs, there are alternatives which can offer higher productivity or more portability, like OpenACC, OpenMP, or Kokkos. This tutorial will present the relevant programming models and offer exercises to showcase the respective strengths.
09:45
Fast-Track to High Performance: Exploring OpenACC
Fast-Track to High Performance: Exploring OpenACC
09:45 - 10:45
Room: Rotunde
10:45
Coffee break
Coffee break
10:45 - 11:05
Room: Rotunde
11:05
Fast-Track to High Performance: Exploring CUDA
-
Andreas Herten + Team
Fast-Track to High Performance: Exploring CUDA
Andreas Herten + Team
11:05 - 12:30
Room: Rotunde
12:30
Lunch break
Lunch break
12:30 - 13:30
13:30
Fast-Track to High Performance: Exploring Kokkos
-
Andreas Herten +Team
(
+Team
)
Fast-Track to High Performance: Exploring Kokkos
Andreas Herten +Team
(
+Team
)
13:30 - 14:45
Room: Rotunde
14:45
Two ESM Experiences of Performance Portability: Born Into It or Maturing Into It
-
Daniel Caviedes-Voullieme
Two ESM Experiences of Performance Portability: Born Into It or Maturing Into It
Daniel Caviedes-Voullieme
14:45 - 15:15
Room: Rotunde
15:15
Wrap-up + End of Workshop
-
Sabine Grießbach
Wrap-up + End of Workshop
Sabine Grießbach
15:15 - 15:30
Room: Rotunde