Using NVIDIA Index in ALYA MultiPhysics Christopher Lux (NV), Vishal - - PowerPoint PPT Presentation
Using NVIDIA Index in ALYA MultiPhysics Christopher Lux (NV), Vishal - - PowerPoint PPT Presentation
www.bsc.es Interactive HPC: Large Scale In-Situ Visualization Using NVIDIA Index in ALYA MultiPhysics Christopher Lux (NV), Vishal Mehta (BSC) and Marc Nienhaus (NV) May 8 th 2017 Barcelona Supercomputing Center Marenostrum 4 13.7
Barcelona Supercomputing Center
Marenostrum 4
- 13.7 PetaFlop/s
- General Purpose Computing
▪ 3400 nodes of Xeon, 11 PF/s
- Emerging Technologies
▪ Power 9 + Pascal 1.5 PF/s ▪ Knights Landing and Knights Hill 0.5 PF/s ▪ 64bit ARMv8 0.5 PF/s
2
Research at BSC
3
EARTH SCIENCES
To develop and implement global and regional state-
- f-the-art models for short-
term air quality forecast and long-term climate applications
LIFE SCIENCES
To understand living
- rganisms by means of
theoretical and computational methods (molecular modeling, genomics, proteomics) CASE
To develop scientific and engineering software to efficiently exploit super- computing capabilities (biomedical, geophysics, atmospheric, energy, social and economic simulations)
COMPUTER SCIENCES
To influence the way machines are built, programmed and used: programming models, performance tools, Big Data, computer architecture, energy efficiency
4
ALYA System: Large Scale Computational Mechanics
5
6
ALYA HPC Context
7
ALYA HPC Context
ALYA RED
8
9
Computational Cardiac Model
Applications ▪ Pacemaker applications ▪ Computational analysis of malfunctioning tissue patches ▪ Computational Drug testing on cardiac tissue.
10
Computational Cardiac Model
11
Pacemaker Application
12
Pacemaker Application
13
Computational Drug Testing
Christopher Lux (NV), Vishal Mehta (BSC) and Marc Nienhaus (NV) May 8th 2017
INTERACTIVE HPC: LARGE SCALE
IN-SITU VISUALIZATION USING NVIDIA INDEX IN ALYA MULTIPHYSICS
15
NVIDIA INDEX
GPU-cluster aware solution High-quality and scalable visualization of large-scale datasets In-situ visualization Commercial software Available and deployed in production
Scalable, Interactive Visual Computing
16
NVIDIA Quadro Workstation NVIDIA Quadro VCA or DGX-1
NVIDIA HPC Clusters
Performance, dataset size, number of pixels, visual quality…
SCALABILITY AS ENABLER
17
Scientific Data Visualization
18
Time-Varying Data Visualization
19
Time-Varying Data Visualization
Simulation data source: A Numerical Study of High-Pressure Oxygen/Methane Mixing and Combustion of a Shear Coaxial Injector, Nan Zong & Vigor Yang, AIAA 2005
20
In-Trans and In-Situ Visualization
21
Computational Heart
22
BACKGROUND
23
DISTRIBUTED PARALLEL RENDERING
[..]
* image compositing
[..]
Sort-Last Rendering (multi-GPU)
24
DISTRIBUTED PARALLEL RENDERING
[..]
* image compositing
Sort-Last Rendering (Cluster of multi-GPU Nodes)
[..]
Cluster of VCAs
25
DISTRIBUTED DATATYPES
Volume datatypes
▪ Regular ▪ Sparse ▪ Unstructured/Irregular
Surface-geometry datatypes
▪ Height field ▪ Triangle mesh
Various Application Domains
26
IN-TRANS AND IN-SITU VISUALIZATION
27
TRADITIONAL VISUALIZATION PIPELINE
5/15/2 017 Simulation Cluster Data Storage
e.g. Unstructured Data
NVIDIA IndeX Visualization Cluster
28
Stream
Visualize and Animate Interact and Explore
TIME-SERIES DATA VISUALIZATION
Visualize Pre-calculated ALYA Simulation Results
Terabyte time-varying simulation data of nasal system
29
IN-SITU (IN-TRANS) VISUALIZATION PIPELINE
Simulation Cluster NVIDIA IndeX Visualization Cluster Network
Unstructured Data Unstructured Data Unstructured Data
30
IN-SITU VISUALIZATION PIPELINE
5/15/2 017 Combined Simulation and Visualization Cluster
31
IN-SITU/IN-TRANS SUPPORT
Parallel jobs executed locally or remotely Direct access to local host and device data Fast RDMA memory transfers User-defined affinity and spatial subdivision Application-driven updates
▪ Push updated data when ready
Rendering-driven updates
▪ Request computation updates for active data
Compute Result Integration
Clustered neuron activity
32
IN-TRANS RESULT TRANSFERS
GPUDirect RDMA NVLink high-speed interconnect between system memory and GPU (IBM and NVIDIA)
Fast Data Transfers to Rendering Nodes/GPUs
Page-Locked System Memory Page-Locked System Memory
RDMA
(over InfiniBand) CUDA GPU Memory CUDA GPU Memory
33
Visualize Interact and Explore
COMPUTATIONAL HEART
In-Situ Simulation and Visualization
Simulate/Compute (ALYA) Parameterize and Steer
34
NVIDIA INDEX AVAILABILITY
35
NVIDIA INDEX 1.4
In-Situ/In-Trans Visualization Support
NVIDIA IndeX 1.4 (released 07/2016)
Support of 32bit, 16bit, 8bit fixed point, 32bit floating point and RGBA (8bit) regular volumes Dynamic streaming and GPU caching of time-varying volume data Irregular volumes and sparse volumes Built-in volume shading capabilities Multi-view capabilities Zero-copy RDMA/GPUDirect compute integration infrastructure User-defined affinity and spatial subdivision support Architecture and API for in-situ/in-trans visualization (compute integration) Dynamic workload balancing Advanced CUDA memory management, error handling and logging MPI/NVIDIA IndeX interprocess coupling (CUDA IPC and shared memory)
36
NVIDIA INDEX 1.5 OUTLOOK
User-defined Rendering Kernel Components
37
NVIDIA INDEX 1.5 OUTLOOK
User-defined Rendering Kernel Components
IN-SITU VISUALIZATIONS
38
39
Challenges
Simulation
MPI MPI MPI
. .
Unstructured Mesh Uneven Spatial partitions Balanced computations
Index Rendering
MPI MPI MPI
. .
Cubical Scene region Even spatial partitions Balanced rendering load
- Parallel Operations
- Maintain frame rates
- Steering Simulation
- Data Affinity
- Render @compute
- In-trans approach
40
Multi-code Coupling in ALYA
- All
spatial interpolation
- n
spatial domain in structured and unstructured meshes
- Allows
setting send and receive frequencies to synchronize simulation times.
- Allows coupling with third party
codes
- Parallel
and Asynchronous MPI coupling
41
Ingredients of Coupling in ALYA
- WHAT
The underlying variables
- WHERE
Surface, Volume, etc.
- WHEN
Time step, iteration step
- HOW
Algorithmic interpolation mpirun -np 8 Alya.x fluid : -np 4 Test.x : -np 4 Alya.x solidz
42
Coupling for IN-SITU Visualizations
Simulation
MPI MPI MPI
. . Index Rendering
MPI MPI MPI
. .
- Allows optimizing resources for compute
and render
- Application
Driven Updates, push simulation data.
- Allows inter-operability between coarser
and finer meshes, adjusting data updates.
- Maintains high frame rates and allows
interaction with the volume.
- Can couple multiple physics apps to a
single rendering app.
43
Steering Simulations
- Steering is Application Specific
- Steering
simulations requires handling interrupts.
- Interrupt communicated through
backward coupling.
- A
general approach by scalar/vector interrupts, and user defined function to handle the variables of simulation.
Simulation Time S1 Time S2 Time S3
. .
Rendering
Time S1 Time S2 Time S3
. . Coupling
Interrupt handler Function applied to fields
44
Summary: In-situ Visualization
- Index enables better insights into simulation data through professional visualization techniques
- Scalability is the enabler for HPC in-situ visualization.
- Multiphysics coupling is the key to scalability, and resource management for in-situ.
45
SELF-PACED LABS
NVIDIA IndeX for ParaView plugin hands-on Location: ▪ Self-paced lab area on lower level Dates: ▪ Monday 1:00 – 5:00pm ▪ Tuesday 9:30 – 11.30am ▪ Wednesday 1:00 – 5:00pm
Interactive HPC Volume Visualization in ParaView
46
INTERACTIVE DEMO
Live demonstration of in-situ visualization Interactive steering of simulation parameters Location: ▪ NVIDIA demo-booth in exhibit hall 1
Interactiver HPC: Large Scale In-Situ Visualization using NVIDIA IndeX in ALYA MultiPhysics
Christopher Lux, NVIDIA Vishal Mehta, BSC Marc Nienhaus, NVIDA