Modeling & Simulation, Testing & Validation
8/22/2018
HIGH PERFORMANCE COMPUTING FRAMEWORK FOR CO-SIMULATION OF VEHICLE – TERRAIN INTERACTION
Radu Serban, Nicholas Olsen, Dan Negrut University of Wisconsin – Madison
Testing & Validation HIGH PERFORMANCE COMPUTING FRAMEWORK FOR - - PowerPoint PPT Presentation
Modeling & Simulation, Testing & Validation HIGH PERFORMANCE COMPUTING FRAMEWORK FOR CO-SIMULATION OF VEHICLE TERRAIN INTERACTION Radu Serban, Nicholas Olsen, Dan Negrut University of Wisconsin Madison 8/22/2018 Modeling &
8/22/2018
Radu Serban, Nicholas Olsen, Dan Negrut University of Wisconsin – Madison
8/22/2018
Acknowledgements
– Antonio Recuero [Goodyear Tire & Rubber Co.] – Hammad Mazhar [NVIDIA] – Michael Taylor [UW and Harley-Davidson]
– Hiroyuki Sugiyama [University of Iowa] – Bryan Peterson [University of Iowa]
8/22/2018
Off-Road Vehicle Mobility
8/22/2018
Off-Road Vehicle Mobility: A Multi-Physics Problem
– Rigid body models of ground vehicles – Full vehicle subsystems (suspensions, steering, driveline, anti-roll bars, etc.) – Models for powertrain, driver (open/close loop)
– Rigid tire – Empirical tire models (Pacejka, Fiala) – Flexible FEA tire models
– Semi-empirical soil model (Bekker-Wong) – Granular terrain (DEM) – Continuum soil models (FEA-based)
– Lagrangian-Lagrangian approach
Chassis Wheels Tires Terrain Suspension Joints Contact
Internal Terrain Forces Internal Tire Forces
8/22/2018
Project Chrono
– Many-body dynamics – Nonlinear Finite Element Analysis – Fluid-Solid Interaction Problems
Chrono: An open source multi-physics dynamics engine, HPC in Sci. and Eng. – Lecture Notes in CS, Springer, 2016
8/22/2018
Project Chrono - Organization
Hardware CPU, Multicore Hardware Multiple GPU Hardware Multiple Nodes
HPC Chrono API
MBD API FEA API FSI API … API
Support for Classical Multi-Body Dynamics Support for Structural And Volumetric Elements Support for Fluid-Solid Interaction Future Chrono Expansion
Advanced Chrono Use Low-Entry Point Chrono Use
Chrono Vehicle Chrono Granular Chrono Robotics Chrono FEA Chrono FSI Chrono … Chrono MBD
Chrono::Engine
Equation Formulation Equation Solution Collision Detection HPC Support Pre/Post Processing Chrono Distributed Chrono Parallel
MPI CUDA OpenMP
8/22/2018
Bottleneck – Computational Times But what about everything in one simulation?
8/22/2018
Why Co-Simulation? Flexibility and efficiency consideration:
– no suitable integration scheme for NSC involving FEA
– implicit, adaptive HHT scheme for FEA tires – a semi-implicit Euler scheme for the granular terrain
structure of each subsystem:
– multi-core (OpenMP) evaluation of FEA internal forces and Jacobians – multi-core (OpenMP), GPU (CUDA), or distributed (MPI) granular terrain simulation
8/22/2018
Co-Simulation of Single Tire on Granular Terrain
High fidelity approach for vehicle mobility simulation: Nonlinear finite element tires operating on granular material,
8/22/2018
Co-Simulation of Full Vehicle on Granular Terrain
(penalty)
elements
@ 2.30GHz
8/22/2018
Co-Simulation of Full Vehicle on Granular Terrain
A co-simulation framework for high-performance, high-fidelity simulation of ground vehicle—terrain interaction,
Rear tires fall onto front tires initial footprints Initial drop causes large vertical contact forces Once rear tires run over settled terrain, they have larger net forces and lower resistance forces
8/22/2018
Performance Bottleneck: Granular Terrain Simulation
Straight line maneuver over granular terrain Moving patch granular terrain option
8/22/2018
Chrono::Distributed – Philosophy and Design
8/22/2018
Chrono::Distributed – Domain Decomposition
– Statically divide a predefined domain into subdomains for each MPI rank – Minimize inter-node communication with: – Non-blocking MPI – Point-to-point communications – No global collective operations
– SIMD vectorization from AVX – Multi-core shared-memory parallelism from OpenMP – Multi-node distributed-memory parallelism from MPI
8/22/2018
Chrono::Distributed – Strong Scaling Results
Ranks Number Particles Ratio Wall-clock Time (s / s) Parallel Efficiency 1 1,236,372 1 23,490 – 2 1,236,372 1 11,827 0.993 4 1,236,372 1 5,954 0.986 8 1,236,372 1 2,773 1.059 16 1,236,372 1 1,440 1.020 32 1,236,372 1 712 1.031 Parallel Efficiency 𝐹𝑇 𝑜 =
𝑈(1) 𝑜𝑈(𝑜)
Cray XC30; dedicated Cray Aries network nodes: 2 x 12 core Intel Xeon CPU E5-2697 v2 @ 2.7 GHz
8/22/2018
Chrono::Distributed – Weak Scaling Results
Ranks Number Particles Ratio Wall-clock Time (s / s) Parallel Efficiency 1 1,236,372 1 23,490 – 2 2,472,744 2 23,901 0.983 4 4,944,700 4 23,998 0.979 8 9,889,400 8 24,099 0.975 16 19,778,012 16 24,407 0.962 32 39,555,236 32 24,481 0.960 Parallel Efficiency 𝐹𝑋 𝑜 =
𝑈(1) 𝑈(𝑜)
Cray XC30; dedicated Cray Aries network 64 nodes; 2 x 12 core Intel Xeon CPU E5-2697 v2 @ 2.7 GHz
8/22/2018
Chrono::Distributed – Wave Tank Demonstration
8/22/2018
Distributed Co-Simulation Framework
8/22/2018
Full Vehicle Simulations – Performance
Maneuver Acceleration Acceleration DLC DLC Tire model ANCF ANCF Rigid mesh Rigid mesh Domain size [m x m] 8 x 3 8 x 3 110 x 6 110 x 6 Particle radius [mm] 12.5 10.0 12.5 12.5 Number particles 283,162 591,090 6,513,518 6,513,518 Step size [ms] 0.04 0.04 0.04 0.04 Number MPI ranks 5 + 8 5 + 16 5 + 16 5 + 32 Average timing information [ms] Vehicle 0.93 0.93 0.92 0.92 Terrain 348.03 391.57 3953.67 1992.25 Tire (max overall) 3483.08 3488.10 1.46 1.41 Total 3547.53 3545.19 3966.21 2002.35
8/22/2018
Double Lane Change Maneuver on Granular Terrain
Extrapolation to higher resolution:
Cray XC30; dedicated Cray Aries network nodes: 2 x 12 core Intel Xeon CPU E5-2697 v2 @ 2.7 GHz
8/22/2018
Acceleration Maneuver on Granular Terrain
Extrapolation to higher resolution:
*Animation obtained with multi-core granular terrain simulation
Cray XC30; dedicated Cray Aries network nodes: 2 x 12 core Intel Xeon CPU E5-2697 v2 @ 2.7 GHz
8/22/2018
Conclusions and Future Work
– Chrono::Distributed scales very well in both strong and weak scaling – Good scaling characteristics carry over to the co-simulation framework – Moved the bottleneck to the FEA component:
flexible tire
– Chrono::Distributed additions & enhancements
– Ongoing parallel effort to improve performance of Chrono::FEA
8/22/2018
nicholas.olsen@wisc.edu Simulation Based Engineering Lab University of Wisconsin-Madison
Project website http://projectchrono.org Chrono animations http://sbel.wisc.edu/Animations http://projectchrono.org/gallery