Asynchronous Distributed-Memory Task-Parallel Algorithm for Compressible Flows on 3D Unstructured Grids
- J. Bakosi, M. Charest, A. Pandare, J. Waltz
Asynchronous Distributed-Memory Task-Parallel Algorithm for - - PowerPoint PPT Presentation
Asynchronous Distributed-Memory Task-Parallel Algorithm for Compressible Flows on 3D Unstructured Grids J. Bakosi, M. Charest, A. Pandare , J. Waltz Los Alamos National Laboratory, Los Alamos, NM, USA October 20, 2020 LA-UR-20-28309 Project
◮ shocked flow over surrogate reentry bodies ◮ blast loading on vehicles or other complex structures ◮ weapons effects calculations in urban environments
◮ external flows over complex 3D geometries ◮ high-speed compressible flow
◮ complex domain must be explicitly meshed (rather than modeled) ◮ multiple orders of magnitude larger computational meshes ◮ larger demand for HPC: O(109) cells, O(104) CPUs must be routine calculations
10
10
10
log(h) 10
10
10
10
log(L2) ρ ρu1 ρu2 ρu3 ρE 2nd order
0.2 0.4 0.6 0.8 1 1.2 x 1 2 3 4 density Mesh 1 Mesh 2 Mesh 3 Mesh 4 semi-analytic 10
10
Log(h) 10
10
Log(L1) rho (Slope = 0.9592) 1st order
0.2 0.4 0.6 0.8 1 1.2 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
x/c Surface pressure coefficient at 20% semispan experiment computation (coarse mesh) computation (finer mesh)
0.2 0.4 0.6 0.8 1 1.2 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
x/c Surface pressure coefficient at 44% semispan experiment computation (coarse mesh) computation (finer mesh)
0.5 1 1.5 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
x/c Surface pressure coefficient at 65% semispan experiment computation (coarse mesh) computation (finer mesh)
0.2 0.4 0.6 0.8 1 1.2 1.4 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
x/c Surface pressure coefficient at 80% semispan experiment computation (coarse mesh) computation (finer mesh)
0.2 0.4 0.6 0.8 1 1.2 1.4 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
x/c Surface pressure coefficient at 90% semispan experiment computation (coarse mesh) computation (finer mesh)
0.2 0.4 0.6 0.8 1 1.2 1.4 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1
x/c Surface pressure coefficient at 95% semispan experiment computation (coarse mesh) computation (finer mesh)
360 900 1800 3600 7200 14400 28800 36000 900 1800 3600 7200 14400 28800 50400
2
3
4
5
1
2
3
4
CG, non-SMP CG, SMP DG(P1), non-SMP DG(P1), SMP ALECG, non-SMP ALECG, SMP ideal
100 200 300 400 500 time step 5000 10000 15000 20000 grind-time, ms/timestep no extra load, virt=0, noLB no extra load, virt=100x, noLB extra load, virt=0, noLB extra load, virt=10x, GreedyCommLB extra load, virt=100x, GreedyCommLB extra load, virt=100x, DistributedLB extra load, virt=100x, NeighborLB