Bigger is Better Trends in super computers, super software, and - - PowerPoint PPT Presentation

bigger is better trends in super computers super software
SMART_READER_LITE
LIVE PREVIEW

Bigger is Better Trends in super computers, super software, and - - PowerPoint PPT Presentation

Bigger is Better Trends in super computers, super software, and super data Michael L. Norman, Director San Diego Supercomputer Center UC San Diego Why are supercomputers needed? The universe is famously large. Douglas Adams Complexity


slide-1
SLIDE 1

Bigger is Better Trends in super computers, super software, and super data

Michael L. Norman, Director San Diego Supercomputer Center UC San Diego

slide-2
SLIDE 2

Why are supercomputers needed?

The universe is famously large…. Douglas Adams

slide-3
SLIDE 3

Complexity and beauty on a vast range of scales

How can we possibly understand all that?

slide-4
SLIDE 4
slide-5
SLIDE 5

Equations of astrophysics fluid dynamics (non-relativistic)

Conservation of Mass Conservation of Momentum Conservation of Gas Energy Conservation of Radiation Energy Conservation of Magnetic Flux Newton’s law of Gravity Microphysics

χ κ κ ρ , , ), , (

E P

e p

slide-6
SLIDE 6

8 billion cell simulation of a molecular cloud Kritsuk et al. 2007

Is it Real or Memorex?

slide-7
SLIDE 7

Outline

  • Astrocomputing and supercomputing
  • A bit about computational methodology
  • Supercomputing technology trends
  • Exploring cosmic Renaissance with

supercomputers

slide-8
SLIDE 8

Astrocomputing and Supercomputing

  • Astrophysicists have

always been at the vanguard of supercomputing

– Martin Schwarzschild used LASL’s ENIAC for stellar evolution calculations (40s 50s) – Stirling Colgate, Jim Wilson pioneering simulations of core collapse supernovae (late 60s) – Larry Smarr 2-black hole collision (mid 70s)

“Probing Cosmic Mysteries Using Supercomputers”, Norman (1996)

slide-9
SLIDE 9

Cosmological N-body simulations * The Millenium Simulation

Springel et al. (2005)

slide-10
SLIDE 10

Gravitational N-body simulations

(N=1012, 2012) 2012 ACM Gordon Bell prize finalist

slide-11
SLIDE 11

Fluid turbulence

Yokokawa et al. (2002) 2X 4X 8X

slide-12
SLIDE 12

Astrocomputing and Data computing

  • Astronomers have

always been at the vanguard of digital data explosion

– VLA radio telescope – Hubble Space Telescope – Sloan Digital Sky Survey

slide-13
SLIDE 13

Sloan Digital Sky Survey

  • “The Cosmic Genome Project”
  • Two surveys in one

– Photometric survey in 5 bands – Spectroscopic redshift survey

  • Data is public

– 2.5 Terapixels of images – 40 TB of raw data => 120TB processed – 5 TB catalogs => 35TB in the end

  • Started in 1992, finished in 2008
  • Database and spectrograph

built at JHU (SkyServer)

The University of Chicago Princeton University The Johns Hopkins University The University of Washington New Mexico State University Fermi National Accelerator Laboratory US Naval Observatory The Japanese Participation Group The Institute for Advanced Study Max Planck Inst, Heidelberg Sloan Foundation, NSF, DOE, NASA

Slide courtesy of Alex Szalay, JHU

slide-14
SLIDE 14

SDSS 2.4m 0.12Gpixel PanSTARRS 1.8m 1.4Gpixel LSST 8.4m 3.2Gpixel

slide-15
SLIDE 15

Galaxy Survey Trends

15

T.Tyson (2010)

slide-16
SLIDE 16

A BIT ABOUT COMPUTATIONAL METHODOLOGY

How are supercomputers used?

slide-17
SLIDE 17
slide-18
SLIDE 18
slide-19
SLIDE 19

Mathematical model Consistent numerical representation Verified software implementation Validation Application to problem

  • f interest

Scientific Analysis Software engineering best practices Analytic solutions or experimental results Numerical experiment design Sensitivity analysis/ Uncertainty Quantification

slide-20
SLIDE 20
slide-21
SLIDE 21

Effect of Increased Resolution

MacLow et al. (1994)

slide-22
SLIDE 22

Effect of Additional Physics

slide-23
SLIDE 23

Effect of Increased Dimensionality

Stone and Norman (1992)

slide-24
SLIDE 24

discoveries

slide-25
SLIDE 25

TRENDS IN SUPERCOMPUTERS

slide-26
SLIDE 26

Top500 #3 Cray XT5 Jaguar (Oak Ridge, USA)

37,360 AMD Operton CPUs, 6 cores/CPU 224K cores 2.3 Pflops peak speed 3D torus interconnect

slide-27
SLIDE 27

Top500 #2 Tainhe-1A (Tianjin, China)

Hybrid CPU/GPU cluster (XEON/NVIDIA) 186K cores 4.7 Pflops peak speed Proprietary interconnect

slide-28
SLIDE 28

Top500 #1 Fujitsu K Computer (Riken, Japan)

88,000 Sparc64 CPUs, 8 cores/CPU 700K cores 11.28 Pflops peak speed Tofu interconnect (6D torus = 3D torus of 3D tori)

slide-29
SLIDE 29
slide-30
SLIDE 30
slide-31
SLIDE 31
slide-32
SLIDE 32

It’s all about the cores

Cores come in many forms

  • Multicore CPUs
  • Many core CPUs
  • GPUs

How you access them is different

  • On the compute node
  • Attached devices (GPUs,

FPGAs,

Intel 6-core CPU NVIDA GPU

slide-33
SLIDE 33

Fewer powerful cores More less powerful cores

slide-34
SLIDE 34
slide-35
SLIDE 35

Energy cost to reach Exaflop

From Peter Kogge, DARPA Exascale Study

1 10 100 1000 2005 2010 2015 2020 System Power (MW)

slide-36
SLIDE 36
slide-37
SLIDE 37

TRENDS IN SUPER DATA

slide-38
SLIDE 38

SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA; SAN DIEGO

The Data Deluge in Science

High energy physics astronomy drug discovery genomic medicine earth sciences social sciences

slide-39
SLIDE 39

SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA; SAN DIEGO

Why is scientific research becoming data-intensive?

  • Capacity to generate, store, transmit digital data is

growing exponentially

  • digital sensors follow Moore’s Law too
  • New fields of science driven by high-throughput

gene sequencers, CCDs, and sensor nets

  • genomics, proteomics, and metagenomics
  • astronomical sky surveys
  • seismic, oceanographic, ecological “observatories”
  • Emergence of the Internet (wired and wireless)
  • remote access to data archives and collaborators
  • Supercomputers are prodigious data generators
slide-40
SLIDE 40

SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA; SAN DIEGO

Cosmological Simulation Growth (M. Norman)

  • Increase of >2000 in problem size in 16 years
  • 2x every 1.5 years  Moore’s law for

supercomputers

Year Ngrid Ncell (B) Ncpu Machine 1994 5123 1/8 512 TMC CM5 2003 10243 1 512 IBM SP3 2006 20483 8 2048 IBM SP3 2009 40963 64 16K Cray XT5 2010 64003 262 93K Cray XT5

slide-41
SLIDE 41

SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA; SAN DIEGO

Coping with the data deluge

  • Density of storage media

keeping pace with Moore’s law, but not I/O rates

  • Time to process exponentially

growing amounts of data is growing exponentially

  • Latency for random access

limited by disk read head speed

  • Key insight: flash SSD

reduces read latency by 100x

53

slide-42
SLIDE 42

SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA, SAN DIEGO

Michael L. Norman Principal Investigator Director, SDSC Allan Snavely Co-Principal Investigator Project Scientist

2012: Era of Data Supercomputing Begins

slide-43
SLIDE 43

SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA; SAN DIEGO

What is Gordon?

  • A “data-intensive” supercomputer based on SSD

flash memory and virtual shared memory

  • Emphasizes MEM and IO over FLOPS
  • A system designed to accelerate access to

massive amounts of data being generated in all fields of science, engineering, medicine, and social science

  • Went into production Feb. 2012
  • Funded by the National Science Foundation and

available as to US researchers and their foreign collaborators thru XSEDE

slide-44
SLIDE 44

SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA; SAN DIEGO

2012: First Academic Data-Supercomputer “Gordon”

  • 16K cores/340 TF
  • 64 TB DRAM
  • 300 TB of flash SSD

memory

  • software shared

memory “supernodes”

  • Designed for “Big

Data Analytics”

slide-45
SLIDE 45

SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA; SAN DIEGO

Gordon Design: Two Driving Ideas

  • Observation #1: Data keeps getting further away

from processor cores (“red shift”)

  • Do we need a new level in the memory hierarchy?
  • Observation #2: Data-intensive applications may

be serial and difficult to parallelize

  • Wouldn’t a large, shared memory machine be better from

the standpoint of researcher productivity?

  • Rapid prototyping of new approaches to data analysis
slide-46
SLIDE 46

SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA; SAN DIEGO

The Memory Hierarchy of a Typical Supercomputer

Shared memory programming Message passing programming

Latency Gap

Disk I/O

BIG DATA

slide-47
SLIDE 47

SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA; SAN DIEGO

The Memory Hierarchy of Gordon

Shared memory programming Disk I/O

BIG DATA

slide-48
SLIDE 48

SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA; SAN DIEGO

vSMP aggregation SW

Gordon 32-way Supernode

Dual SB CN Dual SB CN Dual SB CN Dual SB CN Dual SB CN Dual SB CN Dual SB CN Dual SB CN Dual SB CN Dual SB CN Dual SB CN Dual SB CN Dual SB CN Dual SB CN Dual SB CN Dual SB CN Dual SB CN Dual SB CN Dual SB CN Dual SB CN Dual SB CN Dual SB CN Dual SB CN Dual SB CN Dual SB CN Dual SB CN Dual SB CN Dual SB CN Dual SB CN Dual SB CN Dual SB CN Dual SB CN ION 4.8 TB flash SSD Dual WM IOP ION 4.8 TB flash SSD Dual WM IOP

slide-49
SLIDE 49

SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA; SAN DIEGO

vSMP aggregation SW

Gordon 32-way Supernode

8 TF compute 2 TB DRAM 9.6 TB SSD, >1 Million IOPS

slide-50
SLIDE 50

SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA; SAN DIEGO

Gordon Architecture: Full Machine

  • 32 supernodes =

1024 compute nodes

  • Dual rail QDR

Infiniband network

  • 3D torus (4x4x4)
  • 4 PB rotating disk

parallel file system

  • >100 GB/s

D D D D D D

slide-51
SLIDE 51

Probing Cosmic Renaissance by Supercomputer

First Grav. Bound Objects  First Stars  First Galaxies  Reionization 100 - 1000 Myr ABB

slide-52
SLIDE 52

Cosmic Renaissance

  • 1. First Stars
  • 2. First Galaxies
  • 3. Reionization
slide-53
SLIDE 53

Simulating the first generation of stars in the universe

If large objects form via mergers of smaller

  • bjects……..Where did it

all begin?

What kind of object is formed? What is their significance?

February 2003

slide-54
SLIDE 54

Universe in a Box

slide-55
SLIDE 55

The Universe is an IVP suitable for computation

  • Globally, the universe evolves

according to the Friedmann equation

3 3 8 ) (

2 2 2

Λ + − =       ≡ a k G a a t H ρ π 

Hubble parameter mass-energy density spacetime curvature scale factor a(t) cosmological constant

slide-56
SLIDE 56

The Universe is an IVP...

  • Locally*, its contents obey:

– Newton’s laws of gravitational N-body dynamics for stars and collisionless dark matter – Euler or MHD equations for baryonic gas/plasma – Atomic and molecular processes important for the condensation of stars and galaxies from diffuse gas – Radiative transfer equation for photons

(*scales << horizon scale ~ ct)

slide-57
SLIDE 57

Gridding the Universe

  • Transformation to

comoving coordinates x=r/a(t)

a(t 1) a(t 2) a(t 3)

  • Triply-periodic

boundary conditions

But what about initial conditions?

slide-58
SLIDE 58

Baby Picture of the Universe

Image Shows Temperature Fluctuations in CMBR at 380,000 yr after BB

NASA/Princeton WMAP team ∆Τ/Τ~10-4

slide-59
SLIDE 59

Gravitational Instability: Origin

  • f Cosmic Structure

A B C A B C x x

ρ <ρ> ρ <ρ>

very small fluctuations gravity amplifies fluctuations

slide-60
SLIDE 60

Formation of First Bound Objects (Minihalos)

8 Mpc 1 billion particles/cells

slide-61
SLIDE 61

Science

slide-62
SLIDE 62

Range of scales=5 x 107

Fuld Hall IAS, Princeton

slide-63
SLIDE 63

Formation of a First Generation Star (Zoom-in on one minihalo)

slide-64
SLIDE 64

Findings and Implications

Abel, Bryan & Norman (2002; Science Express)

  • First stars are massive: ~100 M(solar)
  • Only one star forms per microgalaxy
  • They will be extraordinarily luminous and photo-

ionize the intergalactic medium

  • They will explode as supernovae, and seed the

universe with heavy elements (C, N, O, Ca, Si, Fe…..)

slide-65
SLIDE 65

Making a First Galaxy (Protogalaxy)

  • A first galaxy forms out
  • f the debris of 100-

1000 first stars pulled together by gravity

  • Heavy elements

produced by the first supernovae allow the gas to cool faster and produce the first “normal stars”

  • Radiation from the first

stars and galaxies ionizes and heats the intergalactic gas

  • Ionized and chemically

enriched gas gets ejected into space

  • All of this physics needs

to be simulated over a vast range of scales

slide-66
SLIDE 66

The Birth of a Galaxy

Wise et al. 2012a,b

slide-67
SLIDE 67

The (Violent) Birth of a Galaxy

Wise et al. 2012a,b

slide-68
SLIDE 68

The Birth of a Galaxy - Stars

slide-69
SLIDE 69

First Galaxies and Reionization

slide-70
SLIDE 70

SAN DIEGO SUPERCOMPUTER CENTER at the UNIVERSITY OF CALIFORNIA; SAN DIEGO

Cosmology simulation matter power spectrum measurement using vSMP

Source: Rick Wagner, Michael L. Norman. SDC. Used by permission. 2012

We have run two large (32003 uniform grid) simulations, with and without radiation hydrodynamics, to measure the effect of the light from the first stars on the evolution of the universe. To quantitatively compare the matter distribution of each simulation, we use radially binned 3D power spectra.

  • 2 simulations
  • 32003 uniform 3D grids
  • 244GiB+ per field
  • 15k+ files each

Individual simulations Power spectra

  • Ran existing OpenMP-

threaded code

  • ~256GiB memory used
  • ~5 ½ hours per field
  • 0 development effort

Difference

slide-71
SLIDE 71

Key messages

  • Astrocomputing and supercomputing

– Astronomers have always been on the vanguard – Astronomy applications are voracious in their computing demands

  • Technology trends

– HW: Moore’s law for supercomputing is alive and well (if not accelerating) – HW: Its all about the cores; different ways they are offered – SW: Efficient use requires heroic programming efforts – Data: new data-intensive architectures needed to cope with data deluge (Gordon)

  • Applications to Cosmic Renaissance

– First starsfirst galaxiesreionization – Suppression of DM power by Jeans smoothing