Teraflops for the Masses: Killer Apps of Tom orrow Pradeep K. Dubey - - PDF document

teraflops for the masses killer apps of tom orrow pradeep
SMART_READER_LITE
LIVE PREVIEW

Teraflops for the Masses: Killer Apps of Tom orrow Pradeep K. Dubey - - PDF document

Teraflops for the Masses: Killer Apps of Tom orrow Pradeep K. Dubey Senior Principal Engineer Corporate Technology Group EDGE UNC, Raleigh, May 23, 2006 Evolution continues Evolution continues Upcoming Transition Media Evolution


slide-1
SLIDE 1

1

Pradeep K. Dubey Senior Principal Engineer Corporate Technology Group EDGE UNC, Raleigh, May 23, 2006 Teraflops for the Masses: Killer Apps of Tom orrow

May 2 3 , 2 0 0 6 Pradeep K. Dubey pradeep.dubey@intel.com 2

Scene complexity: moderate Local processing dominated Media Evolution Graphics Evolution Mining Evolution Scene complexity: large Global processing dominated Scene complexity: real-world Physical simulation dominated Modality-specific streaming Modality-aware transformation Multimodal recognition Dataset: static/structured Response: offline Dataset: dynamic, multimodal Response: interactive Dataset: massive+streaming Response: real-time

Upcoming Transition

Next Transition

Evolution continues … Evolution continues …

Workload convergence: multimodal recognition and synthesis over complex datasets Workload convergence: multimodal recognition and synthesis over complex datasets

slide-2
SLIDE 2

2

May 2 3 , 2 0 0 6 Pradeep K. Dubey pradeep.dubey@intel.com 3

Photo-real Synthesis Real-world animation Ray tracing Global Illumination Behavioral Synthesis Physical simulation Kinematics Emotion synthesis Audio synthesis Video/Image synthesis Document synthesis

Synthesis Multimodal event/object Recognition Statistical Computing Machine Learning Clustering / Classification Model-based: Bayesian network/Markov Model Neural network / Probability networks LP/IP/QP/Stochastic Optimization

Large dataset mining Semantic Web/Grid Mining Streaming Data Mining Distributed Data Mining Content-based Retrieval Collaborative Filters Multidimensional Indexing Dimensionality Reduction Dynamic Ontologies Efficient access to large, unstructured, sparse datasets Stream Processing

Mining

Indexing

Recognition

Streaming

Graphics

Evolving tow ards m odel-based com puting Evolving tow ards m odel-based com puting

May 2 3 , 2 0 0 6 Pradeep K. Dubey pradeep.dubey@intel.com 4

What is a tumor? Is there a tumor here?

What if the tumor progresses?

It is all about dealing efficiently with complex multimodal datasets It is all about dealing efficiently with complex multimodal datasets

Recognition Mining Synthesis

Images courtesy: http://splweb.bwh.harvard.edu:8000/pages/images_movies.html

slide-3
SLIDE 3

3

May 2 3 , 2 0 0 6 Pradeep K. Dubey pradeep.dubey@intel.com 5

Tomorrow

What is …? What if …? Is it …?

Recognition Mining Synthesis

Create a model instance

RMS: Recognition Mining Synthesis RMS: Recognition Mining Synthesis

Model-based multimodal recognition

Find a model instance Model

Real-time analytics on dynamic, unstructured, multimodal datasets Photo-realism and physics-based animation

Today

Model-less Real-time streaming and transactions on static – structured datasets Very limited realism

May 2 3 , 2 0 0 6 Pradeep K. Dubey pradeep.dubey@intel.com 6

Visual Input Streams

Find an existing model instance

What is …? What if …? Is it …?

Recognition Mining Synthesis

Create a new model instance I nteractive RMS ( iRMS) I nteractive RMS ( iRMS) Model Most RMS apps are about enabling interactive (real-time) RMS Loop or iRMS Most RMS apps are about enabling interactive (real-time) RMS Loop or iRMS

Synthesized Visuals Learning & Modeling

Graphics Rendering + Physical Simulation Computer Vision Reality Augmentation

slide-4
SLIDE 4

4

May 2 3 , 2 0 0 6 Pradeep K. Dubey pradeep.dubey@intel.com 7

Going beyond media-stream encode-decode-transcode! Going beyond media-stream encode-decode-transcode!

Next-Generation Entertainm ent Next-Generation Entertainm ent

Model the ball Shade/Bounce the ball Replace the ball Find the ball RMS Primitives:

May 2 3 , 2 0 0 6 Pradeep K. Dubey pradeep.dubey@intel.com 8

Going beyond ‘red-eye removal’ Going beyond ‘red-eye removal’

Real-tim e DCC Loop Real-tim e DCC Loop

Model the ball Shade/Bounce the ball

Mine/Track/Replace the ball

RMS Closed Loop Rendering+Physics+Vision What if … what if …

slide-5
SLIDE 5

5

May 2 3 , 2 0 0 6 Pradeep K. Dubey pradeep.dubey@intel.com 9

Machine learning Neural networks Probabilistic reasoning

Fuzzy logic Belief networks Evolutionary computing Chaos theory

Soft Computing

Rendering Simulation

collision detection force solver global illumination …

Physics Soft Physics?

W here are w e headed …

Dynamics Constraints Constraint Dynamics

May 2 3 , 2 0 0 6 Pradeep K. Dubey pradeep.dubey@intel.com 1 0

Pool of RMS Functions Interior-point, Spectral Bundle, SVM … Pool of Mathematical Techniques Conic optimization, Subspace projection … Benefiting Applications Real-time asset management, text mining camera stream mining, 3D graphics … RMS Com puting Core RMS Com puting Core

Unstructured Information Management Analytics Vision/Tracking Gaming

slide-6
SLIDE 6

6

May 2 3 , 2 0 0 6 Pradeep K. Dubey pradeep.dubey@intel.com 1 1

FIMI FIMI PDE PDE NLP NLP Level Set Level Set

Computer Vision Computer Vision Physical Simulation Physical Simulation (F inancial) Analytics (F inancial) Analytics Data Mining Data Mining

Particle Particle Filtering Filtering SVM SVM Classification Classification SVM SVM Training Training IPM IPM (LP, QP) (LP, QP) Fast Marching Fast Marching Method Method Springs Springs K K-

  • Means

Means Text Text Indexer Indexer Monte Carlo Monte Carlo Body Body Tracking Tracking Face Face Detection Detection CFD CFD Face Face Cloth Cloth Portfolio Portfolio Management Management Derivative Derivative Pricing Pricing

Clustering Clustering Classification Classification Indexing

Indexing Basic matrix primitives Basic matrix primitives (dense/sparse, structured/unstructured) (dense/sparse, structured/unstructured) Integrator Integrator Basic Iterative Solver Basic Iterative Solver (Jacobi, GS, SOR) (Jacobi, GS, SOR) Direct Solver Direct Solver (Cholesky) (Cholesky) Krylov Iterative Solvers Krylov Iterative Solvers (PCG) (PCG)

Rendering Rendering

Global Global Illumination Illumination

Workload Convergence Workload Convergence RMS Primitives RMS Primitives

May 2 3 , 2 0 0 6 Pradeep K. Dubey pradeep.dubey@intel.com 1 2

Computational Fluid Dynamics Mesh Refinement Rendering: Path Tracing Object Tracking Object Recognition Computer Vision: Depth from Stereo AI for Games: Path planning Portfolio Selection Asset Allocation Asset-Liability Management Risk Management Multi-Look Option Pricing Interest-Rate Derivative Pricing Multi-Party Auctions Differential Equations Solvers

(Parabolic, Elliptic, Hyperbolic, Finite Element Method, Stochastic)

Stochastic Optimization

(Sim Annealing, Genetic Alg, Bayes Learning)

Numerical Integration

(Monte Carlo, Quasi-MC, Gaussian)

Combinatorial Optimization

(Integer Prog, Dynamic Prog)

Convex Optimization

(LP, QP, SOCP, SDP, Network, SVM)

Iterative Solvers

(Conjugate Gradients, Gauss-Seidel, Jacobi, GMRES)

Entertainment Tomorrow Business/Finance Today

Common Core Computing Kernel

Vision Graphics Hollyw ood to W all Street Hollyw ood to W all Street

slide-7
SLIDE 7

7

May 2 3 , 2 0 0 6 Pradeep K. Dubey pradeep.dubey@intel.com 1 3

RMS Com puting Core: Scaling to Next Generation Needs

Map-based shading SIMPLEX based linear

  • ptimization

Mass-Spring based deformation Marker-based explicit surface tracking Linear manifold based recognition/ modeling Linear Complementarity problem Low dimension classifiers Global Illumination based IPM based LP/ QP/ NLP

  • ptimization

FD/ FE/ FV based deformation Level Set based implicit surface tracking Non-linear manifolds computer vision Non-linear Complementarity High dimension classifiers

Non-Linear And Generative

May 2 3 , 2 0 0 6 Pradeep K. Dubey pradeep.dubey@intel.com 1 4

Sum m ary

There are mass applications that require significant increase in compute density

  • There is nothing as general-purpose as physics!
  • Visual computing is a proxy of this much larger class (RMS)

These applications are not linear extensions of existing usage

  • Optimal platform for such apps should not be linear extension

either There is a significant performance difference between a brute- force CMP Vs. a smart CMP targeted for this class

  • There is significant opportunity for silicon differentiation

These apps will likely be the driver for most future technology vectors

  • Programming to processor to memory technology
slide-8
SLIDE 8

8

May 2 3 , 2 0 0 6 Pradeep K. Dubey pradeep.dubey@intel.com 1 5