Hardware Accelerated Similarity Search George Williams Who Am I? - - PowerPoint PPT Presentation

hardware accelerated similarity search
SMART_READER_LITE
LIVE PREVIEW

Hardware Accelerated Similarity Search George Williams Who Am I? - - PowerPoint PPT Presentation

Hardware Accelerated Similarity Search George Williams Who Am I? Director, GSI Technology Previously, Chief Data Scientist Senior Data Scientist AI Research Scientist Software Engineer Recent Headlines Convergence and Integration That was


slide-1
SLIDE 1

Hardware Accelerated Similarity Search

George Williams

slide-2
SLIDE 2

Who Am I?

Director, GSI Technology

Previously, Chief Data Scientist Senior Data Scientist AI Research Scientist Software Engineer

slide-3
SLIDE 3

Recent Headlines

slide-4
SLIDE 4

Convergence and Integration

That was then...

slide-5
SLIDE 5

This is Now: Technology Disintegration

slide-6
SLIDE 6

More Innovation Around The Corner

?

slide-7
SLIDE 7

GSI’s Similarity Search Accelerator

slide-8
SLIDE 8

Agenda

  • Chip Explosion
  • GSI Technology
  • What is Vector Similarity Search?
  • GSI’s Similarity Search Accelerator
  • Integration Case Studies: Bio, Database
  • Early Adopters Program
slide-9
SLIDE 9

Who Is GSI Technology?

slide-10
SLIDE 10

Aerospace, Government, R&D High Performance SRAM and DRAM GSI Vector Similarity Search Accelerator Chip

What We Do

slide-11
SLIDE 11

What Is Vector Similarity Search?

slide-12
SLIDE 12

What is Vector Similarity Search?

Numeric Representation Bit-vector 0110000100 Coordinates (2.3, 5.6)

slide-13
SLIDE 13

Numeric Representation Simple “Distance” Function d = Func (a, b)

What is Vector Similarity Search?

slide-14
SLIDE 14

K = 5 K = 3

Numeric Representation Simple “Distance” Function K Nearest Neighbor (Top-K)

What is Vector Similarity Search?

slide-15
SLIDE 15

K = 5 K = 3

Numeric Representation Simple “Distance” Functions K Nearest Neighbor (Top-K) Search is Computational E-Commerce, Bioinformatics

What is Vector Similarity Search?

slide-16
SLIDE 16

E-Commerce: Visual Search

slide-17
SLIDE 17

Visual Search

Binary Codes, Continuous Embeddings Euclidean, L1, Hamming, Cosine >1 Billion Images

slide-18
SLIDE 18

Visual Search: Embedding Space

slide-19
SLIDE 19

Bioinformatics: Molecule Similarity

Fingerprints Tanimoto Many Large DBs 100s GB

slide-20
SLIDE 20

Molecule Similarity: Tanimoto

Jaccard Intersection / Union


slide-21
SLIDE 21

Bioinformatics: Molecule Similarity

Drug Discovery of Novel Molecules Virtual Screening Activity (Toxicity) Prediction

slide-22
SLIDE 22

Programming Interfaces

Idiomatic SQL Integrate Into Data Pipelines Leverage Skills of Data Eng & Scientists

slide-23
SLIDE 23

Many Domains and Applications

E-Commerce / Recommendations Bioinformatics / Genomics Healthcare / Medical Records Cybersecurity / Malware Detection Computer Vision / Video Surveillance

slide-24
SLIDE 24

GSI’s Similarity Search Accelerator

slide-25
SLIDE 25

Computational Memory

slide-26
SLIDE 26

“In-Place” Associative Processing

Bit Logic

  • Programmable
  • 2 million
slide-27
SLIDE 27

Consumer Board Solution

16GB Memory

PCIe Card

128Mb 128Mb

2 Chips Per Board On Board DDR4 Main Memory SRAM Cache Per Chip

slide-28
SLIDE 28

1 Chassis (4U) Solution

16GB Memory

PCIe Card

128Mb 128Mb

16GB Memory

PCIe Card

128Mb 128Mb

16GB Memory

PCIe Card

128Mb 128Mb

16GB Memory

PCIe Card

128Mb 128Mb

Chassis

4 Boards Per Chassis

slide-29
SLIDE 29

Multiple Chassis Solution

One Chassis Is The Master

slide-30
SLIDE 30

Network Attached Storage

RDMA support (NAS As Data Source)

Network

..

16GB Memory

PCIe Card

128Mb 128Mb

16GB Memory

PCIe Card

128Mb 128Mb

16GB Memory

PCIe Card

128Mb 128Mb

16GB Memory

PCIe Card

128Mb 128Mb

Chassis

...

slide-31
SLIDE 31

Segmentation by Clustering

Offline Clustering / K-Means Avoids Full DB “Scan” Faster Performance

slide-32
SLIDE 32

Availability

Q2, 2019 Q1, 2019

demo boards mass production chip

Q4, 2018

16GB Memory

PCIe Card

128Mb 128Mb

slide-33
SLIDE 33

Weizmann Institute Case Study

“GSI’s [Accelerator] can dramatically reduce the time required to search our small molecules database...”

  • Dr. Efrat Ben-Zeev, Computational Chemist
slide-34
SLIDE 34

Weizmann Institute Case Study

Molecule Similarity Search Biovia Pipeline Pilot Application Query of 34M Molecule DB Takes 10 Minutes ! Using GSI Accelerator (estimated) Query Latency Reduced To 300ms 400 Queries In 1 Second

slide-35
SLIDE 35

Biovia Application Integration

Application Python Library C Library DRIVERS GSI Accelerator

slide-36
SLIDE 36

In-Memory Database Integration

Database C Library DRIVERS GSI Accelerator

slide-37
SLIDE 37

In-Memory: Expected Performance

Memory Speed Vector Size Throughput (imgs/ sec) MemSQL ~50GB / sec 4 KB

12.5 million images/sec

GSI ~100GB / sec 4 KB

25 million images/sec

slide-38
SLIDE 38

Early Adopters Program

Consult With Our Hardware and AI Experts Co-Development and App Integration Access to simulator and test hardware Co-Marketing Opportunity

slide-39
SLIDE 39

GSI Upcoming Events

Nov, Open Data Science Panel, Visual Search Nov, PyData ( Washington DC ) Dec, GSI Similarity Search Accelerator Workshop Coming Soon, GSI’s Tech Meetup 2019, First Chips and Boards Available

slide-40
SLIDE 40

Contact Us

Twitter: @gsitechnology
 @cgeorgewilliams Blogs: gsitechnology.com
 medium.com/gsitechnology Email: associativecomputing@gsitechnology.com

slide-41
SLIDE 41

The End. Thanks !

slide-42
SLIDE 42
slide-43
SLIDE 43

Query

Option 1: INFERENCE IN PRODUCTION

  • Image2Vec (VGG, Resnet)
  • NLP
  • LSTN

Option 2: Query Vector done by External Application

  • Fingerprint Query
  • In Memory Vector (Cosine)

Option 3: 3rd Party Inference Vector

slide-44
SLIDE 44

Single Board

Small Database Fit All Data Into Cache For Lowest Latency If Larger, Paging Occurs To Memory Cluster Techniques

16GB Memory

PCIe Card

128Mb 128Mb

slide-45
SLIDE 45

Large Database

Large Database (<1TB, Flat) Pharma, Drug Search, Weizmann Molecule Search, In-Memory

16GB Memory

PCIe Card

128Mb 128Mb

16GB Memory

PCIe Card

128Mb 128Mb

16GB Memory

PCIe Card

128Mb 128Mb

16GB Memory

PCIe Card

128Mb 128Mb

Chassis

...

slide-46
SLIDE 46

Multi Board Solutions

For Huge Databases ( ~ 1TB )

16GB Memory

PCIe Card

128Mb 128Mb

16GB Memory

PCIe Card

128Mb 128Mb

16GB Memory

PCIe Card

128Mb 128Mb

16GB Memory

PCIe Card

128Mb 128Mb

Chassis

...

For Throughput: Batch Queries Split Across Boards Chassis Host Master 
 Merges The Results

slide-47
SLIDE 47

Offline Data Preparation

Training Inference Optimize For Cache and Memory

slide-48
SLIDE 48

Clustering For Large Databases

Offline Clustering Centroids List <16GB Reduces Storage Only Centroids Are Kept Local For Real-Time Performance

slide-49
SLIDE 49

Huge Database

Exact Nearest Neighbors For Small Vectors

16GB Memory

PCIe Card

128Mb 128Mb

16GB Memory

PCIe Card

128Mb 128Mb

16GB Memory

PCIe Card

128Mb 128Mb

16GB Memory

PCIe Card

128Mb 128Mb

Chassis

...

Approx For Large Vectors (Quantization) Large Scale Sim Search, FAISS

slide-50
SLIDE 50

Biovia Application Integration

Biovia Application Used By Thousands of Bio-Tech Companies

slide-51
SLIDE 51

Weizmann: Load A Database

slide-52
SLIDE 52

Weizmann: 3rd Party Search

slide-53
SLIDE 53

Weizmann: Select Search Method

slide-54
SLIDE 54

Weizmann: Search

slide-55
SLIDE 55

Weizmann: Define Parameters

slide-56
SLIDE 56

Weizmann: Run Protocol