Srijan Kumar, Georgia Tech, CSE6240 Spring 2020: Web Search and Text Mining
CSE 6240: Web Search and Text Mining. Spring 2020
Node Representation Learning
- Prof. Srijan Kumar
http://cc.gatech.edu/~srijan
Administrivia: Project midterm
Machine Learning
interactome
Image from: Ganapathiraju et al. 2016. Schizophrenia interactome with 504 novel protein–protein interactions. Nature.
Machine Learning
Raw Data → Structured Data → Learning Algorithm → Model → Downstream task
Feature Engineering: the step from Raw Data to Structured Data
Feature representation, embedding
Image from: Perozzi et al. DeepWalk: Online Learning of Social Representations. KDD 2014.
$$\text{similarity}(u, v) \approx \mathbf{z}_v^\top \mathbf{z}_u$$
Left-hand side: similarity in the original network. Right-hand side: similarity of the embedding.
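The similarity of the embedding can be illustrated with a small numeric sketch. The matrix `Z`, its values, and the helper `embedding_similarity` are hypothetical and only show the dot product of two embedding vectors; they are not taken from the slides.

```python
import numpy as np

# Hypothetical embedding matrix: one row per node, d = 2 dimensions.
Z = np.array([
    [1.0, 0.0],   # node 0
    [0.8, 0.6],   # node 1
    [-1.0, 0.0],  # node 2
])


def embedding_similarity(Z, u, v):
    """Return the dot product z_v^T z_u of the embeddings of nodes u and v."""
    return float(Z[v] @ Z[u])


# Nodes 0 and 1 point in similar directions, nodes 0 and 2 in opposite ones.
sim_01 = embedding_similarity(Z, 0, 1)
sim_02 = embedding_similarity(Z, 0, 2)
print(sim_01, sim_02)
```

A larger dot product means the two nodes sit closer together in the embedding space, which is what the embedding is trained to align with similarity in the original network.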
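$$\mathcal{L} = \sum_{u \in V} \sum_{v \in N_R(u)} -\log\left(\frac{\exp(\mathbf{z}_u^\top \mathbf{z}_v)}{\sum_{n \in V} \exp(\mathbf{z}_u^\top \mathbf{z}_n)}\right)$$
– Outer sum: sum over all nodes $u$
– Inner sum: sum over nodes $v$ seen on random walks starting from $u$
– Fraction: predicted probability of $u$ and $v$ co-occurring on a random walk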
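The loss above can be sketched in a few lines of NumPy. This is a minimal illustration under stated assumptions: the toy graph `adj`, the helper names `simulate_walks` and `random_walk_loss`, and the walk settings are hypothetical, and the walks are plain unbiased random walks. It evaluates the loss only; it does not optimize the embeddings.

```python
import numpy as np


def simulate_walks(adj, walk_length, walks_per_node, rng):
    """Collect N_R(u): the nodes visited on short random walks starting from u."""
    neighborhoods = {}
    for u in adj:
        visited = []
        for _ in range(walks_per_node):
            current = u
            for _ in range(walk_length):
                # Move to a uniformly chosen neighbor of the current node.
                current = adj[current][rng.integers(len(adj[current]))]
                visited.append(current)
        neighborhoods[u] = visited
    return neighborhoods


def random_walk_loss(Z, neighborhoods):
    """Sum of -log softmax(z_u^T z_v) over all u and all v in N_R(u)."""
    scores = Z @ Z.T  # scores[u, n] = z_u^T z_n
    # Row-wise log-softmax, shifted by the row maximum for numerical stability.
    shifted = scores - scores.max(axis=1, keepdims=True)
    log_prob = shifted - np.log(np.exp(shifted).sum(axis=1, keepdims=True))
    return float(-sum(log_prob[u, v]
                      for u, nbrs in neighborhoods.items()
                      for v in nbrs))


# Hypothetical toy graph as adjacency lists (node ids 0..3).
adj = {0: [1, 2], 1: [0, 2], 2: [0, 1, 3], 3: [2]}
rng = np.random.default_rng(0)
neighborhoods = simulate_walks(adj, walk_length=3, walks_per_node=2, rng=rng)

Z = rng.normal(size=(4, 2))  # random 2-dimensional embeddings
loss = random_walk_loss(Z, neighborhoods)
# With all-zero embeddings every node is equally likely, so each term is log(4).
uniform_loss = random_walk_loss(np.zeros((4, 2)), neighborhoods)
print(loss, uniform_loss)
```

The denominator sums over every node in the graph, so evaluating the full loss this way costs on the order of |V|² operations; the sketch makes no attempt to approximate it.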
Image from: Perozzi et al. DeepWalk: Online Learning of Social Representations. KDD 2014.
[Figure: example graph with node u and nodes s1 to s9]
Possible next steps of the walk, with unnormalized transition weights:
– Back to $s_1$: $1/p$
– Same distance to $s_1$: $1$
– Farther from $s_1$: $1/q$
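The three cases above can be turned into a biased walk step. The following is a minimal sketch, assuming the node2vec-style convention in which `p` controls returning to the previous node and `q` controls moving farther away; the graph `adj`, the node names, and the helpers `transition_weights` and `biased_step` are hypothetical.

```python
import random


def transition_weights(adj, prev, current, p, q):
    """Unnormalized weight of each neighbor of `current`, given the walk came from `prev`."""
    weights = {}
    for nxt in adj[current]:
        if nxt == prev:
            weights[nxt] = 1.0 / p   # back to the previous node
        elif nxt in adj[prev]:
            weights[nxt] = 1.0       # same distance to the previous node
        else:
            weights[nxt] = 1.0 / q   # farther from the previous node
    return weights


def biased_step(adj, prev, current, p, q, rng):
    """Sample the next node in proportion to the unnormalized weights."""
    weights = transition_weights(adj, prev, current, p, q)
    nodes = list(weights)
    return rng.choices(nodes, weights=[weights[n] for n in nodes], k=1)[0]


# Hypothetical graph: the walk moved from s1 to w; s2 neighbors both, s3 only w.
adj = {
    "s1": ["w", "s2"],
    "w": ["s1", "s2", "s3"],
    "s2": ["s1", "w"],
    "s3": ["w"],
}
weights = transition_weights(adj, prev="s1", current="w", p=2.0, q=0.5)
next_node = biased_step(adj, "s1", "w", p=2.0, q=0.5, rng=random.Random(0))
print(weights, next_node)
```

With these settings, a small `q` favors moving outward and a large `p` discourages stepping back, so the walk tends to explore away from its starting region.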
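Operators $g(\mathbf{z}_u, \mathbf{z}_v)$ for combining two node embeddings, for example the difference between the embeddings:
– Concatenate: $g(\mathbf{z}_u, \mathbf{z}_v) = [\mathbf{z}_u, \mathbf{z}_v]$
– Hadamard: $g(\mathbf{z}_u, \mathbf{z}_v) = \mathbf{z}_u * \mathbf{z}_v$ (per-coordinate product)
– Sum/Avg: $g(\mathbf{z}_u, \mathbf{z}_v) = \mathbf{z}_u + \mathbf{z}_v$
– Distance: $g(\mathbf{z}_u, \mathbf{z}_v) = \lVert \mathbf{z}_u - \mathbf{z}_v \rVert_2$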
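The four operators listed above map directly onto NumPy operations. The sketch below uses hypothetical function names and example vectors, and assumes the distance is the Euclidean (L2) norm.

```python
import numpy as np


def concatenate(z_u, z_v):
    """Stack the two embeddings into one vector of length 2d."""
    return np.concatenate([z_u, z_v])


def hadamard(z_u, z_v):
    """Per-coordinate product of the two embeddings."""
    return z_u * z_v


def summed(z_u, z_v):
    """Sum of the two embeddings; divide by 2 to get the average."""
    return z_u + z_v


def l2_distance(z_u, z_v):
    """Euclidean distance between the two embeddings (a single number)."""
    return float(np.linalg.norm(z_u - z_v))


# Hypothetical 2-dimensional embeddings for nodes u and v.
z_u = np.array([1.0, 2.0])
z_v = np.array([4.0, 6.0])

pair_concat = concatenate(z_u, z_v)
pair_hadamard = hadamard(z_u, z_v)
pair_sum = summed(z_u, z_v)
pair_distance = l2_distance(z_u, z_v)
print(pair_concat, pair_hadamard, pair_sum, pair_distance)
```

Concatenation keeps the order of the pair, while the Hadamard product, sum, and distance are symmetric in the two nodes, which may matter for undirected graphs.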