The Influence of Down-Sampling Strategies on SVD Word Embedding - - PowerPoint PPT Presentation

the influence of down sampling strategies on svd word
SMART_READER_LITE
LIVE PREVIEW

The Influence of Down-Sampling Strategies on SVD Word Embedding - - PowerPoint PPT Presentation

June 6 th 2019, Minneapolis, USA RepEval 2019 The Influence of Down-Sampling Strategies on SVD Word Embedding Stability Johannes Hellrich, Bernd Kampe & Udo Hahn Jena University Language & Information Engineering (JULIE) Lab Friedrich


slide-1
SLIDE 1

RepEval 2019 June 6th 2019, Minneapolis, USA Johannes Hellrich, Bernd Kampe & Udo Hahn Down-Sampling and SVD Word Embedding Stability 1

The Influence of Down-Sampling Strategies

  • n SVD Word Embedding Stability

Johannes Hellrich, Bernd Kampe & Udo Hahn

Jena University Language & Information Engineering (JULIE) Lab Friedrich Schiller University Jena, Jena, Germany www.julielab.de

slide-2
SLIDE 2

RepEval 2019 June 6th 2019, Minneapolis, USA Johannes Hellrich, Bernd Kampe & Udo Hahn Down-Sampling and SVD Word Embedding Stability 2

Typical Word Embeddings are Unstable

dog cat tiger

lots

  • f

text corpus

dog cat tiger

random embeddings final embeddings random processing

slide-3
SLIDE 3

RepEval 2019 June 6th 2019, Minneapolis, USA Johannes Hellrich, Bernd Kampe & Udo Hahn Down-Sampling and SVD Word Embedding Stability 3

Typical Word Embeddings are Unstable

dog cat tiger

lots

  • f

text corpus

dog cat tiger

random embeddings final embeddings random processing

slide-4
SLIDE 4

RepEval 2019 June 6th 2019, Minneapolis, USA Johannes Hellrich, Bernd Kampe & Udo Hahn Down-Sampling and SVD Word Embedding Stability 5

Measuring Stability

dog cat tiger dog cat tiger dog cat tiger

lots

  • f

text corpus

j@n := 1 |A| X

a∈A

| T

m∈M msw(a, n, m)|

| S

m∈M msw(a, n, m)|

slide-5
SLIDE 5

RepEval 2019 June 6th 2019, Minneapolis, USA Johannes Hellrich, Bernd Kampe & Udo Hahn Down-Sampling and SVD Word Embedding Stability 7

Why SVD Embeddings?

lots

  • f

text corpus

dog cat tiger

final embeddings counting SVD

food roar dog 475 156 cat 823 492 tiger 51 19

slide-6
SLIDE 6

RepEval 2019 June 6th 2019, Minneapolis, USA Johannes Hellrich, Bernd Kampe & Udo Hahn Down-Sampling and SVD Word Embedding Stability 8

Why SVD Embeddings?

lots

  • f

text corpus

dog cat tiger

final embeddings counting & down-sampling SVD

food roar dog 0.02 0.01 cat 0.5 0.4 tiger 0.01 0.19

Replaced with association values in SVDPPMI (Levy et al., TACL 2015)

slide-7
SLIDE 7

RepEval 2019 June 6th 2019, Minneapolis, USA Johannes Hellrich, Bernd Kampe & Udo Hahn Down-Sampling and SVD Word Embedding Stability 9

Why Down-Sampling?

  • Avoids over-representing frequent words
  • Closer context words are more salient than distant ones

à Increased Performance (Mikolov, NIPS 2013)

slide-8
SLIDE 8

RepEval 2019 June 6th 2019, Minneapolis, USA Johannes Hellrich, Bernd Kampe & Udo Hahn Down-Sampling and SVD Word Embedding Stability 10

Down-Sampling Mechanism

Weighting

  • GloVe
  • New: SVDwPPMI

Probabilistic

  • word2vec
  • SVDPPMI
slide-9
SLIDE 9

RepEval 2019 June 6th 2019, Minneapolis, USA Johannes Hellrich, Bernd Kampe & Udo Hahn Down-Sampling and SVD Word Embedding Stability 13

Experimental Design I/II

  • Three Corpora:
  • Corpus of Historical American English 2000s decade

(COHA; 28M tokens.)

  • English News Crawl Corpus (NEWS; 550M tokens)
  • Wikipedia (WIKI; 1.7G tokens)

à Other studies used mostly COHA-sized corpora!

slide-10
SLIDE 10

RepEval 2019 June 6th 2019, Minneapolis, USA Johannes Hellrich, Bernd Kampe & Udo Hahn Down-Sampling and SVD Word Embedding Stability 14

Experimental Design II/II

  • Train 10 models each with SGNS, GloVe, SVDPPMI

(none / prob. down-sampling), SVDwPPMI

  • Evaluate intrinsically with four word similarity & two

analogy test sets

  • Measure stability with j@10 for 1k most frequent

words

slide-11
SLIDE 11

RepEval 2019 June 6th 2019, Minneapolis, USA Johannes Hellrich, Bernd Kampe & Udo Hahn Down-Sampling and SVD Word Embedding Stability 16

Stability Results

GloVe‘s high stability (Antoniak & Mimno, TACL 2018; Wendlandt et al., NAACL 2018) is true only for small corpora

slide-12
SLIDE 12

RepEval 2019 June 6th 2019, Minneapolis, USA Johannes Hellrich, Bernd Kampe & Udo Hahn Down-Sampling and SVD Word Embedding Stability 17

Exemplary Accuracy Results

Wilcoxon rank-sum test shows SVDwPPMI and SGNS to be indistinguishable in accuracy over all test sets and corpora

slide-13
SLIDE 13

RepEval 2019 June 6th 2019, Minneapolis, USA Johannes Hellrich, Bernd Kampe & Udo Hahn Down-Sampling and SVD Word Embedding Stability 18

Conclusion

  • Typical word embeddings are unstable
  • Down-sampling details greatly affect stability
  • GloVe’s stability is worse than claimed in literature
  • SVDwPPMI embeddings provide SGNS-like performance

and perfect stability

  • See paper for additional results (and bootstrapping)
slide-14
SLIDE 14

RepEval 2019 June 6th 2019, Minneapolis, USA Johannes Hellrich, Bernd Kampe & Udo Hahn Down-Sampling and SVD Word Embedding Stability 19

The Influence of Down-Sampling Strategies

  • n SVD Word Embedding Stability

Johannes Hellrich, Bernd Kampe & Udo Hahn

Jena University Language & Information Engineering (JULIE) Lab Friedrich Schiller University Jena, Jena, Germany www.julielab.de