Representing a concept by the distribution of names of its instances
Matthijs Westera, Gemma Boleda and Sebastian Padó
Representing a concept by the distribution of names of its instances - - PowerPoint PPT Presentation
Representing a concept by the distribution of names of its instances Matthijs Westera, Gemma Boleda and Sebastian Pad Representing a concept by the distribution of names of its instances A b h i j e e t G u p t a & Matthijs
Matthijs Westera, Gemma Boleda and Sebastian Padó
Matthijs Westera, Gemma Boleda and Sebastian Padó
A b h i j e e t G u p t a &
Interest in Distributional Semantics (etc.)
Interest in Distributional Semantics (etc.)
Interest in Distributional Semantics (etc.)
Interest in Distributional Semantics (etc.)
Interest in Distributional Semantics (etc.)
8
Language and the world
9
Language and the world
… that dog ate my shoe …
10
Language and the world
… that dog ate my shoe … … a young dog is called a puppy …
11
Language and the world
… that dog ate my shoe … … a young dog is called a puppy … … every cat ate too much …
12
Language and the world
… that dog ate my shoe … … a young dog is called a puppy … … every cat ate too much … … when my cat was young she …
13
Language and the world
… that dog ate my shoe … … a young dog is called a puppy … … every cat ate too much … … when my cat was young she …
14
Language and the world
… that dog ate my shoe … … a young dog is called a puppy … … every cat ate too much … … when my cat was young she …
15
Language and the world
… that dog ate my shoe … … a young dog is called a puppy … … every cat ate too much … … when my cat was young she …
16
Language and the world
… that dog ate my shoe … … a young dog is called a puppy … … every cat ate too much … … when my cat was young she …
17
… that dog ate my shoe … … a young dog is called a puppy … … every cat ate too much … … when my cat was young she …
18
Distributional Semantics (DS)
… that dog ate my shoe … … a young dog is called a puppy … … every cat ate too much … … when my cat was young she …
19
Distributional Semantics (DS)
… that dog ate my shoe … … a young dog is called a puppy … … every cat ate too much … … when my cat was young she …
20
Distributional Semantics (DS)
… that dog ate my shoe … … a young dog is called a puppy … … every cat ate too much … … when my cat was young she …
dog cat
house flat
animal red
21
Westera & Boleda (2019, IWCS):
Distributional Semantics as a model of concepts?
dog cat
house flat
animal red
22
Westera & Boleda (2019, IWCS):
Distributional Semantics as a model of concepts?
dog cat
house flat
animal red
23
Westera & Boleda (2019, IWCS):
Distributional Semantics as a model of concepts?
dog cat
house flat
animal red
24
Westera & Boleda (2019, IWCS):
Distributional Semantics as a model of concepts?
But what sort of concepts does DS model?
dog cat
house flat
animal red
25
Westera & Boleda (2019, IWCS):
Distributional Semantics as a model of concepts?
But what sort of concepts does DS model?
dog cat
house flat
animal red
26
Westera & Boleda (2019, IWCS):
Distributional Semantics as a model of concepts?
But what sort of concepts does DS model?
dog cat
house flat
animal red
27
Westera & Boleda (2019, IWCS):
Distributional Semantics as a model of concepts?
But what sort of concepts does DS model?
dog cat
house flat
animal red
28
Westera & Boleda (2019, IWCS):
Distributional Semantics as a model of concepts?
But what sort of concepts does DS model?
dog cat
house flat
animal red
29
Westera & Boleda (2019, IWCS):
Should Distributional Semantics account for entailment?
30
Westera & Boleda (2019, IWCS):
Should Distributional Semantics account for entailment?
31
Westera & Boleda (2019, IWCS):
Should Distributional Semantics account for entailment?
32
Westera & Boleda (2019, IWCS):
Should Distributional Semantics account for entailment?
33
Westera & Boleda (2019, IWCS):
Should Distributional Semantics account for entailment?
No .
34
35
Language and the world are not perfectly aligned
36
Language and the world are not perfectly aligned
37
Language and the world are not perfectly aligned
38
Language and the world are not perfectly aligned
39
Language and the world are not perfectly aligned
Does this show in DS? Can we exploit this?
40
Language and the world are not perfectly aligned
Does this show in DS? Can we exploit this?
S
e x p r e s s i
s a r e u s e d mo r e r i g i d l y t h a n
h e r s
. .
.
41
Language and the world are not perfectly aligned
Does this show in DS? Can we exploit this?
S
e x p r e s s i
s a r e u s e d mo r e r i g i d l y t h a n
h e r s
. .
. ( K r i p k e , ‘ 8 )
42
Approach
43
Approach
44
Approach
– Predicate-based:
Word vector of a predicate that is used to denote the category.
45
Approach
– Predicate-based:
Word vector of a predicate that is used to denote the category.
– Name-based:
Centroid of the word vectors of names of instances of the category.
46
Approach
– Predicate-based:
Word vector of a predicate that is used to denote the category.
– Name-based:
Centroid of the word vectors of names of instances of the category.
E . g . , f
S c i e n t i s t , t h e w
d v e c t
“ s c i e n t i s t ”
47
Approach
– Predicate-based:
Word vector of a predicate that is used to denote the category.
– Name-based:
Centroid of the word vectors of names of instances of the category.
E . g . , f
S c i e n t i s t , t h e w
d v e c t
“ s c i e n t i s t ”
E . g . , t h e me a n
v e c t
s f
“ A l b e r t E i n s t e i n ” , “ E mmy No e t h e r ” , …
48
Approach
– Predicate-based:
Word vector of a predicate that is used to denote the category.
– Name-based:
Centroid of the word vectors of names of instances of the category.
E . g . , f
S c i e n t i s t , t h e w
d v e c t
“ s c i e n t i s t ”
E . g . , t h e me a n
v e c t
s f
“ A l b e r t E i n s t e i n ” , “ E mmy No e t h e r ” , …
Matthijs Westera, Gemma Boleda and Sebastian Padó
A b h i j e e t G u p t a &
50
Existing data/model we use
51
Existing data/model we use
– e.g., <Emmy Noether, scientist>, <Edinburgh, capital>
52
Existing data/model we use
– e.g., <Emmy Noether, scientist>, <Edinburgh, capital> – derived from WordNet’s ‘instance hyponym’ relation.
53
Existing data/model we use
– e.g., <Emmy Noether, scientist>, <Edinburgh, capital> – derived from WordNet’s ‘instance hyponym’ relation.
54
Existing data/model we use
– e.g., <Emmy Noether, scientist>, <Edinburgh, capital> – derived from WordNet’s ‘instance hyponym’ relation.
predicates we use the Google News embeddings (Mikolov, Sutskever,
et al., 2013, ANIPS).
55
Evaluation: gathering human judgments
56
Evaluation: gathering human judgments
Following Bruni, Tran and Baroni’s MEN benchmark (2012, JAIR):
57
Evaluation: gathering human judgments
Following Bruni, Tran and Baroni’s MEN benchmark (2012, JAIR):
58
Evaluation: gathering human judgments
Following Bruni, Tran and Baroni’s MEN benchmark (2012, JAIR):
to each other?
59
Evaluation: gathering human judgments
Following Bruni, Tran and Baroni’s MEN benchmark (2012, JAIR):
to each other?
60
Crowdsource task
61
Main result
62
Main result
63
Main result
– cosine similarities from Name-based / Predicate-based
and
– aggregate scores from our human judgments
64
Main result
– cosine similarities from Name-based / Predicate-based
and
– aggregate scores from our human judgments
– Predicate-based: 0.56
65
Main result
– cosine similarities from Name-based / Predicate-based
and
– aggregate scores from our human judgments
– Predicate-based: 0.56 – Name-based: 0.74
66
Artist’s impression
67
Artist’s impression
68
How many names do we need?
69
How many names do we need?
70
How many names do we need?
S u r p r i s i n g l y f e w !
71
Entities need to be representative
72
Entities need to be representative
73
Entities need to be representative
– William Cowper – James Parkinson – Alexis Carrel – Walter Reed – William Beaumont – Joseph Lister
74
Entities need to be representative
– William Cowper – James Parkinson – Alexis Carrel – Walter Reed – William Beaumont – Joseph Lister
I n v
v e d i n WW1
75
Entities need to be representative
– William Cowper – James Parkinson – Alexis Carrel – Walter Reed – William Beaumont – Joseph Lister
I n v
v e d i n WW1 M e mb e r s
U S mi l i t a r y c
p s
76
Entities need to be representative
– William Cowper – James Parkinson – Alexis Carrel – Walter Reed – William Beaumont – Joseph Lister
Wr
e “ t h e s i e g e
c h e s t e r ” ( ? ) I n v
v e d i n WW1 M e mb e r s
U S mi l i t a r y c
p s
77
78
Discussion
79
Discussion
80
Discussion
– Name-based representations of category concepts align better
with ‘the world’ than Predicate-based representations.
81
Discussion
– Name-based representations of category concepts align better
with ‘the world’ than Predicate-based representations.
– Even a small number of (representative) names can be enough.
82
Discussion
– Name-based representations of category concepts align better
with ‘the world’ than Predicate-based representations.
– Even a small number of (representative) names can be enough.
– Not every category has named instances...
83
Discussion
– Name-based representations of category concepts align better
with ‘the world’ than Predicate-based representations.
– Even a small number of (representative) names can be enough.
– Not every category has named instances... – NLP relevance? Vs. sense disambiguation? Contextualized word
embeddings (ELMo, BERT, …)?
84
Discussion
– Name-based representations of category concepts align better
with ‘the world’ than Predicate-based representations.
– Even a small number of (representative) names can be enough.
– Not every category has named instances... – NLP relevance? Vs. sense disambiguation? Contextualized word
embeddings (ELMo, BERT, …)?
– Cognitive relevance? E.g., prototype theory?
85
This project has received funding from the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No 715154). This paper reflects the authors’ view only, and the EU is not responsible for any use that may be made of the information it contains.
86
Image sources
https://ui-ex.com/explore/whale-transparent-dark/ https://commons.wikimedia.org/wiki/File:Cowicon.svg https://commons.wikimedia.org/wiki/File:Bird_1010720_drawing.svg https://commons.wikimedia.org/wiki/File:Dog_silhouette.svg https://commons.wikimedia.org/wiki/File:Cat_silhouette_darkgray.svg https://commons.wikimedia.org/wiki/File:Frog_(example).svg https://commons.wikimedia.org/wiki/File:PeregrineFalconSilhouettes.svg https://commons.wikimedia.org/wiki/File:Common_goldfish_silhouette.svg https://commons.wikimedia.org/wiki/File:Six_weeks_old_cat_(aka).jpg https://nl.m.wikipedia.org/wiki/Bestand:Kooikerhondje_puppy.jpg https://nl.m.wikipedia.org/wiki/Bestand:Golden_Retriever_eating_crust_of_pizza.jpg https://commons.wikimedia.org/wiki/File:Cat-eating-prey.jpg
87
Where are predicates and names, anyway?
predicate name
88
Where are predicates and names, anyway?
predicate name
89
Crowdsource task
90
Crowdsource task
91
Crowdsource task instructions
92
Crowdsource task instructions
93
Why definitions?
94
Why definitions?
95
Why definitions?
judgments should be about the categories as intended by the Instantiation dataset we use.
96
Why definitions?
judgments should be about the categories as intended by the Instantiation dataset we use.
subject effect.)
97
Why definitions?
judgments should be about the categories as intended by the Instantiation dataset we use.
subject effect.)
98
Why definitions?
judgments should be about the categories as intended by the Instantiation dataset we use.
subject effect.)
– but this disadvantage is not an unfair one.
99
A closer look per ontological domain
100
A closer look per ontological domain Predicate
101
A closer look per ontological domain Predicate
102
A closer look per ontological domain Predicate
Name- based:
103
A closer look per ontological domain Predicate
Name- based:
104
Non-representative instances of ‘object’ categories
105
A closer look per ontological domain Predicate
Name- based:
106
A closer look per ontological domain Predicate
Name- based:
.54 .64