AMR dependency parsing with a typed semantic algebra Jonas - - PowerPoint PPT Presentation

amr dependency parsing with a typed semantic algebra
SMART_READER_LITE
LIVE PREVIEW

AMR dependency parsing with a typed semantic algebra Jonas - - PowerPoint PPT Presentation

AMR dependency parsing with a typed semantic algebra Jonas Groschwitz*^, Matthias Lindemann*, Meaghan Fowlie*, Mark Johnson^, Alexander Koller* *Saarland University ^Macquarie University ACL 2018 Melbourne, Australia July 17


slide-1
SLIDE 1

AMR dependency parsing with a typed semantic algebra

Jonas Groschwitz*^, Matthias Lindemann*, Meaghan Fowlie*, Mark Johnson^, Alexander Koller* *Saarland University ^Macquarie University ACL 2018 Melbourne, Australia July 17

slide-2
SLIDE 2

2

Matthias Lindemann Saarland University Meaghan Fowlie Saarland University Mark Johnson Macquarie University Alexander Koller Saarland University

slide-3
SLIDE 3

Abstract Meaning Representation (AMR)

3

witch cast spell

ARG0 ARG1 A R G ARG1

try

The witch tried to cast a spell

Banarescu et al. 2013

slide-4
SLIDE 4

Abstract Meaning Representation (AMR)

4

witch cast spell

ARG0 ARG1 A R G ARG1

try

The witch tried to cast a spell

Parsing Banarescu et al. 2013

slide-5
SLIDE 5

5

slide-6
SLIDE 6

6

Classic AMR parser (e.g. JAMR 2014)

witch cast spell try witch cast spell

ARG0 ARG1 A R G ARG1

try

Step 1: Predict nodes Step 2: Predict edges

The witch tried to cast a spell

slide-7
SLIDE 7

Not just nodes and edges

7

witch cast spell

ARG0 ARG1 A R G ARG1

try

slide-8
SLIDE 8

Not just nodes and edges

7

Noun Transitive verb Control verb cast

ARG0 ARG1 A R G ARG1

try spell witch

slide-9
SLIDE 9

Hidden compositional structure

8

Principle of compositionality: the meaning of a complex expression is determined by the meanings of its constituent expressions and the rules used to combine them.

The witch tried to cast a spell

witch

ARG1 A R G

S O

cast spell

ARG0 A R G 1

S O[S]

try

⊥ ⊥ ⊥

APPO APPS APPO

slide-10
SLIDE 10

Hidden compositional structure

8

Principle of compositionality: the meaning of a complex expression is determined by the meanings of its constituent expressions and the rules used to combine them.

The witch tried to cast a spell

witch

ARG1 A R G

S O

cast spell

ARG0 A R G 1

S O[S]

try

APPO APPS APPO

slide-11
SLIDE 11

Hidden compositional structure

8

Principle of compositionality: the meaning of a complex expression is determined by the meanings of its constituent expressions and the rules used to combine them.

  • Widely accepted in linguistics, long history (Frege 1800s)
  • Use this knowledge to guide machine learning!

The witch tried to cast a spell

witch

ARG1 A R G

S O

cast spell

ARG0 A R G 1

S O[S]

try

APPO APPS APPO

slide-12
SLIDE 12

9

The witch tried to cast a spell

witch cast spell try

witch

A R G 1 ARG0 S O cast

spell

A R G ARG1 S O[S] try

APPO

APPS APPO

slide-13
SLIDE 13

9

The witch tried to cast a spell

difficult

witch cast spell try

witch

A R G 1 ARG0 S O cast

spell

A R G ARG1 S O[S] try

APPO

APPS APPO

slide-14
SLIDE 14

9

The witch tried to cast a spell

easy (easier) equivalent

witch cast spell try

dependencies!

witch

A R G 1 ARG0 S O cast

spell

A R G ARG1 S O[S] try

AppO AppS AppO

witch

A R G 1 ARG0 S O cast

spell

A R G ARG1 S O[S] try

APPO

APPS APPO

slide-15
SLIDE 15

9

The witch tried to cast a spell

easy (easier) equivalent

witch cast spell try

Part 1 Part 2 dependencies!

witch

A R G 1 ARG0 S O cast

spell

A R G ARG1 S O[S] try

AppO AppS AppO

witch

A R G 1 ARG0 S O cast

spell

A R G ARG1 S O[S] try

APPO

APPS APPO

slide-16
SLIDE 16

Apply-Modify (AM) Algebra

10

witch

cast spell try

  • G. et al, IWCS 2017

A R G 1 A R G

S O cast

*HR algebra, Courcelle & Engelfriet 2012

slide-17
SLIDE 17

Apply-Modify (AM) Algebra

10

witch

cast spell try

  • G. et al, IWCS 2017

A R G 1 A R G

S O cast

*HR algebra, Courcelle & Engelfriet 2012

  • Empty argument slots are labeled with sources* S,O,… (subject, object,…)
slide-18
SLIDE 18

Apply-Modify (AM) Algebra

10

witch

cast spell try

  • G. et al, IWCS 2017

A R G 1 A R G

S O cast

*HR algebra, Courcelle & Engelfriet 2012

head spell argument

ARG1 A R G

cast S spell

=

AppO

  • Empty argument slots are labeled with sources* S,O,… (subject, object,…)
  • Have ‘apply’ operation for each source, e.g. APPO
slide-19
SLIDE 19

Typed AM Algebra

11

cast

ARG1 A R G

cast S spell Has type [S]

witch

cast spell try

slide-20
SLIDE 20

Typed AM Algebra

11

ARG0 A R G 1

try S O[S] Object must have type [S] cast

ARG1 A R G

cast S spell Has type [S]

witch

cast spell try

slide-21
SLIDE 21

Typed AM Algebra

11

ARG0 A R G 1

try S O[S] Object must have type [S] Matching sources automatically merge cast spell

ARG0 ARG1 A R G ARG1

try S

=

AppO

cast

ARG1 A R G

cast S spell Has type [S]

witch

cast spell try

slide-22
SLIDE 22

Apply-Modify Algebra

12

witch cast spell

ARG0 ARG1 A R G ARG1

try cast spell

ARG0 ARG1 A R G ARG1

try S witch

=

witch

cast spell try

AppS

The witch tried to cast a spell

slide-23
SLIDE 23

13

Types control reentrancies

A R G 1 A R G

S O cast

AppO

ARG0

S sleep

ARG0

sleep

A R G 1 A R G

S cast

=

*cast to sleep

Has type [S]

slide-24
SLIDE 24

13

Types control reentrancies

A R G 1 A R G

S O cast

AppO

ARG0

S sleep

ARG0

sleep

A R G 1 A R G

S cast

=

*cast to sleep

Object must have type [ ] Has type [S]

slide-25
SLIDE 25

Types control reentrancies

14

witch

ARG0 A R G 1

try S

ARG0 A R G 1

try S O[S] Object must have type [S]

AppO

Has type [ ] witch

*tried to witch

=

witch

cast spell try

slide-26
SLIDE 26

AM Dependency Trees

15

dependencies define operations, but not their order

The witch cast a spell

witch

ARG1 ARG0

S O cast spell

AppS AppO

slide-27
SLIDE 27

AM Dependency Trees

15

dependencies define operations, but not their order

The witch cast a spell

witch

ARG1 ARG0

S O cast spell

AppS AppO

here: order does not matter

slide-28
SLIDE 28

AM Dependency Trees

16

The witch tried to cast a spell

witch

ARG1 ARG0

S O cast spell

ARG0 ARG1

S O[S] try

AppO AppS AppO

here: need APPO before APPS to get reentrancies

slide-29
SLIDE 29

AM Dependency Trees

16

The witch tried to cast a spell

witch

ARG1 ARG0

S O cast spell

ARG0 ARG1

S O[S] try

AppO AppS AppO

here: need APPO before APPS to get reentrancies

  • Always need to resolve reentrancies first
  • Types encode reentrencies

➡ use type system to determine operation order

slide-30
SLIDE 30

AM Dependency Trees

16

The witch tried to cast a spell

witch

ARG1 ARG0

S O cast spell

ARG0 ARG1

S O[S] try

AppO AppS AppO

Building instructions for an AMR that we know how to predict

slide-31
SLIDE 31

17

The witch tried to cast a spell

easy (easier) equivalent

witch

cast spell try

Part 1 Part 2

witch

A R G 1 ARG0 S O cast

spell

A R G ARG1 S O[S] try

APPO

APPS APPO

witch

A R G 1 ARG0 S O cast

spell

A R G ARG1 S O[S] try

AppO AppS AppO

slide-32
SLIDE 32

Model

18

  • 1. Supertagging: score graph fragments for each word
  • 2. Dependency model: score operations
  • 3. Decoding: find highest-scoring well-typed tree

The witch tried to cast a spell

witch

A R G 1 ARG0 S O cast

spell

A R G ARG1 S O[S] try

AppO AppS AppO

witch

A R G 1 ARG0 S O cast

spell

A R G ARG1 S O[S] try

AppO AppS

slide-33
SLIDE 33
  • 1. Supertagging

19

E.g. Lewis et al. (2014) for CCG The witch tried … spell w0 w1 w2 wn P0 P1 P2 Pn … bidirectional LSTM probability distribution

  • ver graph lexicon

word embeddings

ARG0 A R G 1

S O[S]

try

train to predict

slide-34
SLIDE 34
  • 2. Dependency Model

20

Kiperwasser & Goldberg (2016) for syntactic dependencies The witch tried … spell w0 w1 w2 wn P2 1 → … bidirectional LSTM probability distribution over operations word embeddings train to predict

AppS

tried witch

slide-35
SLIDE 35

21

witch cast spell

ARG1 A R G

The witch cast a spell

witch

A R G 1 ARG0 cast S O

spell

AppO AppS

AMR Corpus Required training data

The witch cast a spell

slide-36
SLIDE 36

21

witch cast spell

ARG1 A R G

The witch cast a spell

witch

A R G 1 ARG0 cast S O

spell

AppO AppS

AMR Corpus Required training data

The witch cast a spell

witch

A R G 1 ARG0

cast S O

spell

Heuristics

  • Alignments
  • Attaching edges
  • Source names
  • Source annotations

The witch cast a spell

slide-37
SLIDE 37

21

witch cast spell

ARG1 A R G

The witch cast a spell

witch

A R G 1 ARG0 cast S O

spell

AppO AppS

AMR Corpus Required training data

The witch cast a spell

witch

A R G 1 ARG0

cast S O

spell

Heuristics

  • Alignments
  • Attaching edges
  • Source names
  • Source annotations

The witch cast a spell

determine

slide-38
SLIDE 38
  • 3. Decoding

22

Find the best well-typed dependency tree

slide-39
SLIDE 39
  • 3. Decoding

22

  • ill-typed trees do not evaluate to AMRs
  • ill-typed trees to not match our linguistic intuitions

Find the best well-typed dependency tree

slide-40
SLIDE 40
  • 3. Decoding

22

  • ill-typed trees do not evaluate to AMRs
  • ill-typed trees to not match our linguistic intuitions
  • Exact typed decoding is NP-hard
  • Untyped decoding: 74% of trees are ill-typed

Find the best well-typed dependency tree

slide-41
SLIDE 41
  • 3. Decoding

22

  • ill-typed trees do not evaluate to AMRs
  • ill-typed trees to not match our linguistic intuitions
  • Exact typed decoding is NP-hard
  • Untyped decoding: 74% of trees are ill-typed

➡ Approximate decoders

Find the best well-typed dependency tree

slide-42
SLIDE 42

Approximate decoders

23

A: Fixed tree

The witch cast a spell

The witch cast a spell

witch

A R G 1 ARG0 cast S O

spell

AppO AppS

slide-43
SLIDE 43

Approximate decoders

23

  • 1. Fix unlabeled tree

A: Fixed tree

The witch cast a spell

The witch cast a spell

witch

A R G 1 ARG0 cast S O

spell

AppO AppS

  • 2. Label tree, with type checking
slide-44
SLIDE 44

Approximate decoders

23

  • 1. Fix unlabeled tree

A: Fixed tree

The witch cast a spell

The witch cast a spell

witch

A R G 1 ARG0 cast S O

spell

AppO AppS

  • 2. Label tree, with type checking

B: Projective: can only combine adjacent constituents "CKY parsing with types as nonterminals"

slide-45
SLIDE 45

Results

24

slide-46
SLIDE 46

25

Classic AMR parser (graph decoder)

witch cast spell try witch cast spell

ARG0 ARG1 A R G ARG1

try

Step 1: Predict nodes Step 2: Predict edges

slide-47
SLIDE 47

Results

26

Model Method Smatch score JAMR (Flanigan et al. 2016) graph decoder 67 Foland & Martin 2017 graph decoder 70.7 van Noord & Bos 2017 neural seq2seq 68.5 Lyu & Titov (ACL 2018) graph decoder 73.7 Our baseline graph decoder 66.1 Our projective decoder 70.2 Our fixed tree decoder 70.2

Dataset: LDC2015E86

slide-48
SLIDE 48

Conclusion

27

  • We built a competitive compositional AMR parser
  • Clear avenues to improvement
  • Update to recent advancements in training regimen (e.g. Lyu &

Tivov 2018)

  • Look into specific phenomena, e.g.
  • anaphora
  • ellipsis
  • Future work: extend method to other formalisms
slide-49
SLIDE 49

Conclusion

27

  • We built a competitive compositional AMR parser
  • Clear avenues to improvement
  • Update to recent advancements in training regimen (e.g. Lyu &

Tivov 2018)

  • Look into specific phenomena, e.g.
  • anaphora
  • ellipsis
  • Future work: extend method to other formalisms

we thank you

ARG1 ARG0