Dialogue corpora NPFL070 December 11, 2019 (NPFL070) Dialogue - - PowerPoint PPT Presentation

dialogue corpora
SMART_READER_LITE
LIVE PREVIEW

Dialogue corpora NPFL070 December 11, 2019 (NPFL070) Dialogue - - PowerPoint PPT Presentation

Dialogue corpora NPFL070 December 11, 2019 (NPFL070) Dialogue corpora December 11, 2019 1 / 26 Outline 1 Intro 2 Task oriented 3 Chit-chat 4 QA (NPFL070) Dialogue corpora December 11, 2019 2 / 26 What is dialogue Sample conversation


slide-1
SLIDE 1

Dialogue corpora

NPFL070

December 11, 2019

(NPFL070) Dialogue corpora December 11, 2019 1 / 26

slide-2
SLIDE 2

Outline

1 Intro 2 Task oriented 3 Chit-chat 4 QA

(NPFL070) Dialogue corpora December 11, 2019 2 / 26

slide-3
SLIDE 3

What is dialogue

Sample conversation

Hello, how may I help you? I am looking for a cheap restaurant in the city centre. There are over twenty cheap restaurants. Which cuisine do you prefer? I like chinese food. Golden palace is a cheap restaurant with good ratings. That sounds good, can I have an address and phone number please? ...

(NPFL070) Dialogue corpora December 11, 2019 3 / 26

slide-4
SLIDE 4

Dialogue tasks

What is the use case?

task-oriented dialogues ”chit-chat” Question Answering (QA)

Subtasks

Natural Language Understanding (NLU) Dialogue State Tracking Dialogue Policy Knowledge Base information retrieval Natural Language Generation (NLG) (ASR, TTS)

(NPFL070) Dialogue corpora December 11, 2019 4 / 26

slide-5
SLIDE 5

Typical architecture of dialogue systems

[credit: A Survey of Available Corpora for Building Data-Driven Dialogue Systems by Iulian Vlad Serban, Ryan Lowe, Peter Henderson, Laurent Charlin, Joelle Pineau]

(NPFL070) Dialogue corpora December 11, 2019 5 / 26

slide-6
SLIDE 6

Terminology

turn - one usr/system utterance slot - unit of semantic information, type=value intent - desired user goal action - system action

Example

I am looking for cheap chinese food. inform(pricerange=cheap), inform(food=chinese)

(NPFL070) Dialogue corpora December 11, 2019 6 / 26

slide-7
SLIDE 7

Evaluation

Intrinsic

NLU, State tracking - classification, i.e. accuracy, precision, recall Dialogue success - were all the requests fulfilled entity match rate - were relevant information provided? BLEU - NLG, end-to-end setups

Extrinsic

Human rating - experts, crowd platforms (can be problematic)

(NPFL070) Dialogue corpora December 11, 2019 7 / 26

slide-8
SLIDE 8

Dialogue dataset types

Modality: written, spoken, multimodal Collection process:

human-human

real/scripted

human-machine automatic (machine-machine)

domain

limited(closed) vs. open domain

(NPFL070) Dialogue corpora December 11, 2019 8 / 26

slide-9
SLIDE 9

Specific problems of dialogue data resources

the central problem: unlike vast majority of NLP tasks, dialogue management is hard to decompose into independent subtasks, as each turn in a real dialogue is extremely sensitive to the previous turn(s) as a consequence, a man-machine dialogue typically quickly diverges from an authentic dialogue the fact that a dialogue composes of a sequence of turns, each of them corresponding to a few natural language sentences (i.e., the branching factor is astronomic), implies a HUGE search space . . . . . . which is impossible to cover sufficiently by any authentic training data (some other NLP tasks such as machine translation also face huge search space, but dialogues are worse because of the sequential nature)

(NPFL070) Dialogue corpora December 11, 2019 9 / 26

slide-10
SLIDE 10

Collection process

Expert collection Good acoustic conditions, high level of control usually very costly, high quality Scripted or Wizard-of-Oz scheme

Participants still talk to the system (machine). The system is secretly controlled by another human. Desired because people behave differently when talking to machine

(NPFL070) Dialogue corpora December 11, 2019 10 / 26

slide-11
SLIDE 11

Collection process

Web crawling

fast, cheap difficult to organize prone to errors

  • ften not real dialogues (tweets and replies etc.)

Crowdsourcing

untrained workers employed through some kind of data collection platform Crowdflower, Amazon Mechanical Turk compromise in terms of cost and quality

(NPFL070) Dialogue corpora December 11, 2019 11 / 26

slide-12
SLIDE 12

Data labels

One typically needs some data labelling (for language understanding, policy decisions). audio transcriptions semantic annotation (intents), (named) entity labelling

  • ther: POS, hypotheses

experts, crowdsourcing, semi-automatic

Example

I want to fly from New York to San Francisco on Friday morning. request(from=NY,to=SF,date=Friday,time=morning) There are two airports in NYC, JFK and LaGuardia. Which one of them do you want to depart from? actions={ask airport(),inform multiple(JFK,LGA)}

(NPFL070) Dialogue corpora December 11, 2019 12 / 26

slide-13
SLIDE 13

chit-chat vs. task oriented

Task (goal) oriented systems have defined goals that should be accomplished (book a restaurant, find a flight connection, find a sightseeing place) The system’s task is to ask for the restrictions and user preferences and provide options. Usually there is a domain-specific ontology, i.e. a priori knowledge Chit-chat systems however don’t need to accomplish anything. The purpose is to mimic human behavior or keep the user entertained. Both can use knowledge bases, i.e. database of facts. There can be some overlap

(NPFL070) Dialogue corpora December 11, 2019 13 / 26

slide-14
SLIDE 14

DSTC 2 (3) (2013)

Dialogue State Tracking Challenge State = set of current slot values, possibly additional features human-computer, restaurant reservation system 3000+ dialogues DSTC 2 (2013) considered a benchmark for a long time Apart from state also turn-level annotations; language understanding = recognized slot values + intent included ASR hypotheses http://camdial.org/ mh521/dstc/

(NPFL070) Dialogue corpora December 11, 2019 14 / 26

slide-15
SLIDE 15

MultiWOZ (2018)

multi-domain, 10k+ dialogues in total state and actions annotations human-human; Wizard-of-Oz scheme http://dialogue.mi.eng.cam.ac.uk/index.php/corpus/ database included

(NPFL070) Dialogue corpora December 11, 2019 15 / 26

slide-16
SLIDE 16

DSTC 1, Let’s go

Let’s go - over 170k dialogues, transcribed DSTC1 subset of the corpus, state annotations public transport domain https://github.com/DialRC/LetsGoDataset

(NPFL070) Dialogue corpora December 11, 2019 16 / 26

slide-17
SLIDE 17

Maluuba Frames

1936 conversations collected in Wizard-of-Oz fashion Complex dialogues about flight and hotel reservations Frame tracking - generalized state tracking, considering more constraint values in parallel https://datasets.maluuba.com/Frames

(NPFL070) Dialogue corpora December 11, 2019 17 / 26

slide-18
SLIDE 18

KVRET

3031 dialogues in 3 domains car assistant and driver human-human interaction https://nlp.stanford.edu/blog/a-new-multi-turn-multi-domain- task-oriented-dialogue-dataset/

(NPFL070) Dialogue corpora December 11, 2019 18 / 26

slide-19
SLIDE 19

ATIS, DSTC6+

Air Travel information services Human-machine, 774 conversations Dialogue State Tracking Systems Technology challenge 2017 DSTC 6, 2018 DSTC 7, . . . Each year set of tracks & new dataset http://workshop.colips.org/dstc7/dstc8 proposals.html

(NPFL070) Dialogue corpora December 11, 2019 19 / 26

slide-20
SLIDE 20

Chit-chat: spoken corpora

Collected dialogues on various topics, usable also for speech recognition Switchboard (1992) - 300h, telephone speech http://groups.inf.ed.ac.uk/switchboard/ British National Corpus (1992) - 1000h, various sources http://www.natcorp.ox.ac.uk/ Ami Corpus (1997) - 100h, meeting records, good quality http://groups.inf.ed.ac.uk/ami/download/

(NPFL070) Dialogue corpora December 11, 2019 20 / 26

slide-21
SLIDE 21

Chit-chat: written corpora

Twitter customer support corpus

  • ver 3 million tweets & replies

https://www.kaggle.com/thoughtvector/customer-support-on- twitter

Ubuntu dialogue corpus

930k dialogues humans chatting about technical problems with Ubuntu operating system https://github.com/rkadlec/ubuntu-ranking-dataset-creator

(NPFL070) Dialogue corpora December 11, 2019 21 / 26

slide-22
SLIDE 22

Chit-chat: written corpora

Reddit all comments

1.7 billion comments on Reddit discussions https://www.reddit.com/r/datasets/comments/3bxlg7/ i have every publicly available reddit comment/

Movie dialog Dataset

3 million short dialogues on movie recommendations part of the bAbI project https://research.fb.com/downloads/babi/

OpenSubtitles

human-human scripted dialogues https://github.com/hongweizeng/Dialogue- Corpus/tree/master/openSubtitles

(NPFL070) Dialogue corpora December 11, 2019 22 / 26

slide-23
SLIDE 23

Natural Language Generation

Cambridge RNNLG

restaurants, hotels, laptop, TVs crowdsourced

E2E NLG data

restaurants (bigger) more complex partially based on images

(NPFL070) Dialogue corpora December 11, 2019 23 / 26

slide-24
SLIDE 24

Question Answering

knowledge retrieval text understanding, reasoning The ”dialogue” (conversation) aspect is not as important as providing the relevant facts and proving understanding.

(NPFL070) Dialogue corpora December 11, 2019 24 / 26

slide-25
SLIDE 25

Question answering

Facebook bAbI project https://research.fb.com/downloads/babi/

Sample

context: John gave a ball to Stephen. Stephen went to kitchen. Q: Where is the ball?

(NPFL070) Dialogue corpora December 11, 2019 25 / 26

slide-26
SLIDE 26

Question answering

WikiQA TREC challenges (last 2004) https://trec.nist.gov/data/qa.html Yahoo QA https://webscope.sandbox.yahoo.com/catalog.php?datatype=l

(NPFL070) Dialogue corpora December 11, 2019 26 / 26