TTS and Data Selection: Improving Systems for Low-Resource Languages - - PowerPoint PPT Presentation

tts and data selection improving systems for low resource
SMART_READER_LITE
LIVE PREVIEW

TTS and Data Selection: Improving Systems for Low-Resource Languages - - PowerPoint PPT Presentation

TTS and Data Selection: Improving Systems for Low-Resource Languages Chevy Levitan, DREU 2015 outline I. Project II. Approach III. Methods IV. Status V. Future I. Project synthesize natural, intelligible voices for low resource languages


slide-1
SLIDE 1

TTS and Data Selection: Improving Systems for Low-Resource Languages

Chevy Levitan, DREU 2015

slide-2
SLIDE 2
  • utline
  • I. Project
  • II. Approach
  • III. Methods
  • IV. Status
  • V. Future
slide-3
SLIDE 3

I.

Project

synthesize natural, intelligible voices for low resource languages using data selection

slide-4
SLIDE 4

motivation

▷ bridge the gap

slide-5
SLIDE 5

motivation

▷ bridge the gap ▷ allow for cross-language communication

slide-6
SLIDE 6

why data selection?

slide-7
SLIDE 7

HRLs vs. LRLs

★ prepared data ★ abundance of training material high quality speech systems ★ found data ★ limited training material low quality speech systems

slide-8
SLIDE 8
  • A. filter out unwanted data from

training set

slide-9
SLIDE 9
  • A. filter out unwanted data from

training set

  • B. supplement limited LRL data with

choice data from similar HRL

slide-10
SLIDE 10

II.

APPROACH

preparing the experiment

slide-11
SLIDE 11

▷ Boston Radio News Corpus ▷ pre-processed ▷ English

corpus

slide-12
SLIDE 12

data selection process

extract features sort values create subsets synthesize data

slide-13
SLIDE 13

evaluate.

slide-14
SLIDE 14

evaluate.

compare/contrast voices

slide-15
SLIDE 15

example

VOICE 1 VOICE 2

slide-16
SLIDE 16

solution

  • 1. subset data
  • 2. complete dataset
slide-17
SLIDE 17

III.

METHODS

testing our hypothesis

slide-18
SLIDE 18

★ follow standard procedures for evaluating TTS voices

standards

slide-19
SLIDE 19

★ follow standard procedures for evaluating TTS voices ★ successful voice = intelligible + natural

standards

slide-20
SLIDE 20

★ follow standard procedures for evaluating TTS voices ★ successful voice = intelligible + natural ★ use crowdsourcing for unbiased results

standards

slide-21
SLIDE 21

Intelligibility

➔ transcribe nonsense sentences ➔ accurate transcription = intelligible voice

mechanical turk

slide-22
SLIDE 22

Intelligibility

➔ transcribe nonsense sentences ➔ accurate transcription = intelligible voice

mechanical turk

Naturalness

➔ use Likert scale to rate voices from very unnatural to very natural ➔ identify the voices are categorized as natural+

slide-23
SLIDE 23
slide-24
SLIDE 24

IV.

STATUS

  • ur current state
slide-25
SLIDE 25

✓ create subsets

intelligibility HIT

slide-26
SLIDE 26

✓ create subsets ✓ synthesize voices with this data

intelligibility HIT

slide-27
SLIDE 27

✓ create subsets ✓ synthesize voices with this data ✓ design and implement HIT

intelligibility HIT

slide-28
SLIDE 28

✓ create subsets ✓ synthesize voices with this data ✓ design and implement HIT ✓ publish on MTurk site

intelligibility HIT

slide-29
SLIDE 29

✓ create subsets ✓ synthesize voices with this data ✓ design and implement HIT ✓ publish on MTurk site ✓ workers complete HITs

intelligibility HIT

slide-30
SLIDE 30

✓ created subsets ✓ synthesized voices with this data ✓ design and implement HIT ✓ publish on MTurk site ✓ workers complete HITs ✓ accept/reject work

intelligibility HIT

slide-31
SLIDE 31

✓ create subsets

naturalness HIT

slide-32
SLIDE 32

✓ create subsets ✓ synthesize voices with this data

naturalness HIT

slide-33
SLIDE 33

✓ create subsets ✓ synthesize voices with this data ✓ design and implement HIT

naturalness HIT

slide-34
SLIDE 34

✓ create subsets ✓ synthesize voices with this data ✓ design and implement HIT

  • publish on MTurk site
  • workers complete HITs
  • accept/reject work

naturalness HIT

slide-35
SLIDE 35

V.

FUTURE

further exploration of this research

slide-36
SLIDE 36

evaluation

analyze mechanical turk responses

slide-37
SLIDE 37

evaluation

analyze mechanical turk responses

low-resource

implement data selection for LRLs

slide-38
SLIDE 38

evaluation

analyze mechanical turk responses

text

apply similar methods to automatically select text data

low-resource

implement data selection for LRLs

slide-39
SLIDE 39

Thanks!

Any questions?