Engineering ............ design of the physical ? ....early 90s - - PowerPoint PPT Presentation



slide-1
SLIDE 1

Engineering ............ design of the physical …?

slide-2
SLIDE 2

....early 90s

slide-3
SLIDE 3
slide-4
SLIDE 4
slide-5
SLIDE 5

What do we live for? Is technology really for the greater good? Am I going to spend my life with machines?

slide-6
SLIDE 6
slide-7
SLIDE 7

Inspiring moments.......... 1997

slide-8
SLIDE 8

I am going to build my blind friends

an automatic reading machine!

....but what is a reading machine? What should it look like?

slide-9
SLIDE 9

Text-to-Speech Synthesis

Was not available for Turkish in 1997

slide-10
SLIDE 10

Intention -> attention, tension (stretch out) ........ learning and growth

slide-11
SLIDE 11

How to load sound into the computer

text processing

Synthesize speech signals out of thin air

slide-12
SLIDE 12

How to load sound into the computer? One obstacle at a time, and I can tackle them all!

slide-13
SLIDE 13
slide-14
SLIDE 14
slide-15
SLIDE 15

?

double ... int ... String ....

slide-16
SLIDE 16

?

double ... int ... String ....

...taking regularly spaced samples: SAMPLING

How should I represent these samples in my computer? How can we represent a real number (for example π)?
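As a rough illustration of sampling, the sketch below evaluates a sine tone at regularly spaced instants and stores the values in a `double[]`. The class name, the method name `sampleSine`, and the choice of a sine tone are assumptions made for this example; the slides do not show the code that was actually used. Note that a `double` holds only a rounded approximation of a real number such as π.

```java
class SamplingSketch {
    // Hypothetical helper: take numSamples regularly spaced samples of a sine tone.
    static double[] sampleSine(double frequencyHz, int sampleRateHz, int numSamples) {
        double[] samples = new double[numSamples];
        for (int n = 0; n < numSamples; n++) {
            // Time of the n-th sample in seconds: samples are 1/sampleRateHz apart.
            double t = (double) n / sampleRateHz;
            samples[n] = Math.sin(2.0 * Math.PI * frequencyHz * t);
        }
        return samples;
    }
}
```

For example, `SamplingSketch.sampleSine(1000.0, 8000, 8)` yields eight samples that span exactly one period of a 1000 Hz tone.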

slide-17
SLIDE 17

We have to round because we have limited space ...but we have so many of them!

QUANTIZATION
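One common way to do this rounding is to map each sample to a 16-bit integer. The sketch below assumes samples lie in the range -1.0 to 1.0 and clips anything outside it; the 16-bit width and the name `quantize16` are assumptions for illustration, not details given in the slides.

```java
class QuantizationSketch {
    // Hypothetical helper: round a sample in [-1.0, 1.0] to the nearest 16-bit level.
    static short quantize16(double sample) {
        // Clip out-of-range values so the cast below cannot overflow.
        double clipped = Math.max(-1.0, Math.min(1.0, sample));
        // Scale to the symmetric range -32767..32767 and round to the nearest integer.
        return (short) Math.round(clipped * 32767.0);
    }
}
```

Each sample then takes two bytes instead of the eight used by a `double`, at the cost of a small rounding error per sample.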

slide-18
SLIDE 18

Theory of sound production

slide-19
SLIDE 19

Ready for the next problem?

slide-20
SLIDE 20

println("Rock" + "Star");

How do we concatenate sound pieces?

slide-21
SLIDE 21

Concatenative synthesis

Record sound pieces and put them in files
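Just as `+` joins two strings, joining sound pieces can be pictured as copying sample arrays end to end. The sketch below assumes the pieces have already been read from their files into `short[]` arrays; the class and method names are hypothetical, and a real concatenator would also smooth the joins.

```java
class ConcatSketch {
    // Hypothetical helper: join any number of sample arrays into one array.
    static short[] concatenate(short[]... pieces) {
        int total = 0;
        for (short[] piece : pieces) {
            total += piece.length;
        }
        short[] out = new short[total];
        int pos = 0;
        for (short[] piece : pieces) {
            // Copy this piece right after the previous one.
            System.arraycopy(piece, 0, out, pos, piece.length);
            pos += piece.length;
        }
        return out;
    }
}
```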

slide-22
SLIDE 22

Modifying speech signals: array processing

Duration modification

Pitch modification
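To give a feel for this kind of array processing, the sketch below stretches or shrinks a segment to a new length by linear interpolation. This is a deliberately naive stand-in, not the method used in the talk: played back at the same sample rate, it changes duration and pitch together, whereas speech synthesizers typically rely on overlap-add style techniques to change one without the other. The names are hypothetical.

```java
class DurationSketch {
    // Hypothetical helper: resample a segment to newLength samples by linear interpolation.
    static double[] resample(double[] in, int newLength) {
        double[] out = new double[newLength];
        if (in.length == 0 || newLength == 0) {
            return out;
        }
        if (in.length == 1 || newLength == 1) {
            // Nothing to interpolate between: repeat the first sample.
            java.util.Arrays.fill(out, in[0]);
            return out;
        }
        // Distance, in input samples, between consecutive output samples.
        double step = (double) (in.length - 1) / (newLength - 1);
        for (int i = 0; i < newLength; i++) {
            double pos = i * step;
            int left = (int) Math.floor(pos);
            int right = Math.min(left + 1, in.length - 1);
            double frac = pos - left;
            out[i] = (1.0 - frac) * in[left] + frac * in[right];
        }
        return out;
    }
}
```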

slide-23
SLIDE 23


Non-uniform units based synthesis

[Figure: units "ba", "ar", "ış" taken from the database to build the target]
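The idea of covering a target with units of varying length can be sketched as a greedy longest-match search over strings. This is only an illustration under simplifying assumptions: units are plain, non-overlapping letter strings, the database is a set, and the longest match always wins. A real selector would work on phonetic units and weigh how well neighbouring units join. All names below are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Set;

class UnitSelectorSketch {
    // Hypothetical helper: cover the target left to right with the longest units available.
    static List<String> selectUnits(String target, Set<String> database) {
        List<String> chosen = new ArrayList<>();
        int pos = 0;
        while (pos < target.length()) {
            String best = null;
            // Try the longest candidate first, then shorter ones.
            for (int end = target.length(); end > pos; end--) {
                String candidate = target.substring(pos, end);
                if (database.contains(candidate)) {
                    best = candidate;
                    break;
                }
            }
            if (best == null) {
                throw new IllegalArgumentException("No unit covers position " + pos);
            }
            chosen.add(best);
            pos += best.length();
        }
        return chosen;
    }
}
```

With a database containing "mer", "me", "ha", "ba" and "haba", the target "merhaba" is covered by the two units "mer" and "haba". A greedy search can fail on inputs where a different split would succeed, which is one reason practical selectors use a proper search.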

slide-24
SLIDE 24

So, I knew what to do:
- Designing a representative text for Turkish
- Selecting a few hundred sentences, out of 2 million, that cover a large variety of language units
- Recording these sentences
- Segmenting these sentences
- Writing text processing tools
  private String readNumber(int val)
  Year 2000 → yi@rtutaus@nd
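A text processing tool such as `readNumber` turns digits into words before they are converted to sounds. The sketch below is one possible shape for it, not the original code: it uses English word tables to mirror the "Year 2000" example, supports only 0 to 9999, and is declared `static` so it can be called without an enclosing object. The final step from words to phonetic symbols is left out.

```java
class NumberReaderSketch {
    // Word tables are an assumption for this example; another language would use its own.
    private static final String[] ONES = {
        "zero", "one", "two", "three", "four", "five", "six", "seven", "eight", "nine",
        "ten", "eleven", "twelve", "thirteen", "fourteen", "fifteen", "sixteen",
        "seventeen", "eighteen", "nineteen"
    };
    private static final String[] TENS = {
        "", "", "twenty", "thirty", "forty", "fifty", "sixty", "seventy", "eighty", "ninety"
    };

    // Spell out val in words; only 0..9999 is handled in this sketch.
    static String readNumber(int val) {
        if (val < 0 || val > 9999) {
            throw new IllegalArgumentException("supported range is 0..9999");
        }
        if (val < 20) {
            return ONES[val];
        }
        if (val < 100) {
            String tens = TENS[val / 10];
            return val % 10 == 0 ? tens : tens + " " + ONES[val % 10];
        }
        if (val < 1000) {
            String hundreds = ONES[val / 100] + " hundred";
            return val % 100 == 0 ? hundreds : hundreds + " " + readNumber(val % 100);
        }
        String thousands = ONES[val / 1000] + " thousand";
        return val % 1000 == 0 ? thousands : thousands + " " + readNumber(val % 1000);
    }
}
```

Here `NumberReaderSketch.readNumber(2000)` returns `two thousand`, and `readNumber(1997)` returns `one thousand nine hundred ninety seven`.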

slide-25
SLIDE 25

So, I knew what to do:

- Writing a concatenator:
  - reading sound files,
  - modifying sound segments,
  - joining sound segments
- Writing an automatic selector

Finally!! (....year 2000) ....first free Turkish TTS released: it sounded robotic but served the need ....became part of screen reader software for blind users

slide-26
SLIDE 26

Let's hear it! .....freely available for download

Thierry Dutoit

slide-27
SLIDE 27

Turkish database

_ 100
a 400 0 100 50 150 100 90
j 200
_ 500 0 200
l 100 0 100
a 500
v 100 0 200
_ 500
k 100 0 100
e 100
R 100 0 150
@ 100
l 100
_ 100 0 80

slide-28
SLIDE 28

German database

_ 100
a 400 0 100 50 150 100 90
j 200
_ 500 0 200
l 100 0 100
a 500
v 100 0 200
_ 500
k 100 0 100
E: 100
r 100 0 150
@ 100
l 100
_ 100 0 80

slide-29
SLIDE 29

French database

_ 100
a 400 0 100 50 150 100 90
j 200
_ 500 0 200
l 100 0 100
a 500
v 100 0 200
_ 500
k 100 0 100
e 100
R 100 0 150
@ 100
l 100
_ 100 0 80
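The listings on these slides can be read as a sequence of entries: a phoneme symbol, a duration, and then optional pairs of numbers. The sketch below parses them on the assumption that the duration is in milliseconds and that each pair is a (position in percent, pitch in Hz) point, as in MBROLA-style phoneme input. That reading of the columns is an assumption, since the slides do not spell it out, and the parser would misread phoneme symbols that consist only of digits. All names are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

class PhoSketch {
    // One entry: a phoneme symbol, its duration, and optional (position, pitch) pairs.
    static class Entry {
        final String phoneme;
        final int durationMs;
        final List<int[]> pitchPoints = new ArrayList<>();

        Entry(String phoneme, int durationMs) {
            this.phoneme = phoneme;
            this.durationMs = durationMs;
        }
    }

    static boolean isNumber(String s) {
        return s.matches("\\d+");
    }

    // Parse whitespace-separated tokens; any non-numeric token starts a new entry.
    static List<Entry> parse(String text) {
        List<Entry> entries = new ArrayList<>();
        String trimmed = text.trim();
        if (trimmed.isEmpty()) {
            return entries;
        }
        String[] tokens = trimmed.split("\\s+");
        int i = 0;
        while (i < tokens.length) {
            if (i + 1 >= tokens.length || !isNumber(tokens[i + 1])) {
                throw new IllegalArgumentException("Missing duration after " + tokens[i]);
            }
            Entry entry = new Entry(tokens[i], Integer.parseInt(tokens[i + 1]));
            i += 2;
            // Consume numeric pairs until the next phoneme symbol appears.
            while (i + 1 < tokens.length && isNumber(tokens[i]) && isNumber(tokens[i + 1])) {
                entry.pitchPoints.add(new int[] {
                    Integer.parseInt(tokens[i]), Integer.parseInt(tokens[i + 1])
                });
                i += 2;
            }
            entries.add(entry);
        }
        return entries;
    }
}
```

Because the three listings differ only in a couple of phoneme symbols, the same parser handles the Turkish, German, and French versions.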

slide-30
SLIDE 30
slide-31
SLIDE 31

What do we live for? How can we make a change for the greater good?

CS-Bridge