Weighted Finite State Transducer (WFST) Efficient algorithms for - - PDF document

weighted finite state transducer wfst
SMART_READER_LITE
LIVE PREVIEW

Weighted Finite State Transducer (WFST) Efficient algorithms for - - PDF document

Prepared by Prof. Hui Jiang 12-11-21 (CSE6328) Weighted Finite State Transducer (WFST) Efficient algorithms for various operations. Weights Handle uncertainty in text, handwritten text, speech, image, biological sequences.


slide-1
SLIDE 1

Prepared by Prof. Hui Jiang (CSE6328) 12-11-21

  • Dept. of CSE, York Univ.

1

Weighted Finite State Transducer (WFST)

· Efficient algorithms for various operations. · Weights – Handle uncertainty in text, handwritten text, speech, image, biological sequences. · Applications: – Text: pattern-matching, indexation, compression. – Speech: speech recognition, speech synthesis. – Image: image compression, filters.

Weighted Finite State Transducer (WFST)

· Transducers: · Automata/Acceptors

slide-2
SLIDE 2

Prepared by Prof. Hui Jiang (CSE6328) 12-11-21

  • Dept. of CSE, York Univ.

2

WFST Definition (I)

· A path π: a sequence of transitions. – Original and destination states – Input and output labels · A semiring ≡ a ring without negation – Number set K. – Sum and Product . · Semiring examples: – Probability semiring: R, +, X. – Tropical semiring: R, min, +.

⊕ ⊗

WFST Definition (II)

· General Definitions – Alphabets: input Σ, output Δ – States: Q, initial I, final F. – Transitions: E → Q * (Σ U є) * (Δ U є) * K * Q – Initial/Final weights: λ = I → K, ρ = F → K · WFST T = (Σ, Q, I, F, E, λ, ρ):

[ ]

. and all for ]) [ ( ] [ ]) [ ( ) , (

* * ) , , , (

Δ ∈ Σ ∈ ⊗ ⊗ ⊕ =

y x n w p y x T

F y x I P

π ρ π π λ

π

slide-3
SLIDE 3

Prepared by Prof. Hui Jiang (CSE6328) 12-11-21

  • Dept. of CSE, York Univ.

3

WFST Operations

· Sum · Product · Closure · Reversal · Composition · Determinization · Weight pushing · Minimization

WFST Sum

· Sum: [

] [ ] [ ]

) , ( ) , ( ) , (

2 1 2 1

y x T y x T y x T T ⊕ = ⊕

slide-4
SLIDE 4

Prepared by Prof. Hui Jiang (CSE6328) 12-11-21

  • Dept. of CSE, York Univ.

4

WFST Product

· Product: [

] [ ] [ ]

) , ( ) , ( ) , (

2 2 2 1 1 1 , 2 1

2 1 2 1

y x T y x T y x T T

y y y x x x

⊗ ⊕ = ⊗

= =

WFST Closure

· Closure: [ ]

[ ]

) , ( ) , (

*

y x T y x T

n n ∞ =

⊕ =

slide-5
SLIDE 5

Prepared by Prof. Hui Jiang (CSE6328) 12-11-21

  • Dept. of CSE, York Univ.

5

WFST Reversal

· Reversal: [ ]

[ ]

) ~ , ~ ( ) , ( ~ y x T y x T =

WFST Composition

· Composition: [

] [ ]

) , ]( [ ) , ( ) , (

2 1 2 1

y z T z x T y x T T

z

⊗ ⊕ = 

slide-6
SLIDE 6

Prepared by Prof. Hui Jiang (CSE6328) 12-11-21

  • Dept. of CSE, York Univ.

6

WFST Composition Algorithm

WFST Determinization

· Deterministic WFST: no common input label for all outgoing transitions from any state. · Determinimization: determinizable WFST  deterministic W.

slide-7
SLIDE 7

Prepared by Prof. Hui Jiang (CSE6328) 12-11-21

  • Dept. of CSE, York Univ.

7

WFST Determinization Algorithm

WFST Weights Pushing

· Weight pushing: re-distribute all weights along paths.

slide-8
SLIDE 8

Prepared by Prof. Hui Jiang (CSE6328) 12-11-21

  • Dept. of CSE, York Univ.

8

WFST Minimization

· Minimize number of states and transitions of a deterministic WFST.

WFST Operations

· Composition: C = A ○ B · Determinization: D = det(C) – deterministic automaton: every state has at most

  • ne out-going transition with any given label.

· Re-weighting (Weight pushing): E = push(D) · Minimization: F = min(E)

slide-9
SLIDE 9

Prepared by Prof. Hui Jiang (CSE6328) 12-11-21

  • Dept. of CSE, York Univ.

9

WFST Operations: Examples

det push min

WFST Applications

· String search/match · String conversion/ language normalization · Representing Language models and probabilistic grammar · Sentence generation

slide-10
SLIDE 10

Prepared by Prof. Hui Jiang (CSE6328) 12-11-21

  • Dept. of CSE, York Univ.

10

Example I: keyword detection Example I: keyword detection: tabular search

slide-11
SLIDE 11

Prepared by Prof. Hui Jiang (CSE6328) 12-11-21

  • Dept. of CSE, York Univ.

11

Example I: keyword detection: Automata Search

Example I: keyword detection: Deterministic Search

slide-12
SLIDE 12

Prepared by Prof. Hui Jiang (CSE6328) 12-11-21

  • Dept. of CSE, York Univ.

12

Example I: keyword detection: Minimal Deterministic Search Example II: Context-dependent Phones

· Monophone vs. Triphone · Sentence: How do they turn out later ? · Monophones: h aw d uh dh eh t er n aw t l ai t er · Triphones: <s>-h+aw h-aw+d aw-d+uh d-uh+dh uh-dh+eh … · WFST: mapping context-independent monophones to context-dependent triphones

slide-13
SLIDE 13

Prepared by Prof. Hui Jiang (CSE6328) 12-11-21

  • Dept. of CSE, York Univ.

13

Example II: Context-dependent Phones

· A simple example with only two symbols x,y:

Example III: Representing Language model

· Representing language models as WFST · Representing HMMs as WFST · Representing overall grammar as WFST · Come back later …