Removing Nuisance Variables from Acoustic Word Embeddings
Lisa van Staden
1 Low-Resource Speech and Language Processing
Popular methods for speech processing rely on transcribed speech.
Obtaining transcriptions is expensive and not always possible.
2 Tasks in Low-Resource Speech and Language (LSL) Processing
We don't always need to predict text labels:
• Query-by-Example Search: search speech using speech.
• Such tasks only need speech segments to be compared with each other.
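To make the comparison step concrete, here is a minimal sketch of query-by-example search over fixed-dimensional embeddings. It assumes each speech segment has already been mapped to a vector; the toy vectors, the segment names, and the choice of cosine distance are illustrative assumptions, not details taken from the slides.

```python
import math

def cosine_distance(a, b):
    """Cosine distance between two equal-length, non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / (norm_a * norm_b)

def query_by_example(query, segments):
    """Rank the search segments by distance to the query, closest first."""
    scored = [(cosine_distance(query, emb), name) for name, emb in segments.items()]
    return [name for _, name in sorted(scored)]

# Hypothetical toy embeddings; real ones would come from an embedding model.
query = [1.0, 0.0, 1.0]
segments = {
    "utt1_seg3": [0.9, 0.1, 1.1],
    "utt2_seg1": [-1.0, 0.5, 0.0],
    "utt3_seg7": [0.0, 1.0, 0.0],
}
ranking = query_by_example(query, segments)
```

No text label is predicted at any point: the search only needs a distance between the spoken query and each candidate segment.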
3 Acoustic Word Embeddings
We want to map speech to these representations without using labels.
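As a rough illustration of a label-free mapping, the sketch below uses a simple downsampling baseline: it keeps a fixed number of evenly spaced feature frames from a segment and concatenates them, so segments of any duration end up with the same dimensionality. This baseline is an assumption chosen for illustration and is not necessarily one of the models in the presentation; the function name `downsample_embedding` and the toy frames are hypothetical.

```python
def downsample_embedding(frames, n_samples=3):
    """Map a variable-length list of feature frames to a fixed-length vector
    by keeping n_samples evenly spaced frames and concatenating them."""
    if not frames:
        raise ValueError("need at least one frame")
    last = len(frames) - 1
    if n_samples == 1:
        indices = [last // 2]
    else:
        indices = [round(i * last / (n_samples - 1)) for i in range(n_samples)]
    embedding = []
    for idx in indices:
        embedding.extend(frames[idx])
    return embedding

# Hypothetical 2-dimensional feature frames for a short and a long segment.
short_segment = [[float(t), float(-t)] for t in range(5)]
long_segment = [[float(t), float(-t)] for t in range(9)]

short_emb = downsample_embedding(short_segment)
long_emb = downsample_embedding(long_segment)
```

Both segments map to vectors of length 6 (3 frames of 2 features each), so they can be compared directly with a vector distance.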
4 Nuisance Variables: Speaker and Sex
Acoustic properties of speech from different speakers/sexes differ.
We want embeddings to be robust.
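One very simple way to picture the problem is per-speaker mean normalization: subtract each speaker's average embedding so that a constant speaker-specific offset disappears. This is only an illustrative sketch under strong assumptions (the speaker effect is additive, and speaker identities are known); it is not presented as the method used in this work. The function name and toy data are hypothetical.

```python
def remove_speaker_means(embeddings, speakers):
    """Subtract each speaker's mean embedding from that speaker's embeddings."""
    dim = len(embeddings[0])
    sums, counts = {}, {}
    for emb, spk in zip(embeddings, speakers):
        total = sums.setdefault(spk, [0.0] * dim)
        for i, value in enumerate(emb):
            total[i] += value
        counts[spk] = counts.get(spk, 0) + 1
    means = {spk: [s / counts[spk] for s in total] for spk, total in sums.items()}
    return [
        [v - m for v, m in zip(emb, means[spk])]
        for emb, spk in zip(embeddings, speakers)
    ]

# Hypothetical toy data: speakers A and B say the same two words,
# but B's embeddings are shifted by a constant offset of 10.
embeddings = [[1.0, 0.0], [0.0, 1.0], [11.0, 10.0], [10.0, 11.0]]
speakers = ["A", "A", "B", "B"]
normalized = remove_speaker_means(embeddings, speakers)
```

After normalization, the two speakers' embeddings for the same word coincide in this toy case. Real speaker and sex effects are not a simple constant shift, which is why a more robust approach is of interest.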
5 Current Models
6 What's Next
• Improved models: disentanglement with adversarial training.
• Using embeddings in downstream tasks.
• Investigating the phonetic information in embeddings.
• Links to language acquisition.