IBM Research
The IBM 2016 Speaker Recognition System
Seyed Omid Sadjadi, Sriram Ganapathy, Jason Pelecanos
The IBM 2016 Speaker Recognition System Seyed Omid Sadjadi, Sriram - - PowerPoint PPT Presentation
IBM Research The IBM 2016 Speaker Recognition System Seyed Omid Sadjadi, Sriram Ganapathy, Jason Pelecanos IBM Research Outline Introduction Speaker Recognition System Experimental Setup Results Conclusions 2 IBM Research
IBM Research
Seyed Omid Sadjadi, Sriram Ganapathy, Jason Pelecanos
IBM Research
2
IBM Research
3
IBM Research
4
[Heck 1998; Richardson 2015; Matějka 2016 ]
IBM Research
5
IBM Research
6
IBM Research
7
i-vector Extraction Acoustic Feats. Speech SAD Suff. Stats Dim. Reduc. T matrix LDA/NDA Score fMLLR PLDA
IBM Research
8
Speech i-vector Extraction Acoustic Feats. SAD Suff. Stats Dim. Reduc. T matrix LDA/NDA Score fMLLR PLDA
IBM Research
9
Posteriors Senones (10k) B-W Statistics
i-vector Extraction Acoustic Feats. Speech SAD Dim. Reduc. T matrix LDA/NDA Score fMLLR PLDA Suff. Stats
IBM Research
10
i-vector Extraction Acoustic Feats. Speech SAD Suff. Stats Dim. Reduc. T matrix LDA/NDA Score fMLLR PLDA
IBM Research Class 1 Class 2
global class means local k-NN means LDA NDA emphasize samples near boundary
1 1 1
i
N C C T ij i ij i ij b l l l l l i j l j i
= = = ≠
( ) ( )
( ) ( )
min , ( , ) , , ( , ) , ( , ) , ( , )
i i i i l K l l K l ij l i i i i l K l l K l
d NN i d NN j w d NN i d NN j
α α α α
= + x x x x x x x x
1 C T b i i i i
=
11
IBM Research
12
IBM Research
13
Cond. Enroll Test Mismatch #Targets #Impostors C1
No 4,034 795,995 C2
Yes 15,084 2,789,534 C3
Telephony Yes 3,989 637,850 C4
Room microphone Yes 3,637 756,775 C5 Telephony Telephony (different type) Yes 7,169 408,950
IBM Research
14
IBM Research
15
IBM Research
16
IBM Research
17
System EER [%] minDCF08 minDCF10 GMM-MFCC-LDA 2.40 0.12 0.439 GMM-MFCC-NDA 1.55 0.076 0.286 DNN-MFCC-LDA 1.02 0.045 0.168 DNN-MFCC-NDA 0.76 0.036 0.147
IBM Research
18
System EER [%] minDCF08 minDCF10 DNN-MFCC-LDA 1.02 0.045 0.168 DNN-fMLLR-LDA 0.82 0.032 0.120 DNN-MFCC-NDA 0.76 0.036 0.147 DNN-fMLLR-NDA 0.67 0.028 0.092
IBM Research
19
System #Senones EER [%] minDCF08 minDCF10 DNN-LDA 2k 1.19 0.054 0.212 DNN-NDA 0.95 0.043 0.166 DNN-LDA 4k 0.98 0.041 0.169 DNN-NDA 0.86 0.033 0.116 DNN-LDA 10k 0.82 0.032 0.120 DNN-NDA 0.67 0.028 0.092
IBM Research
20
IBM Research
21
System EER [%] minDCF08 minDCF10 GMM-MFCC-LDA 2.40 0.120 0.439 1.55 0.076 0.286 0.76 0.036 0.147 0.67 0.028 0.092
IBM Research
22
IBM Research
23