Speaker Change Detection using Siamese Networks
- Siamese layers share their weights
- Classifier is trained using binary cross-entropy
- Input features are perceptual linear prediction (PLP) coefficients
[Figure: Siamese architecture. Acoustic data for the left segment and the right segment each pass through a BLSTM, producing a left embedding and a right embedding; a classifier takes both embeddings and outputs Same/Different.]
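The weight sharing and the binary cross-entropy objective can be illustrated with a minimal sketch. This is not the model from the slide: the BLSTM is replaced by a much simpler stand-in encoder (mean pooling followed by a tanh layer) so the example runs with the standard library only. The function names, the 13-dimensional feature size, the 4-dimensional embedding, the absolute-difference classifier, and the convention that label 1 means "same speaker" are all assumptions made for illustration.

```python
import math
import random


def encode(frames, weights):
    """Stand-in encoder (assumption): mean-pool the frames, then a tanh layer.

    The slide uses a BLSTM here; this simpler encoder only illustrates
    that one set of weights serves both branches.
    """
    dim = len(frames[0])
    mean = [sum(f[d] for f in frames) / len(frames) for d in range(dim)]
    return [math.tanh(sum(w * x for w, x in zip(row, mean))) for row in weights]


def same_speaker_probability(left, right, enc_weights, clf_weights, clf_bias):
    # Siamese sharing: both segments are encoded with the SAME weights.
    left_emb = encode(left, enc_weights)
    right_emb = encode(right, enc_weights)
    # Hypothetical classifier: logistic regression on |left - right|.
    diff = [abs(a - b) for a, b in zip(left_emb, right_emb)]
    logit = sum(w * d for w, d in zip(clf_weights, diff)) + clf_bias
    return 1.0 / (1.0 + math.exp(-logit))


def binary_cross_entropy(p, label, eps=1e-12):
    # label is 1 for "same speaker" and 0 for "different" (assumed convention).
    p = min(max(p, eps), 1.0 - eps)
    return -(label * math.log(p) + (1 - label) * math.log(1.0 - p))


rng = random.Random(0)
feat_dim, emb_dim = 13, 4  # assumed sizes, not taken from the slide

# Untrained, randomly initialised parameters.
enc_weights = [[rng.uniform(-0.5, 0.5) for _ in range(feat_dim)]
               for _ in range(emb_dim)]
clf_weights = [rng.uniform(-0.5, 0.5) for _ in range(emb_dim)]
clf_bias = 0.0

# Two synthetic segments of 20 frames each, standing in for PLP features.
left = [[rng.gauss(0.0, 1.0) for _ in range(feat_dim)] for _ in range(20)]
right = [[rng.gauss(1.0, 1.0) for _ in range(feat_dim)] for _ in range(20)]

prob = same_speaker_probability(left, right, enc_weights, clf_weights, clf_bias)
loss = binary_cross_entropy(prob, label=0)  # treat the pair as "different"
print(f"P(same) = {prob:.3f}, BCE loss = {loss:.3f}")
```

Because both branches call `encode` with the same `enc_weights`, any update to those weights during training would affect the left and right embeddings identically, which is the property the first bullet describes.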