Speaker and Emotion Recognition of TV-Series Data Using Multimodal and Multitask Deep Learning
Sashi Novitasari1, Quoc Truong Do1, Sakriani Sakti1,3, Dessi Lestari2, Satoshi Nakamura1,3
1 Graduate School of Information Science, Nara Institute of Science and Technology
2 Department of Informatics, Bandung Institute of Technology
3 RIKEN AIP
1{sashi.novitasari.si3, do.truong.dj3, ssakti, s-nakamura}@is.naist.jp
2{dessipuji}@informatika.org