Pretraining Sentiment Classifiers with Unlabeled Dialog Data
1
- Jul. 18, 2018
Toru Shimizu*1, Hayato Kobayashi*1,*2, Nobuyuki Shimizu*1
*1Yahoo Japan Corporation, *2RIKEN AIP
Pretraining Sentiment Classifiers with Unlabeled Dialog Data Jul. - - PowerPoint PPT Presentation
Pretraining Sentiment Classifiers with Unlabeled Dialog Data Jul. 18, 2018 Toru Shimizu *1 , Hayato Kobayashi *1,*2 , Nobuyuki Shimizu *1 *1 Yahoo Japan Corporation, *2 RIKEN AIP 1
1
*1Yahoo Japan Corporation, *2RIKEN AIP
56th Annual Meeting of the Association for Computational Linguistics, 15-20 July 2018, Melbourne
56th Annual Meeting of the Association for Computational Linguistics, 15-20 July 2018, Melbourne
– You will need at least 100k training records to surpass classical approaches (Hu+ 2014, Wu+ 2014) – Large-scale labeled datasets of document classification
15529
5B85225
029
56th Annual Meeting of the Association for Computational Linguistics, 15-20 July 2018, Melbourne
– Language model
LSTM-RNN
!/
LSTM-RNN
/
transfer
/!
56th Annual Meeting of the Association for Computational Linguistics, 15-20 July 2018, Melbourne
– Sequence autoencoder (Dai and Le 2015)
LSTM-RNN
!
transfer
LSTM-RNN LSTM-RNN
56th Annual Meeting of the Association for Computational Linguistics, 15-20 July 2018, Melbourne
– Pretrain an encoder-decoder model for sentiment classifiers
– Language model – Sequence autoencoder – Distant supervision with emoji and emoticons
– Costly labeled sentiment dataset of 99.5K items – Large-scale unlabeled dialog dataset of 22.3M utterance- response pairs
56th Annual Meeting of the Association for Computational Linguistics, 15-20 July 2018, Melbourne
7
, ,,! !' !, ,,(, )(
56th Annual Meeting of the Association for Computational Linguistics, 15-20 July 2018, Melbourne
LSTM-RNN LSTM-RNN
LSTM-RNN
'
transfer
56th Annual Meeting of the Association for Computational Linguistics, 15-20 July 2018, Melbourne
9
training validation test total Dialog data 22,300,000 10,000 50,000 22,360,000 training validation test total Sentiment data 80,591 4,000 15,000 99,591
56th Annual Meeting of the Association for Computational Linguistics, 15-20 July 2018, Melbourne
10
LSTM-RNN LSTM-RNN
'' !
dist. repr.
56th Annual Meeting of the Association for Computational Linguistics, 15-20 July 2018, Melbourne
embedding layer recurrent layer ht
enc
embedding layer recurrent layer ht
dec
readout layer
token ID ut token ID xt token ID yt
encoder RNN decoder RNN
56th Annual Meeting of the Association for Computational Linguistics, 15-20 July 2018, Melbourne
12
encoder RNN
56th Annual Meeting of the Association for Computational Linguistics, 15-20 July 2018, Melbourne
13
56th Annual Meeting of the Association for Computational Linguistics, 15-20 July 2018, Melbourne
scratch converge slowly
14
56th Annual Meeting of the Association for Computational Linguistics, 15-20 July 2018, Melbourne
15
LSTM-RNN LSTM-RNN
LSTM-RNN
'
transfer
56th Annual Meeting of the Association for Computational Linguistics, 15-20 July 2018, Melbourne
16
LSTM-RNN
!
56th Annual Meeting of the Association for Computational Linguistics, 15-20 July 2018, Melbourne
17
/ '
LSTM-RNN
/
transfer
'!/
56th Annual Meeting of the Association for Computational Linguistics, 15-20 July 2018, Melbourne
18
LSTM-RNN LSTM-RNN
! !
LSTM-RNN
'
transfer
56th Annual Meeting of the Association for Computational Linguistics, 15-20 July 2018, Melbourne
19
56th Annual Meeting of the Association for Computational Linguistics, 15-20 July 2018, Melbourne
20
LSTM-RNN
! '!
negative
'
transfer
56th Annual Meeting of the Association for Computational Linguistics, 15-20 July 2018, Melbourne
21
56th Annual Meeting of the Association for Computational Linguistics, 15-20 July 2018, Melbourne
22
56th Annual Meeting of the Association for Computational Linguistics, 15-20 July 2018, Melbourne
56th Annual Meeting of the Association for Computational Linguistics, 15-20 July 2018, Melbourne
24