On the current stage, machine learning (focus purely on the context of the words) for dialogue generation has been widely studied. Indeed, existing studies of Prendinger et al. show that addressing emotion in dialogue system can enhance user satisfaction, contribute to a more positive perception of the interaction, and lead to fewer breakdowns in dialogues. Therefore, to improve conversational agent performance, our goal is to utilize and improve Emotional Chatting Machine(ECM) that can generate appropriate responses not only in content but also in emotion.
-
Data preparation: Train sentiment classifiers on an annotated dataset and choose the best classifier for automatic annotation of our dialogue dataset
-
ECM implementation: train emotional chatting machine(ECM) using the approach described in Zhou et al.
-
Task: Given a post X = (x_1,x_2,...,x_n) and an emotion category e of the response to be generated, the goal is to generate a response Y =(y_1,y_2,...,y_m) that is coherent with the emotion category e
-
Approach: Use a variant of seq2seq model, ECM, that incorporates Emotion Embedding, Internal Memory, and External Memory. (Framework shown below)
-
ECM improvement: experiment the following approaches to improve the model performance
-
Incorporate the emotion of posts into the encoder in the training phase to allow empathic
responses in the inference phase (Rashkin et al.)
-
Use global attention mechanism (Luong et al.)
-
Utilize Transformer-based model to replace seq2seq (Devlin et al)
-
-
- - Dialogue Datasets: BNC Corpus, Twitter Dialogue dataset, Cornell Movie Corpus, Daily Dialog
- - Training data for sentiment classifier: LiveJournal tweets (large text corpus with emotional tag)
- - Sentiment Classifiers: There are multiple machine learning models introduced by Bostan et al.
[1] L. A. M. Bostan and R. Klinger, “An Analysis of Annotated Corpora for Emotion Classification in
Text,” in Proceedings of the 27th International Conference on Computational Linguistics, Santa Fe, New Mexico, USA, 2018, pp. 2104–2119.
[2] H. Zhou, M. Huang, T. Zhang, X. Zhu, and B. Liu, “Emotional Chatting Machine: Emotional
Conversation Generation with Internal and External Memory,” arXiv:1704.01074 [cs], Apr. 2017. [3]
H. Wei, Y. Zhao, and J. Ke, “Building Chatbot with Emotions,” p. 8. I. Sutskever, O. Vinyals, and Q. V. Le, “Sequence to Sequence Learning with Neural Networks,”
arXiv:1409.3215 [cs], Sep. 2014. H. Rashkin, E. M. Smith, M. Li, and Y.-L. Boureau, “I Know the Feeling: Learning to Converse with
Empathy,” arXiv:1811.00207 [cs], Oct. 2018.
[6] H. Prendinger and M. Ishizuka, “The Empathic Companion: A Character-Based Interface That
Addresses Users’ Affective States,” Applied Artificial Intelligence, vol. 19, no. 3–4, pp. 267–285, Mar. 2005.
[7] M.-T. Luong, H. Pham, and C. D. Manning, “Effective Approaches to Attention-based Neural
Machine Translation,” arXiv:1508.04025 [cs], Aug. 2015.
[8] J. Devlin, M.-W. Chang, K. Lee, and K. Toutanova, “BERT: Pre-training of Deep Bidirectional
Transformers for Language Understanding,” arXiv:1810.04805 [cs], Oct. 2018.