Get The Point of My Utterance! Learning Towards Effective Responses with Multi-Head Attention Mechanism
Published in IJCAI 2018, 2018
Recommended citation: /files/2018-ijcai-multi-head.pdf
Attention mechanism has become a popular and widely used component in sequence-to-sequence models. However, previous research on neural generative dialogue systems always generates universal responses, and the attention distribution learned by the model always attends to the same semantic aspect. To solve this problem, in this paper, we propose a novel Multi-Head Attention Mechanism (MHAM) for generative dialog systems, which aims at capturing multiple semantic aspects from the user utterance. Further, a regularizer is formulated to force different attention heads to concentrate on certain aspects. The proposed mechanism leads to more informative, diverse, and relevant response generated. Experimental results show that our proposed model outperforms several strong baselines.
Recommended citation:
@inproceedings{Tao2018Get,
title={Get The Point of My Utterance! Learning Towards Effective Responses with Multi-Head Attention Mechanism},
author={Chongyang Tao and Shen Gao and Mingyue Shang and Wei Wu and Dongyan Zhao and Rui Yan},
booktitle={IJCAI},
year={2018}
}