Get The Point of My Utterance! Learning Towards Effective Responses with Multi-Head Attention Mechanism

Published in IJCAI 2018, 2018

Recommended citation: /files/2018-ijcai-multi-head.pdf

Attention mechanism has become a popular and widely used component in sequence-to-sequence models. However, previous research on neural generative dialogue systems always generates universal responses, and the attention distribution learned by the model always attends to the same semantic aspect. To solve this problem, in this paper, we propose a novel Multi-Head Attention Mechanism (MHAM) for generative dialog systems, which aims at capturing multiple semantic aspects from the user utterance. Further, a regularizer is formulated to force different attention heads to concentrate on certain aspects. The proposed mechanism leads to more informative, diverse, and relevant response generated. Experimental results show that our proposed model outperforms several strong baselines.

Download paper here

Recommended citation:

@inproceedings{Tao2018Get,
title={Get The Point of My Utterance! Learning Towards Effective Responses with Multi-Head Attention Mechanism},
author={Chongyang Tao and Shen Gao and Mingyue Shang and Wei Wu and Dongyan Zhao and Rui Yan},
booktitle={IJCAI},
year={2018}
}

Share on

Twitter Facebook LinkedIn

Shen Gao (高莘)

Share on