Improving Abstractive Dialogue Summarization with Hierarchical Pretraining and Topic Segment

November 2021

摘要

With the increasing abundance of meeting transcripts, meeting summary has attracted more and more attention from researchers. The unsupervised pre-training method based on transformer structure combined with fine-tuning of downstream tasks has achieved great success in the field of text summarization. However, the semantic structure and style of meeting transcripts are quite different from that of articles. In this work, we propose a hierarchical transformer encoder-decoder network with multi-task pre-training. Specifically, we mask key sentences at the word-level encoder and generate them at the decoder. Besides, we randomly mask some of the role alignments in the input text and force the model to recover the original role tags to complete the alignments. In addition, we introduce a topic segmentation mechanism to further improve the quality of the generated summaries. The experimental results show that our model is superior to the previous methods in meeting summary datasets AMI and ICSI.

类型

会议文章

出版物

Conference on Empirical Methods in Natural Language Processing

Improving Abstractive Dialogue Summarization with Hierarchical Pretraining and Topic Segment

摘要

付宇卓

教授博士生导师

刘婷

讲师

Improving Abstractive Dialogue Summarization with Hierarchical Pretraining and Topic Segment

摘要

付宇卓

教授 博士生导师

刘婷

讲师

教授博士生导师