Journal of Chongqing University of Technology(Natural Science) ›› 2024, Vol. 38 ›› Issue (2): 170-180.
• Information and computer science •
Abstract: Traditional text summarization models trained with a cross-entropy loss suffer from degraded performance during inference, poor generalization, severe exposure bias during generation, and low similarity between the generated summary and the reference summary. To address these problems, a novel training approach is proposed in this paper. On the one hand, the model itself generates a candidate set using beam search and selects positive and negative samples according to the evaluation scores of the candidate summaries; two sets of contrastive loss functions are then built within the output candidate set using "argmax greedy-search probability values" and "label probability values". On the other hand, a time-series recursive function designed to operate on the candidate set's sentences guides the model to maintain temporal accuracy when outputting each individual candidate summary, mitigating exposure bias. Experiments show that the method significantly improves generalization performance on the CNN/Daily Mail and XSum public datasets: ROUGE and BERTScore reach 47.54 and 88.51 respectively on CNN/Daily Mail, and 48.75 and 92.61 on XSum.
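The candidate-ranking idea in the abstract can be illustrated with a minimal sketch. The function below is a hypothetical pairwise margin ranking loss over beam-search candidates, in the style of contrastive summarization training: candidates are ordered by an external evaluation score (e.g. ROUGE against the reference), and whenever the model assigns a higher score to a worse candidate than to a better one, a margin penalty is incurred. The function names, the margin value, and the rank-scaled margin are assumptions for illustration, not details taken from the paper.

```python
def ranking_contrastive_loss(model_scores, quality_scores, margin=0.01):
    """Pairwise margin ranking loss over a candidate summary set.

    model_scores[i]   -- the model's (log-)probability score for candidate i
    quality_scores[i] -- external evaluation score (e.g. ROUGE) of candidate i
    Returns the summed hinge loss over all (better, worse) candidate pairs.
    """
    # Order candidate indices from best to worst by evaluation score.
    order = sorted(range(len(quality_scores)),
                   key=lambda i: quality_scores[i], reverse=True)
    loss = 0.0
    for rank_i, better in enumerate(order):
        for rank_j in range(rank_i + 1, len(order)):
            worse = order[rank_j]
            # A larger rank gap demands a larger margin between the two
            # model scores; well-ordered pairs contribute zero loss.
            gap = rank_j - rank_i
            loss += max(0.0,
                        model_scores[worse] - model_scores[better]
                        + margin * gap)
    return loss
```

If the model's scores already agree with the quality ranking by more than the margin, the loss is zero; misordered pairs are penalized in proportion to how badly they are inverted, which pushes the model's probabilities toward the evaluation-based ranking of the candidate set.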
URL: http://clgzk.qks.cqut.edu.cn/EN/Y2024/V38/I2/170