FactGen: Faithful Text Generation by Factuality-aware Pre-training and Contrastive Ranking Fine-tuning

ZhiBin Lan; Wei Li; Jinsong Su; Xinyan Xiao; Jiachen Liu; Wenhao Wu; Yajuan Lyu

doi:10.1613/jair.1.14267

PDF

Published: Apr 27, 2023

DOI: https://doi.org/10.1613/jair.1.14267

Keywords:

neural networks, natural language

ZhiBin Lan

a:1:{s:5:"en_US";s:17:"Xiamen University";}

Wei Li

Jinsong Su

Xinyan Xiao

Jiachen Liu

Wenhao Wu

Yajuan Lyu

Abstract

Conditional text generation is supposed to generate a fluent and coherent target text that is faithful to the source text. Although pre-trained models have achieved promising results, they still suffer from the crucial factuality problem. To deal with this issue, we propose a factuality-aware pretraining-finetuning framework named FactGen, which fully considers factuality during two training stages. Specifically, at the pre-training stage, we utilize a natural language inference model to construct target texts that are entailed by the source texts, resulting in a more factually consistent pre-training objective. Then, during the fine-tuning stage, we further introduce a contrastive ranking loss to encourage the model to generate factually consistent text with higher probability. Extensive experiments on three conditional text generation tasks demonstrate the effectiveness and generality of our training framework.

Issue

Vol. 76 (2023)

Section

Articles

Article Sidebar

Main Article Content

Abstract

Article Details