arxiv:2111.09525

SummaC: Re-Visiting NLI-based Models for Inconsistency Detection in Summarization

Published on Nov 18, 2021

Authors:

Philippe Laban

Abstract

SummaCConv, a method that segments and aggregates scores from NLI models at the sentence level, significantly improves their effectiveness for document-level inconsistency detection, achieving state-of-the-art results on the SummaC benchmark.

AI-generated summary

In the summarization domain, a key requirement for summaries is to be factually consistent with the input document. Previous work has found that natural language inference (NLI) models do not perform competitively when applied to inconsistency detection. In this work, we revisit the use of NLI for inconsistency detection, finding that past work suffered from a mismatch in input granularity between NLI datasets (sentence-level) and inconsistency detection (document-level). We provide a highly effective and lightweight method called SummaCConv that enables NLI models to be successfully used for this task by segmenting documents into sentence units and aggregating scores between pairs of sentences. On our newly introduced benchmark called SummaC (Summary Consistency), consisting of six large inconsistency detection datasets, SummaCConv obtains state-of-the-art results with a balanced accuracy of 74.4%, a 5 percentage point improvement over prior work. We make the models and datasets available: https://github.com/tingofurro/summac
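To make the sentence-level idea concrete, here is a minimal sketch of segmenting both texts into sentences, scoring every (document sentence, summary sentence) pair, and aggregating the pair scores into one document-level score. Several parts are assumptions for illustration only: `toy_entailment_score` is a hypothetical word-overlap stand-in for a real NLI model, the regex sentence splitter is deliberately naive, and the aggregation used here (maximum over document sentences, then mean over summary sentences) is a simple fixed rule, not the aggregation that SummaCConv learns. The `balanced_accuracy` helper shows the usual definition of the reported metric, the mean of per-class recall.

```python
import re
from typing import Callable, List, Sequence


def split_sentences(text: str) -> List[str]:
    """Naive splitter (assumption): break after '.', '!' or '?' plus whitespace."""
    parts = re.split(r"(?<=[.!?])\s+", text.strip())
    return [p for p in parts if p]


def _words(text: str) -> set:
    """Lowercased alphanumeric tokens of a sentence."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def toy_entailment_score(premise: str, hypothesis: str) -> float:
    """Hypothetical stand-in for an NLI model: the fraction of hypothesis
    words that also appear in the premise. A real system would return an
    entailment probability from a trained NLI classifier instead."""
    hyp = _words(hypothesis)
    if not hyp:
        return 0.0
    return len(hyp & _words(premise)) / len(hyp)


def consistency_score(
    document: str,
    summary: str,
    score_pair: Callable[[str, str], float] = toy_entailment_score,
) -> float:
    """Score each summary sentence by its best-supporting document sentence
    (max), then average those scores over the summary (mean)."""
    doc_sents = split_sentences(document)
    sum_sents = split_sentences(summary)
    if not doc_sents or not sum_sents:
        return 0.0
    per_sentence = [max(score_pair(d, s) for d in doc_sents) for s in sum_sents]
    return sum(per_sentence) / len(per_sentence)


def balanced_accuracy(y_true: Sequence[int], y_pred: Sequence[int]) -> float:
    """Mean of per-class recall over the classes present in y_true."""
    recalls = []
    for cls in sorted(set(y_true)):
        idx = [i for i, y in enumerate(y_true) if y == cls]
        hits = sum(1 for i in idx if y_pred[i] == cls)
        recalls.append(hits / len(idx))
    return sum(recalls) / len(recalls)


document = "The cat sat on the mat. The dog barked at the mailman."
print(consistency_score(document, "The dog barked."))
print(consistency_score(document, "The cat chased the mailman."))
```

With this toy scorer, the summary whose single sentence is fully covered by one document sentence scores higher than the summary that mixes words from two different document sentences. Swapping `score_pair` for a function backed by an actual NLI model leaves the segmentation and aggregation steps unchanged.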

