Open Access · Posted Content

Leveraging Graph to Improve Abstractive Multi-Document Summarization

TL;DR: A neural abstractive multi-document summarization (MDS) model is developed that leverages well-known graph representations of documents to more effectively process multiple input documents and produce abstractive summaries.
Abstract
Graphs that capture relations between textual units have great benefits for detecting salient information from multiple documents and generating overall coherent summaries. In this paper, we develop a neural abstractive multi-document summarization (MDS) model which can leverage well-known graph representations of documents, such as similarity graphs and discourse graphs, to more effectively process multiple input documents and produce abstractive summaries. Our model utilizes graphs to encode documents in order to capture cross-document relations, which is crucial to summarizing long documents. Our model can also take advantage of graphs to guide the summary generation process, which is beneficial for generating coherent and concise summaries. Furthermore, pre-trained language models can be easily combined with our model, which further improves the summarization performance significantly. Empirical results on the WikiSum and MultiNews datasets show that the proposed architecture brings substantial improvements over several strong baselines.
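The abstract mentions similarity graphs as one graph representation the model can consume. As an illustration only, the sketch below shows one plausible construction of such a graph: a symmetric adjacency matrix whose weights are cosine similarities between bag-of-words vectors of paragraphs. The bag-of-words weighting and the pruning threshold are assumptions for this example, not the paper's actual preprocessing.

```python
# Hypothetical sketch of a similarity graph over input paragraphs.
# Assumptions: raw bag-of-words counts (the paper may use TF-IDF or
# other weighting) and a fixed pruning threshold chosen for illustration.
import math
from collections import Counter

def cosine(a: Counter, b: Counter) -> float:
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[w] * b[w] for w in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

def similarity_graph(paragraphs, threshold=0.2):
    """Symmetric adjacency matrix; edges below threshold are pruned."""
    bows = [Counter(p.lower().split()) for p in paragraphs]
    n = len(bows)
    adj = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            s = cosine(bows[i], bows[j])
            if s >= threshold:
                adj[i][j] = adj[j][i] = s
    return adj
```

A graph encoder can then use this matrix to bias attention between paragraphs from different documents, which is how cross-document relations enter the model.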


Citations
Posted Content

A Survey of Knowledge-Enhanced Text Generation

TL;DR: A comprehensive review of research on knowledge-enhanced text generation over the past five years is presented in two parts: (i) general methods and architectures for integrating knowledge into text generation; (ii) specific techniques and applications according to different forms of knowledge data.
Posted Content

Multi-document Summarization via Deep Learning Techniques: A Survey

TL;DR: This survey, the first of its kind, systematically overviews recent deep learning based MDS models, proposes a novel taxonomy to summarize the design strategies of neural networks, and conducts a comprehensive review of the state of the art.
Posted Content

MS^2: Multi-Document Summarization of Medical Studies

TL;DR: This work releases MS^2 (Multi-Document Summarization of Medical Studies), a dataset of over 470K documents and 20K summaries derived from the scientific literature. The dataset facilitates the development of systems that can assess and aggregate contradictory evidence across multiple studies, and is the first large-scale, publicly available multi-document summarization dataset in the biomedical domain.
Proceedings ArticleDOI

Efficiently Summarizing Text and Graph Encodings of Multi-Document Clusters

TL;DR: This paper presents an efficient graph-enhanced approach to multi-document summarization (MDS) with an encoder-decoder Transformer model that leads to significant improvements on the Multi-News dataset, with an average 1.8 ROUGE score improvement over previous work.
Proceedings ArticleDOI

FactGraph: Evaluating Factuality in Summarization with Semantic Graph Representations

TL;DR: FactGraph is proposed, a method that decomposes the document and the summary into structured meaning representations (MRs), which are more suitable for factuality evaluation; it improves performance on identifying content verifiability errors and better captures subsentence-level factual inconsistencies.
References
Proceedings Article

Adam: A Method for Stochastic Optimization

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
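The adaptive moment estimates mentioned in the TL;DR can be made concrete. Below is a minimal sketch of the standard published Adam update rule; the toy objective f(x) = x² and the starting point are invented for the example.

```python
# Minimal sketch of the Adam update: exponential moving averages of the
# gradient (m) and squared gradient (v), with bias correction for the
# zero-initialized moments. Hyperparameter defaults follow the paper.
import math

def adam_step(theta, grad, m, v, t, lr=0.001, b1=0.9, b2=0.999, eps=1e-8):
    m = b1 * m + (1 - b1) * grad          # first-moment (mean) estimate
    v = b2 * v + (1 - b2) * grad ** 2     # second-moment estimate
    m_hat = m / (1 - b1 ** t)             # bias-corrected moments
    v_hat = v / (1 - b2 ** t)
    theta -= lr * m_hat / (math.sqrt(v_hat) + eps)
    return theta, m, v

# Toy illustration: minimize f(x) = x^2 starting from x = 5.
theta, m, v = 5.0, 0.0, 0.0
for t in range(1, 1001):
    grad = 2 * theta                      # gradient of f(x) = x^2
    theta, m, v = adam_step(theta, grad, m, v, t)
```

Note how the normalized step size is roughly bounded by the learning rate, which is part of what makes Adam robust across problems.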
Proceedings Article

Attention is All you Need

TL;DR: This paper proposes a simple network architecture based solely on an attention mechanism, dispensing with recurrence and convolutions entirely, and achieves state-of-the-art performance on English-to-French translation.
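The attention mechanism at the core of this architecture is scaled dot-product attention, Attention(Q, K, V) = softmax(QKᵀ/√d_k)V. The sketch below implements it in pure Python for tiny list-of-list inputs; real implementations use batched tensor libraries, so this is illustrative only.

```python
# Pure-Python sketch of scaled dot-product attention for tiny inputs.
# Q, K, V are lists of row vectors; d_k is the key dimensionality.
import math

def softmax(xs):
    """Numerically stable softmax over a list of scores."""
    mx = max(xs)
    exps = [math.exp(x - mx) for x in xs]
    s = sum(exps)
    return [e / s for e in exps]

def attention(Q, K, V):
    d_k = len(K[0])
    out = []
    for q in Q:
        # Scale dot products by sqrt(d_k) to keep softmax gradients usable.
        scores = [sum(qi * ki for qi, ki in zip(q, k)) / math.sqrt(d_k)
                  for k in K]
        weights = softmax(scores)
        # Each output row is a weights-weighted average of the value rows.
        out.append([sum(w * v[j] for w, v in zip(weights, V))
                    for j in range(len(V[0]))])
    return out
```

Because the output is a convex combination of value rows, each query attends most to the keys it is most similar to.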
Journal ArticleDOI

Latent Dirichlet Allocation

TL;DR: This work proposes a generative model for text and other collections of discrete data that generalizes or improves on several previous models including naive Bayes/unigram, mixture of unigrams, and Hofmann's aspect model.
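LDA's generative story can be sketched directly: draw a per-document topic mixture θ from a Dirichlet prior α, then for each word draw a topic z from θ and a word from that topic's distribution β_z. The two-topic vocabulary below is invented for the example.

```python
# Illustrative sketch of LDA's generative process. The vocabulary and
# topic-word distributions (beta) are toy values for this example only.
import random

def sample_dirichlet(alpha):
    """Sample from Dirichlet(alpha) via normalized Gamma draws."""
    gammas = [random.gammavariate(a, 1.0) for a in alpha]
    total = sum(gammas)
    return [g / total for g in gammas]

def sample_categorical(probs):
    """Draw an index with the given probabilities."""
    r, acc = random.random(), 0.0
    for i, p in enumerate(probs):
        acc += p
        if r < acc:
            return i
    return len(probs) - 1

def generate_document(alpha, beta, vocab, length):
    theta = sample_dirichlet(alpha)          # per-document topic mixture
    words = []
    for _ in range(length):
        z = sample_categorical(theta)        # pick a topic for this word
        w = sample_categorical(beta[z])      # pick a word from that topic
        words.append(vocab[w])
    return words
```

Inference in LDA runs this story in reverse, recovering θ and β from observed documents; this is what makes it strictly more expressive than a mixture of unigrams, where a whole document shares one topic.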
Proceedings ArticleDOI

BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding

TL;DR: BERT pre-trains deep bidirectional representations from unlabeled text by jointly conditioning on both left and right context in all layers; the pre-trained model can be fine-tuned with just one additional output layer to create state-of-the-art models for a wide range of tasks.