Topic

Multi-document summarization

About: Multi-document summarization is a research topic. Over the lifetime, 2270 publications have been published within this topic receiving 71850 citations.

...read moreread less

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Fear the REAPER: A System for Automatic Multi-Document Summarization with Reinforcement Learning

[...]

Cody Rioux¹, Sadid A. Hasan², Yllias Chali¹•Institutions (2)

University of Lethbridge¹, Philips²

01 Oct 2014

TL;DR: It is shown that ROUGE results can be improved using a unigram and bigram similarity metric when training a learner to select sentences for summarization, and query focused extensions of this approach show an improvement.

...read moreread less

Abstract: This paper explores alternate algorithms, reward functions and feature sets for performing multi-document summarization using reinforcement learning with a high focus on reproducibility. We show that ROUGE results can be improved using a unigram and bigram similarity metric when training a learner to select sentences for summarization. Learners are trained to summarize document clusters based on various algorithms and reward functions and then evaluated using ROUGE. Our experiments show a statistically significant improvement of 1.33%, 1.58%, and 2.25% for ROUGE-1, ROUGE-2 and ROUGEL scores, respectively, when compared with the performance of the state of the art in automatic summarization with reinforcement learning on the DUC2004 dataset. Furthermore query focused extensions of our approach show an improvement of 1.37% and 2.31% for ROUGE-2 and ROUGE-SU4 respectively over query focused extensions of the state of the art with reinforcement learning on the DUC2006 dataset.

...read moreread less

47 citations

Proceedings Article•DOI•

Multi-document summarization by visualizing topical content

[...]

Rie Kubota Ando¹, Branimir Boguraev², Roy J. Byrd², Mary S. Neff²•Institutions (2)

Cornell University¹, IBM²

30 Apr 2000

TL;DR: This paper describes a framework for multi- document summarization which combines three premises: coherent themes can be identified reliably, highly representative themes can function as multi-document summary surrogates, and effective end-use of such themes should be facilitated by a visualization environment which clarifies the relationship between themes and documents.

...read moreread less

Abstract: This paper describes a framework for multi-document summarization which combines three premises: coherent themes can be identified reliably; highly representative themes, running across subsets of the document collection, can function as multi-document summary surrogates; and effective end-use of such themes should be facilitated by a visualization environment which clarifies the relationship between themes and documents. We present algorithms that formalize our framework, describe an implementation, and demonstrate a prototype system and interface.

...read moreread less

47 citations

Proceedings Article•

Generating Aspect-oriented Multi-Document Summarization with Event-aspect model

[...]

Peng Li¹, Yinglin Wang¹, Wei Gao², Jing Jiang³•Institutions (3)

Shanghai Jiao Tong University¹, The Chinese University of Hong Kong², Singapore Management University³

27 Jul 2011

TL;DR: Key features of this method include automatic grouping of semantically related sentences and sentence ranking based on extension of random walk model and a new sentence compression algorithm which use dependency tree instead of parser tree.

...read moreread less

Abstract: In this paper, we propose a novel approach to automatic generation of aspect-oriented summaries from multiple documents. We first develop an event-aspect LDA model to cluster sentences into aspects. We then use extended LexRank algorithm to rank the sentences in each cluster. We use Integer Linear Programming for sentence selection. Key features of our method include automatic grouping of semantically related sentences and sentence ranking based on extension of random walk model. Also, we implement a new sentence compression algorithm which use dependency tree instead of parser tree. We compare our method with four baseline methods. Quantitative evaluation based on Rouge metric demonstrates the effectiveness and advantages of our method.

...read moreread less

47 citations

Journal Article•DOI•

Ranking Through Clustering: An Integrated Approach to Multi-Document Summarization

[...]

Xiaoyan Cai¹, Wenjie Li•Institutions (1)

Northwest A&F University¹

01 Jul 2013-IEEE Transactions on Audio, Speech, and Language Processing

TL;DR: A novel approach that directly generates clusters integrated with ranking is proposed that is demonstrated by both the cluster quality analysis and the summarization evaluation conducted on the DUC 2004-2007 datasets.

...read moreread less

Abstract: Multi-document summarization aims to create a condensed summary while retaining the main characteristics of the original set of documents. Under such background, sentence ranking has hitherto been the issue of most concern. Since documents often cover a number of topic themes with each theme represented by a cluster of highly related sentences, sentence clustering has been explored in the literature in order to provide more informative summaries. For each topic theme, the rank of terms conditional on this topic theme should be very distinct, and quite different from the rank of terms in other topic themes. Existing cluster-based summarization approaches apply clustering and ranking in isolation, which leads to incomplete, or sometimes rather biased, analytical results. A newly emerged framework uses sentence clustering results to improve or refine the sentence ranking results. Under this framework, we propose a novel approach that directly generates clusters integrated with ranking in this paper. The basic idea of the approach is that ranking distribution of sentences in each cluster should be quite different from each other, which may serve as features of clusters and new clustering measures of sentences can be calculated accordingly. Meanwhile, better clustering results can achieve better ranking results. As a result, ranking and clustering by mutually and simultaneously updating each other so that the performance of both can be improved. The effectiveness of the proposed approach is demonstrated by both the cluster quality analysis and the summarization evaluation conducted on the DUC 2004-2007 datasets.

...read moreread less

46 citations

Proceedings Article•DOI•

Modeling Document Summarization as Multi-objective Optimization

[...]

Lei Huang¹, Yanxiang He¹, Furu Wei¹, Wenjie Li•Institutions (1)

Wuhan University¹

02 Apr 2010

TL;DR: This paper considers document summarization as a multi-objective optimization problem involving four objective functions, namely information coverage, significance, redundancy and text coherence, and chooses the DUC 2005 and 2006 query-oriented summarization tasks to exam the proposed model.

...read moreread less

Abstract: In this paper, we consider document summarization as a multi-objective optimization problem involving four objective functions, namely information coverage, significance, redundancy and text coherence. These functions measure the possible summaries based on the identified core terms and main topics (i.e. a cluster of semantically or statistically related core terms). We choose the DUC 2005 and 2006 query-oriented summarization tasks to exam the proposed model. The encouraging results indicate that the multi-objective optimization based framework for document summarization is truly a promising research direction.

...read moreread less

46 citations

Collapse

Network Information

Performance

Metrics

2,507

Papers

81,726

Citations

No. of papers in the topic in previous years
Year	Papers
2023	74
2022	160
2021	52
2020	61
2019	47
2018	52

Multi-document summarization

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics