Multi-document summarization

About: Multi-document summarization is a research topic. Over its lifetime, 2,270 publications have been published on this topic, receiving 71,850 citations.


Papers published on a yearly basis (chart: publications per year, 1982 to 2023)

Papers

Extractive Text Summarization

Namita Mittal, Basant Agarwal, Himanshu Mantri, Rahul Kumar Goyal, Manoj Kumar Jain

01 Jan 2014

TL;DR: A text summarization approach based on removing redundant sentences is proposed; it is most effective on documents that are highly redundant and contain repetitive opinions about a topic.

Abstract: Text summarization reduces the size of a text while preserving its information content. In this paper, a text summarization approach based on the removal of redundant sentences is proposed. Initially, each sentence of the original (input) text is scored based on how redundant it is and to what extent it covers other sentences. This approach is most effective on documents that are highly redundant and contain repetitive opinions about a topic. The summarization takes place in two stages, where the input of each stage is the output of the previous one, and after each stage the output summary is less redundant than before.
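
The two-stage redundancy removal described in this abstract could be sketched roughly as follows. This is a minimal illustration, not the authors' method: the word-overlap `coverage` measure, the greedy pruning order, the naive sentence splitter, and the thresholds `0.8` and `0.5` are all assumptions chosen for the example.

```python
import re


def split_sentences(text):
    """Naive splitter on '.', '!' and '?' (an assumption; the paper names none)."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def words(sentence):
    """Lower-cased set of word tokens in a sentence."""
    return set(re.findall(r"[a-z0-9']+", sentence.lower()))


def coverage(a, b):
    """Fraction of the words of sentence b that also occur in sentence a."""
    wa, wb = words(a), words(b)
    return len(wa & wb) / len(wb) if wb else 0.0


def prune_stage(sentences, threshold):
    """One stage: drop sentences that an already-kept sentence covers."""
    # Score each sentence by how much it covers all the other sentences.
    scores = [
        sum(coverage(s, t) for j, t in enumerate(sentences) if j != i)
        for i, s in enumerate(sentences)
    ]
    # Visit high-coverage sentences first; ties keep document order.
    order = sorted(range(len(sentences)), key=lambda i: (-scores[i], i))
    kept = []
    for i in order:
        if all(coverage(sentences[k], sentences[i]) < threshold for k in kept):
            kept.append(i)
    # Restore the original sentence order in the output.
    return [sentences[i] for i in sorted(kept)]


def summarize(text, thresholds=(0.8, 0.5)):
    """Run the stages in sequence; each stage consumes the previous output."""
    sentences = split_sentences(text)
    for threshold in thresholds:  # hypothetical values, one per stage
        sentences = prune_stage(sentences, threshold)
    return sentences


if __name__ == "__main__":
    demo = ("The camera is great. The camera is great and cheap. "
            "Battery life is poor.")
    for sentence in summarize(demo):
        print(sentence)
```

In the demo, the first sentence is dropped because the second one contains all of its words, while the unrelated sentence about battery life survives both stages.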

6 citations

Journal Article

Feature based cluster ranking approach for single document summarization

Aakanksha Sharaff, Mohit Jain, Geethika Modugula

14 Jan 2022-International journal of information technology

6 citations

Journal Article

Multi-Document Summarization Using K-Means and Latent Dirichlet Allocation (LDA) – Significance Sentences

Shiva Twinandilla¹, Satriyo Adhy¹, Bayu Surarso¹, Retno Kusumaningrum¹

Diponegoro University¹

01 Jan 2018-Procedia Computer Science

TL;DR: This research proposes a novel summarization method that combines K-Means clustering with LDA - Significance Sentences to generate document summaries based on topic; it performs well when K-Means clusters the documents correctly by topic.

6 citations

Proceedings Article

Summarizing Scientific Texts: Experiments with Extractive Summarizers

P.P. Balage Filho, T.A. Salgueiro Pardo, M. das Gracas Volpe Nunes

20 Oct 2007

TL;DR: This paper enhances the summarization process with the ability to detect and appropriately treat the text structure and produces a shorter version containing all the main parts of the research.

Abstract: In this paper we present experiments on scientific text summarization. From a complete text, we produce a shorter version containing all the main parts of the research. Having in mind the sophisticated structure of such texts, we show that good results can be achieved using simple extractive summarizers with some obvious improvements that consider the specificity of the text genre. Specifically, we enhance the summarization process with the ability to detect and appropriately treat the text structure.

6 citations

Journal Article

ViMs: a high-quality Vietnamese dataset for abstractive multi-document summarization

Nhi-Thao Tran¹, Minh-Quoc Nghiem¹, Nhung Thi-Hong Nguyen¹, Ngan Luu-Thuy Nguyen, Nam Van Chi¹, Dien Dinh¹

Ho Chi Minh City University of Science¹

01 Dec 2020

TL;DR: The ViMs dataset is suitable for both training and evaluating multi-document summarization systems. Its reliability was verified with a variety of metrics, including conventional Cohen's κ, relaxed Cohen's κ (a new metric proposed to better suit abstractive summarization), and ROUGE scores.

Abstract: Automatic text summarization is important in this era due to the exponential growth of documents available on the Internet. In the Vietnamese language, VietnameseMDS is the only publicly available dataset for this task. Although the dataset has 199 clusters, there are only three documents in each cluster, which is small compared to typical datasets in English. This motivates us to construct ViMs, a large, high-quality Vietnamese dataset for abstractive multi-document summarization. To that end, we recruited 29 annotators and enhanced MDSWriter, an open-source annotation tool, to support the annotators in creating gold-standard summaries. As a result, ViMs has 600 summaries corresponding to 300 clusters of 1,945 documents. We have verified the reliability of our dataset using a variety of metrics, including conventional Cohen's κ, relaxed Cohen's κ (a new metric that we propose to make it more suitable for abstractive summarization), and ROUGE scores. A relaxed κ score of 0.55 indicates that ViMs attains moderate agreement between annotators. Meanwhile, the ROUGE scores are 0.729 for ROUGE-1, 0.507 for ROUGE-2 and 0.524 for ROUGE-SU4. We have further evaluated ViMs using three different summarization systems: TextRank, CFVi and MUSEEC. Their ROUGE-1 scores are 0.628, 0.711 and 0.732, respectively. These results show that the ViMs dataset is suitable for both training and evaluating multi-document summarization systems. We have made the dataset and evaluation results of this work publicly available to the research community. Unlike previous work that published only the final summarization dataset, we also publish intermediate annotation results, which can be used in other NLP problems such as sentence classification.
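
For readers unfamiliar with the metrics named in this abstract, the sketch below shows how conventional Cohen's κ and ROUGE-1 are commonly computed. It is an illustration only: the whitespace tokenization and the binary "sentence selected or not" labels are assumptions, the relaxed κ variant is not reproduced because the abstract does not define it, and the function names are hypothetical.

```python
from collections import Counter


def cohen_kappa(labels_a, labels_b):
    """Conventional Cohen's kappa for two annotators labelling the same items."""
    if len(labels_a) != len(labels_b) or not labels_a:
        raise ValueError("need two non-empty label lists of equal length")
    n = len(labels_a)
    # Observed agreement: share of items where both annotators agree.
    observed = sum(a == b for a, b in zip(labels_a, labels_b)) / n
    # Chance agreement from each annotator's own label distribution.
    freq_a, freq_b = Counter(labels_a), Counter(labels_b)
    expected = sum(freq_a[c] * freq_b[c] for c in freq_a) / (n * n)
    if expected == 1.0:  # both annotators used one identical label throughout
        return 1.0
    return (observed - expected) / (1.0 - expected)


def rouge_1(candidate, reference):
    """Unigram-overlap precision, recall and F1 (whitespace tokens, lower-cased)."""
    cand = Counter(candidate.lower().split())
    ref = Counter(reference.lower().split())
    overlap = sum((cand & ref).values())  # clipped unigram matches
    precision = overlap / sum(cand.values()) if cand else 0.0
    recall = overlap / sum(ref.values()) if ref else 0.0
    total = precision + recall
    f1 = 2 * precision * recall / total if total else 0.0
    return {"precision": precision, "recall": recall, "f1": f1}


if __name__ == "__main__":
    # Assumed setup: 1 means the annotator selected the sentence, 0 means not.
    print(cohen_kappa([1, 1, 0, 0], [1, 0, 0, 0]))
    print(rouge_1("the cat sat", "the cat sat on the mat"))
```

Published ROUGE figures usually come from a standard toolkit with its own tokenization and stemming options, so numbers from a sketch like this are not directly comparable to the scores reported above.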

6 citations

Metrics

Papers: 2,507
Citations: 81,726

No. of papers in the topic in previous years
Year	Papers
2023	74
2022	160
2021	52
2020	61
2019	47
2018	52
