Home
/
Topics
/
Multi-document summarization

Topic

Multi-document summarization

About: Multi-document summarization is a research topic. Over the lifetime, 2270 publications have been published within this topic receiving 71850 citations.

...read moreread less

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1991
1989
1987
1986
1985
1982

Papers

PDF

Open Access

More filters

Journal Article•DOI•

Two New Datasets for Italian-Language Abstractive Text Summarization

[...]

Nicola Landro, Ignazio Gallo, Riccardo La Grassa, Edoardo Federici

29 Apr 2022-Information

TL;DR: This work proposes two new original datasets collected from two Italian news websites with multi-sentence summaries and corresponding articles, and from a dataset obtained by machine translation of a Spanish summarization dataset, which demonstrated the superiority of the models obtained from the proposed datasets.

...read moreread less

Abstract: Text summarization aims to produce a short summary containing relevant parts from a given text. Due to the lack of data for abstractive summarization on low-resource languages such as Italian, we propose two new original datasets collected from two Italian news websites with multi-sentence summaries and corresponding articles, and from a dataset obtained by machine translation of a Spanish summarization dataset. These two datasets are currently the only two available in Italian for this task. To evaluate the quality of these two datasets, we used them to train a T5-base model and an mBART model, obtaining good results with both. To better evaluate the results obtained, we also compared the same models trained on automatically translated datasets, and the resulting summaries in the same training language, with the automatically translated summaries, which demonstrated the superiority of the models obtained from the proposed datasets.

...read moreread less

1 citations

Journal Article•DOI•

Information overlap in multilingual wikipedia and summarization

[...]

Elena Filatova¹•Institutions (1)

Fordham University¹

01 Dec 2012-International Journal of Cooperative Information Systems

TL;DR: This paper analyzes the regularities of information overlap among the articles about the same Wikipedia entry written in different languages and introduces a hypothesis that the structure of this information overlap is similar to the information overlap structure (pyramid model) used in summarization evaluation.

...read moreread less

Abstract: Wikipedia is used as a training corpus for many information selection tasks: summarization, question-answering, etc. The information presented in Wikipedia articles as well as the order in which this information is presented, is treated as the gold standard and is used for improving the quality of information selection systems. However, the Wikipedia articles corresponding to the same entry (person, location, event, etc.) written in different languages have substantial differences regarding what information is included in these articles. In this paper we analyze the regularities of information overlap among the articles about the same Wikipedia entry written in different languages: some information facts are covered in the Wikipedia articles in many languages, while others are covered only in a few languages. We introduce a hypothesis that the structure of this information overlap is similar to the information overlap structure (pyramid model) used in summarization evaluation, as well as the information o...

...read moreread less

1 citations

Journal Article•

Research of Automatic Summarization Methods

[...]

Song Ji-hua

01 Jan 2011-Computer Technology and Development

TL;DR: This paper summarizes the main automatic abstracting research methods and strategies and divides the methods into three major categories: automatically extracted summarization, automatic summarization based on information extraction and summarizing based on understanding.

...read moreread less

Abstract: It summarizes the main automatic abstracting research methods and strategies and divides the methods into three major categories: automatically extracted summarization,automatic summarization based on information extraction and summarization based on understanding.Automatically extracted method uses that extract important sentences from the article to form a digest;Abstract based on information extraction method uses that extract information from the article to fill framework which has been prepared,and then use the template to output the content;Abstract based on understanding is to use natural language processing technology to generate abstracts.focuses on automatically extracted summarization from single theme articles and multi-topic articles.After comparing advantages and disadvantages of variety of algorithms,a new multi-topic classification method is proposed.

...read moreread less

1 citations

Book Chapter•DOI•

Enhancing Sentence Ordering by Hierarchical Topic Modeling for Multi-document Summarization

[...]

Guangbing Yang¹, Kinshuk², Dunwei Wen², Erkki Sutinen¹•Institutions (2)

University of Eastern Finland¹, Athabasca University²

24 Nov 2013

TL;DR: This study proposes a novel approach that is built upon a hierarchical topic model for automatic evaluation of sentence ordering that is able to automatically evaluate sentences to find a plausible order to arrange them for generating a more readable summary.

...read moreread less

Abstract: The sentence ordering is a difficult but very important task in multi-document summarization. With the aim of producing a coherent and legible summary for multiple documents, this study proposes a novel approach that is built upon a hierarchical topic model for automatic evaluation of sentence ordering. By learning topic correlations from the topic hierarchies, this model is able to automatically evaluate sentences to find a plausible order to arrange them for generating a more readable summary. The experimental results demonstrate that our proposed approach can improve the summarization performance and present a significant enhancement on the sentence ordering for multi-document summarization. In addition, the experimental results show that our model can automatically analyze the topic relationships to infer a strategy for sentence ordering. Human evaluations justify that the generated summaries, which implement this strategy, demonstrate a good linguistic performance in terms of coherence, readability, and redundancy.

...read moreread less

1 citations

Proceedings Article•

Context-based Text Summarization for Business News Articles.

[...]

Hisham Al-Mubaid

01 Jan 2003

1 citations

Collapse

Network Information

Performance

Metrics

2,507

Papers

81,726

Citations

No. of papers in the topic in previous years
Year	Papers
2023	74
2022	160
2021	52
2020	61
2019	47
2018	52

Multi-document summarization

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics