Topic

Multi-document summarization

About: Multi-document summarization is a research topic. Over the lifetime, 2270 publications have been published within this topic receiving 71850 citations.

...read moreread less

Papers published on a yearly basis

Papers

PDF

Open Access

More filters

Proceedings Article•

Automatic Summarization Using Terminological and Semantic Resources.

[...]

Jorge Vivaldi¹, Iria da Cunha¹, Juan-Manuel Torres-Moreno², Patricia Velázquez-Morales•Institutions (2)

Pompeu Fabra University¹, École Polytechnique de Montréal²

01 May 2010

TL;DR: A new algorithm for automatic summarization of specialized texts combining terminological and semantic resources: a term extractor and an ontology that obtains quite good results although the perception is that there is a space for improvement.

...read moreread less

Abstract: This paper presents a new algorithm for automatic summarization of specialized texts combining terminological and semantic resources: a term extractor and an ontology. The term extractor provides the list of the terms that are present in the text together their corresponding termhood. The ontology is used to calculate the semantic similarity among the terms found in the main body and those present in the document title. The general idea is to obtain a relevance score for each sentence taking into account both the termhood of the terms found in such sentence and the similarity among such terms and those terms present in the title of the document. The phrases with the highest score are chosen to take part of the final summary. We evaluate the algorithm with Rouge, comparing the resulting summaries with the summaries of other summarizers. The sentence selection algorithm was also tested as part of a standalone summarizer. In both cases it obtains quite good results although the perception is that there is a space for improvement.

...read moreread less

8 citations

Proceedings Article•DOI•

Query-focused multi-document summarization based on query-sensitive feature space

[...]

Wenpeng Yin¹, Yulong Pei¹, Fan Zhang¹, Lian'en Huang¹•Institutions (1)

Peking University¹

29 Oct 2012

TL;DR: A novel approach is proposed that integrates all query-oriented relevance, information richness and novelty requirements skillfully by treating them as sentence features, making that the finally generated summary could fully reflect the combinational effect of these properties.

...read moreread less

Abstract: Query-oriented relevance, information richness and novelty are important requirements in query-focused summarization, which, to a considerable extent, determine the summary quality Previous work either rarely took into account all above demands simultaneously or dealt with part of them in the dynamic process of choosing sentences to generate a summary In this paper, we propose a novel approach that integrates all these requirements skillfully by treating them as sentence features, making that the finally generated summary could fully reflect the combinational effect of these properties Experimental results on the DUC2005 and DUC2006 datasets demonstrate the effectiveness of our approach

...read moreread less

8 citations

Patent•

Extractive query-focused multi-document summarization

[...]

Odellia Boni¹, Guy Feigenblat¹, David Konopnicki¹, Haggai Roitman¹•Institutions (1)

IBM¹

15 Dec 2017

TL;DR: In this paper, a method, computer system, and computer program product for generating a multi-document summary is provided, which is based on a query statement and one or more documents.

...read moreread less

Abstract: A method, computer system, and computer program product for generating a multi-document summary is provided. The embodiment may include receiving a query statement, one or more documents, one or more summary constraints, and quality goals. The embodiment may include identifying one or more keywords within the query statement. The embodiment may include performing a sentence selection from the one or more documents based on the one or more identified keywords. The embodiment may include generating a plurality of candidate summaries of the one or more documents based on the performed sentence selection, the goals, and a cross entropy method. The embodiment may include calculating a quality score for each of the plurality of generated candidate summaries using a plurality of quality features. The embodiment may include selecting a candidate summary from the plurality of generated candidate summaries with the highest calculated quality score that also satisfies a quality score threshold.

...read moreread less

8 citations

Proceedings Article•DOI•

Semantic Summarization of Web Documents

[...]

Antonio d'Acierno, Vincenzo Moscato¹, Fabio Persia¹, Antonio Picariello¹, Antonio Penta - Show less +1 more•Institutions (1)

University of Naples Federico II¹

22 Sep 2010

TL;DR: In this article, a novel approach for summarizing documents retrieved from the Internet is proposed to capture the semantic nature of a document, expressed in natural language, in order to retrieve a number of RDF triplets and to cluster these ones aggregating similar information.

...read moreread less

Abstract: Documents’ summarization techniques automatically extract relevant information from different sources with respect to a list of topics: they can be profitably used by a variety of applications and in particular for automatic indexing and categorization in order to facilitate the production and delivery of new multimedia contents. In this paper we propose a novel approach for summarizing documents retrieved from the Internet: we propose to capture the semantic nature of a document, expressed in natural language, in order to retrieve a number of RDF triplets and to clusterize these ones aggregating similar information. An overview of the system and some preliminary results are described.

...read moreread less

8 citations

Proceedings Article•

Unsupervised Approach for Selecting Sentences in Query-based Summarization

[...]

Yllias Chali¹, Shafiq Joty¹•Institutions (1)

University of Lethbridge¹

17 Nov 2008

TL;DR: This work extracted several features of different types for each of the sentences in the document collection in order to measure its relevancy to the user query and experimented with two well-known unsupervised statistical machine learning techniques: K-Means and EM algorithms and evaluated their performances.

...read moreread less

Abstract: When a user is served with a ranked list of relevant documents by the standard document search engines, his search task is usually not over. He has to go through the entire document contents to judge its relevance and to find the precise piece of information he was looking for. Query-relevant summarization tries to remove the onus on the end-user by providing more condensed and direct access to relevant information. Query-relevant summarization is the task to synthesize a fluent, well-organized summary of the document collection that answers the user questions. We extracted several features of different types (i.e. lexical, lexical semantic, statistical and cosine similarity ) for each of the sentences in the document collection in order to measure its relevancy to the user query. We experimented with two well-known unsupervised statistical machine learning techniques: K-Means and EM algorithms and evaluated their performances. For all these methods of generating summaries, we have shown the effects of different kinds of features.

...read moreread less

8 citations

Collapse

Network Information

Performance

Metrics

2,507

Papers

81,726

Citations

No. of papers in the topic in previous years
Year	Papers
2023	74
2022	160
2021	52
2020	61
2019	47
2018	52

Multi-document summarization

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics