Topic

Multi-document summarization

About: Multi-document summarization is a research topic. Over its lifetime, 2,270 publications have been published within this topic, receiving 71,850 citations.


Papers
Journal ArticleDOI
01 Jul 2022
TL;DR: In this paper, a memetic algorithm, specifically a Multi-Objective Shuffled Frog-Leaping Algorithm (MOSFLA), has been developed, implemented, and applied to solve the query-oriented extractive multi-document text summarization problem.
Abstract: Automatic text summarization is a topic of great interest in many fields of knowledge. In particular, query-oriented extractive multi-document text summarization methods have recently grown in importance, since they can automatically generate a summary according to a query given by the user. One way to address this problem is through multi-objective optimization approaches. In this paper, a memetic algorithm, specifically a Multi-Objective Shuffled Frog-Leaping Algorithm (MOSFLA), has been developed, implemented, and applied to solve the query-oriented extractive multi-document text summarization problem. Experiments have been conducted with datasets from the Text Analysis Conference (TAC), and the results have been evaluated with Recall-Oriented Understudy for Gisting Evaluation (ROUGE) metrics. The results show that the proposed approach achieves important improvements over previous works in the scientific literature: specifically, percentage improvements of 25.41%, 7.13%, and 30.22% in ROUGE-1, ROUGE-2, and ROUGE-SU4 scores, respectively. In addition, MOSFLA has been applied to medical texts from the Topically Diverse Query Focus Summarization (TD-QFS) dataset as a case study.
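To make the setting concrete, below is a minimal Python sketch of query-oriented extractive summarization: it greedily picks sentences that are similar to the query while penalizing redundancy with already-selected sentences, the same trade-off that the paper's multi-objective MOSFLA explores far more thoroughly. This is an illustrative greedy baseline, not the MOSFLA algorithm; the TF-IDF representation, weights, and function names are assumptions.

```python
# Greedy query-oriented extractive baseline (illustrative, not MOSFLA):
# balance relevance to the query against redundancy with chosen sentences.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.metrics.pairwise import cosine_similarity


def query_oriented_summary(sentences, query, max_sentences=3, redundancy_penalty=0.7):
    # Represent sentences and the query in the same TF-IDF space.
    vectorizer = TfidfVectorizer(stop_words="english")
    matrix = vectorizer.fit_transform(sentences + [query])
    sent_vecs, query_vec = matrix[:-1], matrix[-1]

    # Relevance of each sentence to the query.
    relevance = cosine_similarity(sent_vecs, query_vec).ravel()

    selected = []
    while len(selected) < min(max_sentences, len(sentences)):
        best, best_score = None, float("-inf")
        for i in range(len(sentences)):
            if i in selected:
                continue
            # Redundancy = highest similarity to any already-selected sentence.
            redundancy = max(
                (cosine_similarity(sent_vecs[i], sent_vecs[j])[0, 0] for j in selected),
                default=0.0,
            )
            score = relevance[i] - redundancy_penalty * redundancy
            if score > best_score:
                best, best_score = i, score
        selected.append(best)
    return [sentences[i] for i in sorted(selected)]
```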

4 citations

Book ChapterDOI
20 Mar 2016
TL;DR: This paper proposes a novel methodology for evaluating summaries in the context of online reputation, which profits from an analogy between reputation reports and the problem of diversity in search, and provides empirical evidence that incorporating priority signals may benefit this summarization task.
Abstract: Producing online reputation reports for an entity (company, brand, etc.) is a focused summarization task with a distinctive feature: issues that may affect the reputation of the entity take priority in the summary. In this paper we (i) propose a novel methodology to evaluate summaries in the context of online reputation which profits from an analogy between reputation reports and the problem of diversity in search; and (ii) provide empirical evidence that incorporating priority signals may benefit this summarization task.
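As a rough illustration of the diversity analogy, the hypothetical metric below scores a reputation report by the priority-weighted fraction of known issues it covers. It is not the evaluation measure defined in the paper; the issue labels, weights, and function name are invented for the example.

```python
# Illustrative priority-weighted coverage score for a reputation report
# (hypothetical metric, not the measure defined in the paper).
def priority_coverage(report_issues, issue_priority):
    """report_issues: iterable of sets of issue ids covered by each selected item.
    issue_priority: dict mapping issue id -> priority weight (higher = more important)."""
    covered = set().union(*report_issues) if report_issues else set()
    total = sum(issue_priority.values())
    if total == 0:
        return 0.0
    return sum(w for issue, w in issue_priority.items() if issue in covered) / total


# Example: covering the two high-priority issues yields most of the score.
priorities = {"data_breach": 3.0, "ceo_scandal": 2.0, "ad_campaign": 0.5}
print(priority_coverage([{"data_breach"}, {"ceo_scandal"}], priorities))  # ~0.91
```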

4 citations

Posted Content
TL;DR: This article studies the problem of domain adaptation for neural abstractive summarization and finds that combining in-domain and out-of-domain setups yields better summaries when in-domain data is insufficient.
Abstract: We study the problem of domain adaptation for neural abstractive summarization. We make initial efforts in investigating what information can be transferred to a new domain. Experimental results on news stories and opinion articles indicate that a neural summarization model benefits from pre-training based on extractive summaries. We also find that combining in-domain and out-of-domain data yields better summaries when in-domain data is insufficient. Further analysis shows that the model is capable of selecting salient content even when trained on out-of-domain data, but requires in-domain data to capture the style of the target domain.
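The general recipe the abstract describes, continuing training of an out-of-domain pre-trained summarizer on whatever in-domain examples exist, can be sketched with the Hugging Face transformers library as below. The checkpoint name, toy data, and hyperparameters are placeholders; the paper's own model and training setup differ.

```python
# Sketch of the two-stage recipe: start from a summarizer pre-trained on
# out-of-domain data, then fine-tune on the (small) in-domain set.
from datasets import Dataset
from transformers import (AutoModelForSeq2SeqLM, AutoTokenizer,
                          DataCollatorForSeq2Seq, Seq2SeqTrainer,
                          Seq2SeqTrainingArguments)

checkpoint = "facebook/bart-base"  # generic seq2seq checkpoint (placeholder)
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForSeq2SeqLM.from_pretrained(checkpoint)

# Tiny stand-in for the in-domain corpus (e.g. opinion articles).
in_domain = Dataset.from_dict({
    "document": ["The new phone's battery life disappointed most reviewers ..."],
    "summary": ["Reviewers were unhappy with the phone's battery life."],
})

def tokenize(batch):
    inputs = tokenizer(batch["document"], truncation=True, max_length=512)
    labels = tokenizer(text_target=batch["summary"], truncation=True, max_length=64)
    inputs["labels"] = labels["input_ids"]
    return inputs

train_set = in_domain.map(tokenize, batched=True, remove_columns=in_domain.column_names)

trainer = Seq2SeqTrainer(
    model=model,
    args=Seq2SeqTrainingArguments(output_dir="indomain-ft", num_train_epochs=1,
                                  per_device_train_batch_size=1),
    train_dataset=train_set,
    data_collator=DataCollatorForSeq2Seq(tokenizer, model=model),
)
trainer.train()  # continue training on in-domain data (domain adaptation step)
```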

4 citations

Proceedings ArticleDOI
12 Jul 2009
TL;DR: This paper proposes a novel approach to multi-document summarization based on subtopic segmentation, which first detects the subtopics within a topic and then finds the central sentence for each subtopic.
Abstract: This paper proposes a novel approach to multi-document summarization based on subtopic segmentation. It first detects the subtopics within a topic and then finds the central sentence for each subtopic. Sentences are scored based on their importance in the document and in the subtopic, and two anti-redundancy strategies are used to extract sentences to form the summary. Since the approach is intrinsically incremental, it remains effective when new documents are added to the document set. Experimental results indicate that the proposed approach is both effective and efficient.
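A rough sketch of this pipeline is shown below: cluster the sentences into subtopics and take the sentence nearest each cluster centroid as the subtopic's central sentence. TF-IDF vectors and k-means are illustrative stand-ins for the paper's segmentation method, and the importance scoring and anti-redundancy steps are omitted.

```python
# Rough sketch of subtopic-based extraction: cluster sentences into subtopics
# and pick the sentence nearest each cluster centroid as its "central" sentence.
import numpy as np
from sklearn.cluster import KMeans
from sklearn.feature_extraction.text import TfidfVectorizer


def central_sentences(sentences, n_subtopics=3):
    vectors = TfidfVectorizer(stop_words="english").fit_transform(sentences)
    kmeans = KMeans(n_clusters=n_subtopics, n_init=10, random_state=0).fit(vectors)
    summary = []
    for k in range(n_subtopics):
        members = np.where(kmeans.labels_ == k)[0]
        # Central sentence = cluster member closest to the subtopic centroid.
        dists = np.linalg.norm(vectors[members].toarray() - kmeans.cluster_centers_[k], axis=1)
        summary.append(sentences[members[int(np.argmin(dists))]])
    return summary
```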

4 citations

Book ChapterDOI
30 Aug 1999
TL;DR: Fuzzy technology is explored to provide semantics for the summarizations and aggregates developed in data warehousing systems, and query capabilities against such enhanced data warehouses are provided through extensions of SQL.
Abstract: A data warehouse integrates large amounts of extracted and summarized data from multiple sources for direct querying and analysis. While it provides decision makers with easy access to such historical and aggregate data, the real meaning of the data is often ignored: for example, whether a total sales amount of 1000 items indicates good or bad sales performance remains unclear. From the decision makers' point of view, the semantics that convey the meaning of the data matter more than the raw numbers. In this paper, we explore fuzzy technology to provide this semantics for the summarizations and aggregates developed in data warehousing systems. A three-layered data summarization architecture, namely quantitative (numerical) summarization, qualitative (categorical) summarization, and quantifier summarization, is proposed. To facilitate the construction of these three summarization levels, two operators are introduced. We provide query capabilities against such enhanced data warehouses through extensions of SQL.
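A toy Python illustration of layering fuzzy semantics over numeric aggregates follows: from a raw total (quantitative), to a fuzzy label such as "good sales" (qualitative), to a linguistic quantifier over several aggregates ("most regions had good sales"). The membership functions, thresholds, and figures are made up and not taken from the paper.

```python
# Toy illustration of the three summarization layers over sales aggregates:
# quantitative (raw totals) -> qualitative (fuzzy label) -> quantifier.
def good_sales_membership(total_items):
    """Degree (0..1) to which a sales total counts as 'good' (hypothetical ramp)."""
    low, high = 500, 2000
    return min(max((total_items - low) / (high - low), 0.0), 1.0)

def quantifier_most(degrees):
    """'Most' as a fuzzy quantifier: truth rises with the average membership."""
    avg = sum(degrees) / len(degrees)
    return min(max((avg - 0.3) / 0.5, 0.0), 1.0)

regional_totals = {"north": 1800, "south": 1000, "east": 400, "west": 2500}
degrees = [good_sales_membership(t) for t in regional_totals.values()]
print({r: round(good_sales_membership(t), 2) for r, t in regional_totals.items()})
print("truth('most regions had good sales') =", round(quantifier_most(degrees), 2))
```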

4 citations


Network Information
Related Topics (5)
Natural language: 31.1K papers, 806.8K citations, 85% related
Ontology (information science): 57K papers, 869.1K citations, 84% related
Web page: 50.3K papers, 975.1K citations, 83% related
Recurrent neural network: 29.2K papers, 890K citations, 83% related
Graph (abstract data type): 69.9K papers, 1.2M citations, 83% related
Performance Metrics
No. of papers in the topic in previous years
Year: Papers
2023: 74
2022: 160
2021: 52
2020: 61
2019: 47
2018: 52