Home
/
Topics
/
Multi-document summarization

Topic

Multi-document summarization

About: Multi-document summarization is a research topic. Over the lifetime, 2270 publications have been published within this topic receiving 71850 citations.

...read moreread less

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1991
1989
1987
1986
1985
1982

Papers

PDF

Open Access

More filters

Proceedings Article•DOI•

Improving the Similarity Measure of Determinantal Point Processes for Extractive Multi-Document Summarization

[...]

Sangwoo Cho¹, Logan Lebanoff¹, Hassan Foroosh¹, Fei Liu¹•Institutions (1)

University of Central Florida¹

01 Jul 2019

TL;DR: This paper proposed a similarity measure inspired by capsule networks to measure redundancy between a pair of sentences based on surface form and semantic information and showed that the improved similarity measure performs competitively, outperforming strong summarization baselines on benchmark datasets.

...read moreread less

Abstract: The most important obstacles facing multi-document summarization include excessive redundancy in source descriptions and the looming shortage of training data. These obstacles prevent encoder-decoder models from being used directly, but optimization-based methods such as determinantal point processes (DPPs) are known to handle them well. In this paper we seek to strengthen a DPP-based method for extractive multi-document summarization by presenting a novel similarity measure inspired by capsule networks. The approach measures redundancy between a pair of sentences based on surface form and semantic information. We show that our DPP system with improved similarity measure performs competitively, outperforming strong summarization baselines on benchmark datasets. Our findings are particularly meaningful for summarizing documents created by multiple authors containing redundant yet lexically diverse expressions.

...read moreread less

46 citations

Proceedings Article•DOI•

Clustering Sentences with Density Peaks for Multi-document Summarization

[...]

Yang Zhang¹, Yunqing Xia², Yi Liu, Wenmin Wang¹•Institutions (2)

Peking University¹, Tsinghua University²

01 Jan 2015

TL;DR: This work proposes a unified sentence scoring model which measures representativeness and diversity at the same time in multi-document Summarization, and demonstrates that the MDS method outperforms the DUC04 best method and the existing clustering-based methods.

...read moreread less

Abstract: Multi-document Summarization (MDS) is of great value to many real world applications. Many scoring models are proposed to select appropriate sentences from documents to form the summary, in which the clustering-based methods are popular. In this work, we propose a unified sentence scoring model which measures representativeness and diversity at the same time. Experimental results on DUC04 demonstrate that our MDS method outperforms the DUC04 best method and the existing clustering-based methods, and it yields close results compared to the state-of-the-art generic MDS methods. Advantages of the proposed MDS method are two-fold: (1) The density peaks clustering algorithm is firstly adopted, which is effective and fast. (2) No external resources such as Wordnet and Wikipedia or complex language parsing algorithms is used, making reproduction and deployment very easy in real environment.

...read moreread less

46 citations

Proceedings Article•DOI•

FarsiSum: a Persian text summarizer

[...]

Martin Hassel¹, Nima Mazdak²•Institutions (2)

Royal Institute of Technology¹, Stockholm University²

28 Aug 2004

TL;DR: FarsiSum is an attempt to create an automatic text summarization system for Persian that uses modules implemented in an existing summarizer geared towards the Germanic languages, a Persian stop-list in Unicode format and a small set of heuristic rules.

...read moreread less

Abstract: FarsiSum is an attempt to create an automatic text summarization system for Persian. The system is implemented as a HTTP client/server application written in Perl. It uses modules implemented in an existing summarizer geared towards the Germanic languages, a Persian stop-list in Unicode format and a small set of heuristic rules.

...read moreread less

46 citations

Journal Article•DOI•

Using topic themes for multi-document summarization

[...]

Sanda M. Harabagiu¹, Finley Lacatusu¹•Institutions (1)

University of Texas at Dallas¹

02 Jul 2010-ACM Transactions on Information Systems

TL;DR: This article presents eight different methods of generating multidocument summaries and evaluates each of these methods on a large set of topics used in past DUC workshops, showing a significant improvement in the quality of summaries based on topic themes over MDS methods that use other alternative topic representations.

...read moreread less

Abstract: The problem of using topic representations for multidocument summarization (MDS) has received considerable attention recently. Several topic representations have been employed for producing informative and coherent summaries. In this article, we describe five previously known topic representations and introduce two novel representations of topics based on topic themes. We present eight different methods of generating multidocument summaries and evaluate each of these methods on a large set of topics used in past DUC workshops. Our evaluation results show a significant improvement in the quality of summaries based on topic themes over MDS methods that use other alternative topic representations.

...read moreread less

46 citations

DOI•

Domain-specific informative and indicative summarization for information retrieval

[...]

Judith L. Klavans¹, Min-Yen Kan¹, Kathleen R. McKeown¹•Institutions (1)

Columbia University¹

01 Jan 2001

TL;DR: The use of multido ument summarization as a post-pro essing step in do ument retrieval is proposed and the use of the summary as a repla ement to the standard ranked list is examined.

...read moreread less

Abstract: In this paper, we propose the use of multido ument summarization as a post-pro essing step in do ument retrieval We examine the use of the summary as a repla ement to the standard ranked list The form of the summary is novel be ause it has both informative and indi ate elements, designed to help di erent users perform their tasks better Our summary uses the do uments' topi al stru ture as a ba kbone for its own stru ture, as it was deemed the most useful do ument feature in our study of a orpus of summaries

...read moreread less

46 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
…
65
66
67
68
69
70
71
…
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
142
143
144
145
146
147
148
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

Network Information

Performance

Metrics

2,507

Papers

81,726

Citations

No. of papers in the topic in previous years
Year	Papers
2023	74
2022	160
2021	52
2020	61
2019	47
2018	52

Multi-document summarization

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics