Home
/
Topics
/
Multi-document summarization

Topic

Multi-document summarization

About: Multi-document summarization is a research topic. Over the lifetime, 2270 publications have been published within this topic receiving 71850 citations.

...read moreread less

Papers published on a yearly basis

2023
2022
2021
2020
2019
2018
2017
2016
2015
2014
2013
2012
2011
2010
2009
2008
2007
2006
2005
2004
2003
2002
2001
2000
1999
1998
1997
1996
1995
1994
1991
1989
1987
1986
1985
1982

Papers

PDF

Open Access

More filters

Journal Article•DOI•

The CPR Model for Summarizing Video

[...]

Marat Fayzullin¹, V. S. Subrahmanian¹, Antonio Picariello, Maria Luisa Sapino²•Institutions (2)

University of Maryland, College Park¹, University of Turin²

01 Jun 2005-Multimedia Tools and Applications

TL;DR: This work proposes a model of video summarization based on three important parameters: Priority, Continuity, and non-Repetition, and shows examples of how CPR parameters can be computed and provide algorithms to find optimal summaries based on the CPR approach.

...read moreread less

Abstract: Most past work on video summarization has been based on selecting key frames from videos. We propose a model of video summarization based on three important parameters: Priority (of frames), Continuity (of the summary), and non-Repetition (of the summary). In short, a summary must include high priority frames and must be continuous and non-repetitive. An optimal summary is one that maximizes an objective function based on these three parameters. We show examples of how CPR parameters can be computed and provide algorithms to find optimal summaries based on the CPR approach. Finally, we briefly report on the performance of these algorithms.

...read moreread less

18 citations

Posted Content•

Document summarization using positive pointwise mutual information

[...]

S. Aji, M. Ramachandra Kaimal

08 May 2012-arXiv: Information Retrieval

TL;DR: In this paper, the authors used Positive Pointwise Mutual Information (PPMI) to assign weights for each entry in the Term-Sentence-Matrix (TSM) and then used the Sentence-Rank-Matrix generated from this weighted TSM, is then used to extract a summary from the document.

...read moreread less

Abstract: The degree of success in document summarization processes depends on the performance of the method used in identifying significant sentences in the documents. The collection of unique words characterizes the major signature of the document, and forms the basis for Term-Sentence-Matrix (TSM). The Positive Pointwise Mutual Information, which works well for measuring semantic similarity in the Term-Sentence-Matrix, is used in our method to assign weights for each entry in the Term-Sentence-Matrix. The Sentence-Rank-Matrix generated from this weighted TSM, is then used to extract a summary from the document. Our experiments show that such a method would outperform most of the existing methods in producing summaries from large documents.

...read moreread less

18 citations

Journal Article•DOI•

Multi-document summarization via group sparse learning

[...]

Ruifang He¹, Jiliang Tang², Pinghua Gong², Qinghua Hu¹, Bo Wang¹ - Show less +1 more•Institutions (2)

Tianjin University¹, Arizona State University²

01 Jul 2016-Information Sciences

TL;DR: A novel compressive sensing based multi-document summarization with group sparse learning (SGS) framework is proposed, which can maximally reconstruct the original documents via minimizing the approximation error and jointly select summary sentences with the learnt group structure information among sentences.

...read moreread less

18 citations

Journal Article•DOI•

An Empirical Survey on Long Document Summarization: Datasets, Models, and Metrics

[...]

Huan Yee Koh, Ming Liu, Shirui Pan

03 Jul 2022-ACM Computing Surveys

TL;DR: This survey provides a comprehensive overview of the research on long document summarization and a systematic evaluation across the three principal components of its research setting: benchmark datasets, summarization models, and evaluation metrics.

...read moreread less

Abstract: Long documents such as academic articles and business reports have been the standard format to detail out important issues and complicated subjects that require extra attention. An automatic summarization system that can effectively condense long documents into short and concise texts to encapsulate the most important information would thus be significant in aiding the reader’s comprehension. Recently, with the advent of neural architectures, significant research efforts have been made to advance automatic text summarization systems, and numerous studies on the challenges of extending these systems to the long document domain have emerged. In this survey, we provide a comprehensive overview of the research on long document summarization and a systematic evaluation across the three principal components of its research setting: benchmark datasets, summarization models, and evaluation metrics. For each component, we organize the literature within the context of long document summarization and conduct an empirical analysis to broaden the perspective on current research progress. The empirical analysis includes a study on the intrinsic characteristics of benchmark datasets, a multi-dimensional analysis of summarization models, and a review of the summarization evaluation metrics. Based on the overall findings, we conclude by proposing possible directions for future exploration in this rapidly growing field.

...read moreread less

18 citations

Proceedings Article•DOI•

Incorporating prior knowledge into a transductive ranking algorithm for multi-document summarization

[...]

Massih R. Amini¹, Nicolas Usunier²•Institutions (2)

National Research Council¹, Pierre-and-Marie-Curie University²

19 Jul 2009

TL;DR: This paper presents a transductive approach to learn ranking functions for extractive multi-document summarization by identifying topic themes within a document collection and iteratively trains a ranking function over these two sets of sentences.

...read moreread less

Abstract: This paper presents a transductive approach to learn ranking functions for extractive multi-document summarization. At the first stage, the proposed approach identifies topic themes within a document collection, which help to identify two sets of relevant and irrelevant sentences to a question. It then iteratively trains a ranking function over these two sets of sentences by optimizing a ranking loss and fitting a prior model built on keywords. The output of the function is used to find further relevant and irrelevant sentences. This process is repeated until a desired stopping criterion is met.

...read moreread less

18 citations

1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
58
59
60
61
62
63
64
65
66
67
68
69
70
71
72
73
74
75
76
77
78
79
80
81
82
83
84
85
86
87
88
89
90
91
92
93
94
95
96
97
98
99
100
101
102
103
104
105
106
107
108
109
110
111
112
113
114
115
116
117
118
119
120
121
122
123
124
125
126
127
128
129
130
131
132
133
134
135
136
137
138
139
140
141
…
142
143
144
145
146
147
148
…
149
150
151
152
153
154
155
156
157
158
159
160
161
162
163
164
165
166
167
168
169
170
171
172
173
174
175
176
177
178
179
180
181
182
183
184
185
186
187
188
189
190
191
192
193
194
195
196
197
198
199
200

Collapse

Network Information

Performance

Metrics

2,507

Papers

81,726

Citations

No. of papers in the topic in previous years
Year	Papers
2023	74
2022	160
2021	52
2020	61
2019	47
2018	52

Multi-document summarization

Papers published on a yearly basis

Papers

Trending Questions (10)

Network Information

Related Topics (5)

Performance

Metrics