scispace - formally typeset
Search or ask a question
Topic

Multi-document summarization

About: Multi-document summarization is a research topic. Over the lifetime, 2270 publications have been published within this topic receiving 71850 citations.


Papers
More filters
Proceedings ArticleDOI
11 May 2010
TL;DR: This paper attempts to provide a background study of the various classical methods proposed by researchers for automatic text summarization, with special focus on the most widely used algebraic methods called Singular Value Decomposition and Non-negative Matrix Factorization.
Abstract: Various kinds of information that is available on a topic electronically has abundantly increased over the past years. It has led the information highway to a situation called “information overload” problem. Automatic text summarization technique mainly addresses this issue by the extraction of a shortened version of information from texts written about the same topic. Several algebraic reduction methods are used to identify and extract the semantically important texts in a document to summarize it automatically. This paper attempts to provide a background study of the various classical methods proposed by researchers for automatic text summarization. Special focus is given to the most widely used algebraic methods called Singular Value Decomposition (SVD) and Non-negative Matrix Factorization (NMF). This work sheds more light on the application of SVD and NMF techniques on automatic text summarization. Attention is also devoted in this work to analyze the advantages and disadvantages of each approach.

9 citations

Proceedings ArticleDOI
01 Oct 2016
TL;DR: This paper reviews all the features that use metrics and concept of complex network for scoring sentences in extractive text summarization and proposes various combinations of various features.
Abstract: Automatic text summarization is an important research area in the domain of information systems. It aims to create a compressed version of documents, which should cover all the significant contents and overall meaning. In extractive text summarization, sentences are scored on various of features. A large number of features network based have been proposed by researchers in the past literatures. This paper reviews all the features that use metrics and concept of complex network for scoring sentences. The experiment results on single feature and combinations of various features we proposed are discussed. Quantitative and qualitative aspects were considered in our assessment performing on the DUC 2002 data sets.

9 citations

Proceedings ArticleDOI
08 May 2014
TL;DR: This paper proposes an automatic text summarization technique using both linguistic and statistical features using successive threshold for finding the summary i.e important sentences from the given input text document.
Abstract: Text summarization is an emerging technique for finding out the summary of the text document. Text summarization is nothing but summarizing the content of given text document. Text summarization has got so uses such as Due to the massive amount of information getting increased on internet; it is difficult for the user to go through all the information available on web. Summarization techniques need to be used to reduce the user's time in reading the whole information available on web. In this paper, we propose an automatic text summarization technique using both linguistic and statistical features using successive threshold for finding the summary i.e important sentences from the given input text document. Here the sentences are selected for summary based on the weight of the sentence. The weight of the sentences is calculated based on the statistical and linguistic features. Our approach assigns scores to the sentences by weighting the features like term frequency, word occurrences, and noun weight, phrases etc. In our approach, the number of sentences present in our summary would be equal to the number of paragraphs present in a text document, which can be achieved by using our successive threshold approach.

9 citations

Proceedings ArticleDOI
02 Dec 2013
TL;DR: The goal is to define a measurement for text summarization using Semantic Analysis Approach for Documents in Indonesian language using WordNet to obtain the similarity between sentences by calculating the vector values of each sentence with the title.
Abstract: Research about text summarization has been quite an interesting topic over the years, proven by numerous number of papers related with discussion of their studies such as approaches, challenges and trends. This paper's goal is to define a measurement for text summarization using Semantic Analysis Approach for Documents in Indonesian language. The applied measurement requires Indonesian version of WordNet which had been implemented roughly. The main idea of semantic analysis is to obtain the similarity between sentences by calculating the vector values of each sentence with the title. The need ofWordNet is to define the depth of each word as being computed for word similarity. Combining all required formulas and calculations, a compact and precise summarization is produced without depriving the gist information of certain document.

9 citations

Proceedings ArticleDOI
TL;DR: Through an understanding of the user's goals and concerns, means for measuring the success of the summarization tools are presented and guidelines for the successful use of summarization in consumer video devices are discussed.
Abstract: The immediate availability of a vast amount of multimedia content has created a growing need for improvements in the field of content analysis and summarization. While researchers have been rapidly making contributions and improvements to the field, we must never forget that content analysis and summarization themselves are not the user's goals. Users' primary interests fall into one of two categories; they normally either want to be entertained or want to be informed (or both). Summarization is therefore just another tool for improving the entertainment value or the information gathering value of the video watching experience. In this paper, we first explore the relationship between the viewer, the interface, and the summarization algorithms. Through an understanding of the user's goals and concerns, we present means for measuring the success of the summarization tools. Guidelines for the successful use of summarization in consumer video devices are also discussed.

9 citations


Network Information
Related Topics (5)
Natural language
31.1K papers, 806.8K citations
85% related
Ontology (information science)
57K papers, 869.1K citations
84% related
Web page
50.3K papers, 975.1K citations
83% related
Recurrent neural network
29.2K papers, 890K citations
83% related
Graph (abstract data type)
69.9K papers, 1.2M citations
83% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
202374
2022160
202152
202061
201947
201852