scispace - formally typeset
Search or ask a question
Topic

Multi-document summarization

About: Multi-document summarization is a research topic. Over the lifetime, 2270 publications have been published within this topic receiving 71850 citations.


Papers
More filters
Journal ArticleDOI
TL;DR: This study focuses on metadata generation for producing visualized knowledge summarization and develops a prototype called FTCat©, which has practicality in summarizing news reports.
Abstract: Summarized text is a simplified and condensed versi on of the original text containing highlighted information to help the audience get the gist in a short period of time. Typically, text summarization produces abstract or a paragraph-like outputs by om itting details and irrelevant information. However, the text summary can also be produced in a visualized f orm, such as a chart, graph or table representing a collection of similar cases. The visualized version generates a statistical-like presentation, which o ften involves numerical and ordinal observation of the g athered knowledge from the text. This requires lexi cal syntactic understanding of the text. Essential to a chieve this goal is topic identification, message analysis/interpretation and knowledge summarization generation. The objective of this study is to mode l knowledge summarization problem using the evolving fuzzy grammar technique and we focus on metadata generation for producing visualized knowledge summarization. The process comprises of: (i) identifying the underlying structure of the texts for knowledge sum marization, (ii) represent the identified knowledge for summarization manipulation and (iii) presentation o f the summarized knowledge. A prototype called FTCat© is developed as a proof of concept and we demonstra te its practicality in summarizing news reports.

1 citations

Book ChapterDOI
I. Mani1
01 Jan 2006
TL;DR: The principal challenges, approaches, and outstanding problems in summarization are summarized and an account of evaluation methods as well as areas for future research are offered.
Abstract: With the explosion in the quantity of on-line text and multimedia information in recent years, demand for automatic summarization systems is growing. The goal of automatic summarization is to take a source document or documents, extract information content from it, and present the most important content in a condensed form in a manner sensitive to the needs of the user and task. This article summarizes the principal challenges, approaches, and outstanding problems in summarization. It offers an account of evaluation methods as well as areas for future research.

1 citations

Journal Article
TL;DR: The proposed method provides an automatic relevance judgment to reformulate query using the clustering method for minimizing a bias of query expansion and can improve the quality of document summarization since the summarized documents are influenced by the semantic features of documents and the expanded query.
Abstract: According to the increment of accessible text data source on the internet, it has increased the necessity of the automatic text document summarization. However, the performance of the automatic methods might be poor because the semantic gap between high level user's summary requirement and low level vector representation of machine exists. In this paper, to overcome that problem, we propose a new document summarization method using a pseudo relevance feedback based on clustering method and NMF (non-negative matrix factorization). Relevance feedback is effective technique to minimize the semantic gap of information processing, but the general relevance feedback needs an intervention of a user. Additionally, the refined query without user interference by pseudo relevance feedback may be biased. The proposed method provides an automatic relevance judgment to reformulate query using the clustering method for minimizing a bias of query expansion. The method also can improve the quality of document summarization since the summarized documents are influenced by the semantic features of documents and the expanded query. The experimental results demonstrate that the proposed method achieves better performance than the other document summarization methods.

1 citations

28 Feb 2014
TL;DR: A survey on different extractive techniques of text summarization, a very useful task that gives support to many other tasks as well as, it takes advantage of the techniques developed for related Natural Language Processing tasks.
Abstract: This paper presents a survey on different extractive techniques of text summarization. Text Summarization is a challenging problem these days. The rapid development of emerging technologies poses new challenges to this research field. A summary can give an overview of the original document in a shorter period of time. Readers may decide whether or not to read the complete document after going through the summary. Internet is a great advancement in the field of technology. Nowadays, people can search on Internet for everything. Internet provides large amount of information in the form of web pages. Thus, it is impossible for a person to read web pages completely and make the selection of required web pages. So, the need of producing summaries has become more and more widespread. Summarization offers the possibility of finding main points of text and so the user will spend less time on reading the whole document. Different types of summary might be useful in various applications and summarization can be classified based on these types. Summarization is a very useful task that gives support to many other tasks as well as, it takes advantage of the techniques developed for related Natural Language Processing tasks. Keywords— Text Summarization, Natural Language Processing, Centroid, Cluster

1 citations


Network Information
Related Topics (5)
Natural language
31.1K papers, 806.8K citations
85% related
Ontology (information science)
57K papers, 869.1K citations
84% related
Web page
50.3K papers, 975.1K citations
83% related
Recurrent neural network
29.2K papers, 890K citations
83% related
Graph (abstract data type)
69.9K papers, 1.2M citations
83% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
202374
2022160
202152
202061
201947
201852