scispace - formally typeset
Search or ask a question
Topic

Multi-document summarization

About: Multi-document summarization is a research topic. Over the lifetime, 2270 publications have been published within this topic receiving 71850 citations.


Papers
More filters
Proceedings ArticleDOI
10 Sep 2007
TL;DR: The paper proposes a model for adaptation of Web content and summarization and classification of text documents in specified categories based on user behavior and preferences and aims to provide mobile Internet users with a useful tool for web access.
Abstract: With the rapid growth of the small screen audience, the need for less cumbersome methods of browsing information on the Internet is growing too. Small screen devices provide portability in accessing huge information space on the Internet. It is unattainable to load and visualize large documents on hand held devices. Our objective is ultimately to provide mobile Internet users with a useful tool for web access. The paper proposes a model for adaptation of Web content and summarization and classification of text documents in specified categories. This paper describes the text classification and summarization based on user behavior and preferences. Experiments have been carried out on text database of 500 files and results are found to be quite satisfactory.

1 citations

Journal Article
TL;DR: A special method that creates text summary by discovering thematic areas from Chinese text by adopting k-medoids clustering method as well as a novel clustering analysis method based on self-defined objective function is proposed.
Abstract: Automatic summarization is an important issue in Natural Language Processing. This paper has proposed a special method that creates text summary by discovering thematic areas from Chinese text. The specificity of the method is that the created summary can both cover as many as different themes and reduce its redundancy obviously at the same time. And the discovery of latent thematic areas under the adaptive clustering of passages is realized by adopting k-medoids clustering method as well as a novel clustering analysis method based on self-defined objective function. In addition, a novel parameter,which is known as representation entropy,is used for summarization redun- dancy evaluation. Experimental results indicate that this method is effective and efficient in the automatic summariza- tion literature.

1 citations

Book ChapterDOI
01 Jan 2020
TL;DR: This chapter classifies the presented methodology, highlights the main pros and cons, and discusses the perspectives of the extension of the current research towards cross-lingual summarization systems.
Abstract: The recent advances in multimedia and web-based applications have eased the accessibility to large collections of textual documents. To automate the process of document analysis, the research community has put relevant efforts into extracting short summaries of the document content. However, most of the early proposed summarization methods were tailored to English-written textual corpora or to collections of documents all written in the same language. More recently, the joint efforts of the machine learning and the natural language processing communities have produced more portable and flexible solutions, which can be applied to documents written in different languages. This chapter first overviews the most relevant language-specific summarization algorithms. Then, it presents the most recent advances in multiand cross-lingual text summarization. The chapter classifies the presented methodology, highlights the main pros and cons, and discusses the perspectives of the extension of the current research towards cross-lingual summarization systems. Combining Machine Learning and Natural Language Processing for Language-Specific, MultiLingual, and Cross-Lingual Text Summarization: A Wide-Ranging Overview Luca Cagliero Politecnico di Torino, Italy

1 citations


Network Information
Related Topics (5)
Natural language
31.1K papers, 806.8K citations
85% related
Ontology (information science)
57K papers, 869.1K citations
84% related
Web page
50.3K papers, 975.1K citations
83% related
Recurrent neural network
29.2K papers, 890K citations
83% related
Graph (abstract data type)
69.9K papers, 1.2M citations
83% related
Performance
Metrics
No. of papers in the topic in previous years
YearPapers
202374
2022160
202152
202061
201947
201852