scispace - formally typeset
Proceedings ArticleDOI

Query-oriented Unsupervised Multi-document Summarization on Big Data

Reads0
Chats0
TLDR
This paper proposes a hybrid MDS technique combining feature based algorithms and dynamic programming for generating a summary from multiple documents based on user provided query for serving a concise summary of multiple Webpage contents for a given user query in reduced time duration.
Abstract
Real time document summarization is a critical need nowadays, owing to the large volume of information available for our reading, and our inability to deal with this entirely due to limitations of time and resources. Oftentimes, information is available in multiple sources, offering multiple contexts and viewpoints on a single topic of interest. Automated multi-document summarization (MDS) techniques aim to address this problem. However, current techniques for automated MDS suffer from low precision and accuracy with reference to a given subject matter, when compared to those summaries prepared by humans and takes large time to create the summary when the input given is too huge. In this paper, we propose a hybrid MDS technique combining feature based algorithms and dynamic programming for generating a summary from multiple documents based on user provided query. Further, in real-world scenarios, Web search serves up a large number of URLs to users, and the work of making sense of these with reference to a particular query is left to the user. In this context, an efficient parallelized MDS technique based on Hadoop is also presented, for serving a concise summary of multiple Webpage contents for a given user query in reduced time duration.

read more

Citations
More filters
Journal ArticleDOI

Review on Query focused Multi-Document Summarization (QMDS) with Comparative Analysis

Suman Kundu
TL;DR: The problem of query-focused multi-document summarization (QMDS) is to generate a summary from multiple source documents on identical/similar topics based on the query submitted by the users as discussed by the authors .
References
More filters
Journal ArticleDOI

The use of MMR, diversity-based reranking for reordering documents and producing summaries

TL;DR: A method for combining query-relevance with information-novelty in the context of text retrieval and summarization and preliminary results indicate some benefits for MMR diversity ranking in document retrieval and in single document summarization.
Proceedings ArticleDOI

The Use of MMR and Diversity-Based Reranking for Reodering Documents and Producing Summaries

TL;DR: The MaximalMarginal Relevance (MMR) criterion as mentioned in this paper aims to reduce redundancy while maintaining query relevance in retrieving retrieved documents and selecting appropriate passages for text summarization.
Proceedings ArticleDOI

Graph-based ranking algorithms for sentence extraction, applied to text summarization

TL;DR: This paper presents an innovative unsupervised method for automatic sentence extraction using graph-based ranking algorithms and shows that the results obtained compare favorably with previously published results on established benchmarks.
Journal ArticleDOI

Automatic summarising: The state of the art

TL;DR: The conclusions drawn are that automatic summarisation has made valuable progress, with useful applications, better evaluation, and more task understanding, but summarising systems are still poorly motivated in relation to the factors affecting them, and evaluation needs taking much further to engage with the purposes summaries are intended to serve.