scispace - formally typeset
Open AccessJournal ArticleDOI

Inter and Intra Cluster on Self-adaptive Differential Evolution for Multi-document Summarization

TLDR
This paper proposes an inter and intra cluster which consist of four weighted criteria functions (coherence, coverage, diversity, and inter-cluster analysis) to be optimized by using SaDE (Self Adaptive Differential Evolution) to get the best summary result.
Abstract
Multi – document as one of summarization type has become more challenging issue than single-document because its larger space and its different content of each document. Hence, some of optimization algorithms consider some criteria in producing the best summary, such as relevancy, content coverage, and diversity. Those weighted criteria based on the assumption that the multi-documents are already located in the same cluster. However, in a certain condition, multi-documents consist of many categories and need to be considered too. In this paper, we propose an inter and intra cluster which consist of four weighted criteria functions (coherence, coverage, diversity, and inter-cluster analysis) to be optimized by using SaDE (Self Adaptive Differential Evolution) to get the best summary result. Therefore, the proposed method will deal not only with the value of compactness quality of the cluster within but also the separation of each cluster. Experimental results on Text Analysis Conference (TAC) 2008 datasets yields better summaries results with average ROUGE-1 on precision, recall, and f - measure 0.77, 0.07, and 0.12 compared to another method that only consider the analysis of intra-cluster.

read more

Citations
More filters
Journal ArticleDOI

Review of automatic text summarization techniques & methods

TL;DR: This paper provides a broad and systematic review of research in the field of text summarization published from 2008 to 2019 and describes the techniques and methods that are often used by researchers as a comparison and means for developing methods.
Journal ArticleDOI

Generación de resúmenes extractivos de múltiples documentos usando grafos semánticos

TL;DR: The conceptualization and underlying semantics structure of the textual content is represented in a semantic graph using WordNet, and a concept clustering algorithm is applied to identifying the topics of the documents set, with which the relevance of the sentences is evaluated to build the summary.
References
More filters
Posted Content

On Information and Sufficiency

TL;DR: The information deviation between any two finite measures cannot be increased by any statistical operations (Markov morphisms) and is invarient if and only if the morphism is sufficient for these two measures as mentioned in this paper.
Journal ArticleDOI

The automatic creation of literature abstracts

TL;DR: In the exploratory research described, the complete text of an article in machine-readable form is scanned by an IBM 704 data-processing machine and analyzed in accordance with a standard program.
Journal ArticleDOI

A new sentence similarity measure and sentence based extractive technique for automatic text summarization

TL;DR: The purpose of present paper is to show, that summarization result not only depends on optimized function, and also depends on a similarity measure.
Journal ArticleDOI

Extensions of kmeans-type algorithms: a new clustering framework by integrating intracluster compactness and intercluster separation.

TL;DR: Experimental studies demonstrate that the proposed algorithms outperform the state-of-the-art kmeans-type clustering algorithms with respect to four metrics: accuracy, RandIndex, Fscore, and normal mutual information.

Comparison Between K-Mean and Hierarchical Algorithm Using Query Redirection

Manpreet kaur
TL;DR: The proposed work represents query redirection method that improved K-means clustering algorithm performance and accuracy in distributed environment and shows that k-mean algorithm performs better as compared to hierarchical algorithm and takes less time for execution.
Related Papers (5)