Inter and Intra Cluster on Self-adaptive Differential Evolution for Multi-document Summarization

doi:10.21609/JIKI.V11I2.547

Open Access Journal Article


Vol. 11, Iss. 2, pp. 86-94

TLDR

This paper proposes an inter- and intra-cluster approach consisting of four weighted criteria functions (coherence, coverage, diversity, and inter-cluster analysis), optimized with SaDE (Self-adaptive Differential Evolution) to obtain the best summary.

Abstract:

Multi-document summarization is more challenging than single-document summarization because the search space is larger and the content differs from document to document. Some optimization algorithms therefore weigh several criteria when producing a summary, such as relevancy, content coverage, and diversity. These weighted criteria rest on the assumption that the documents already belong to the same cluster. In some cases, however, the documents span many categories, and this also needs to be taken into account. In this paper, we propose an inter- and intra-cluster approach consisting of four weighted criteria functions (coherence, coverage, diversity, and inter-cluster analysis), optimized with SaDE (Self-adaptive Differential Evolution) to obtain the best summary. The proposed method therefore addresses not only the compactness within each cluster but also the separation between clusters. Experiments on the Text Analysis Conference (TAC) 2008 datasets yield better summaries, with average ROUGE-1 precision, recall, and F-measure of 0.77, 0.07, and 0.12, compared with a method that considers only intra-cluster analysis.
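The abstract reports ROUGE-1 precision, recall, and F-measure. As a rough illustration of what those three numbers measure, here is a minimal sketch of unigram-overlap scoring. It is not the authors' evaluation script: the function name `rouge_1` is hypothetical, and lowercasing plus whitespace tokenization are simplifying assumptions (the official ROUGE toolkit offers further options such as stemming).

```python
from collections import Counter


def rouge_1(candidate: str, reference: str) -> tuple[float, float, float]:
    """Return (precision, recall, f1) from clipped unigram overlap.

    Assumption: tokens are lowercased, whitespace-separated words.
    """
    cand_counts = Counter(candidate.lower().split())
    ref_counts = Counter(reference.lower().split())

    # Counter intersection keeps the minimum count per token, so a word
    # repeated in the candidate is credited at most as often as it
    # appears in the reference.
    overlap = sum((cand_counts & ref_counts).values())

    precision = overlap / sum(cand_counts.values()) if cand_counts else 0.0
    recall = overlap / sum(ref_counts.values()) if ref_counts else 0.0
    f1 = 2 * precision * recall / (precision + recall) if overlap else 0.0
    return precision, recall, f1


if __name__ == "__main__":
    p, r, f = rouge_1("the cat sat", "the cat sat on the mat")
    print(f"precision={p:.2f} recall={r:.2f} f1={f:.2f}")
```

In this toy case every candidate word appears in the reference, while the candidate covers only half of the reference words. Precision can therefore be high while recall stays low when a summary is much shorter than its reference.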



