A novel graph clustering algorithm based on discrete-time quantum random walk

doi:10.1016/B978-0-12-804409-4.00011-5

Book Chapter•DOI•

A novel graph clustering algorithm based on discrete-time quantum random walk

S.G. Roy¹, Amlan Chakrabarti²•Institutions (2)

Techno India College of Technology¹, Information Technology University²

01 Jan 2017-pp 361-389

TL;DR: This chapter explains how quantum random walk helps in graph-based clustering and proposes a new quantum clustering algorithm, based on the discrete-time quantum random walk, that finds the clusters from a given adjacency matrix of a graph.

Abstract: The clustering activity is an unsupervised learning observation which coalesces the data into segments. Grouping of data is done by identification of common characteristics that are labeled as similarities among data on the basis of their characteristics. Graph clustering is a tool needed in many computer applications, such as network routing, analysis of social networks, computer vision, and VLSI physical design. This chapter explains how quantum random walk helps in graph-based clustering, and we propose a new quantum clustering algorithm. The proposed quantum clustering algorithm is based on the discrete-time quantum random walk, which finds the clusters from a given adjacency matrix of a graph. We give a quantum circuit model and Quantum Computing Language-based simulation of our algorithm and illustrate its faster rate of convergence. Simulation results for experimental graphs illustrate that our proposed algorithm shows an exponential speedup over existing classical algorithms.
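
The abstract does not spell out the chapter's circuit or clustering rule, so the following is only a minimal sketch of the underlying building block: a generic coined discrete-time quantum walk driven by an adjacency matrix, simulated classically. The Grover coin, the flip-flop shift, the function name `quantum_walk_probabilities`, and the two-triangle test graph are all assumptions chosen for illustration; none of them is taken from the chapter.

```python
from math import sqrt


def quantum_walk_probabilities(adjacency, start, steps):
    """Vertex probabilities after `steps` of a Grover-coin walk (illustrative).

    Assumes a symmetric 0/1 adjacency matrix and a start vertex with at
    least one neighbour.
    """
    n = len(adjacency)
    neighbours = [[v for v in range(n) if adjacency[u][v]] for u in range(n)]
    # Basis states are directed arcs (u, v): walker on u, coin pointing at v.
    amplitude = {(u, v): 0.0 for u in range(n) for v in neighbours[u]}
    for v in neighbours[start]:
        amplitude[(start, v)] = 1.0 / sqrt(len(neighbours[start]))
    for _ in range(steps):
        coined = {}
        for u in range(n):
            if not neighbours[u]:
                continue
            mean = sum(amplitude[(u, w)] for w in neighbours[u]) / len(neighbours[u])
            for v in neighbours[u]:
                # Grover coin: reflect each amplitude about the local mean.
                coined[(u, v)] = 2.0 * mean - amplitude[(u, v)]
        # Flip-flop shift: the walker moves along its arc and the arc reverses.
        amplitude = {(v, u): a for (u, v), a in coined.items()}
    probabilities = [0.0] * n
    for (u, _), a in amplitude.items():
        probabilities[u] += a * a
    return probabilities


# Hypothetical test graph: two triangles joined by the bridge 2-3.
EDGES = [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]
ADJACENCY = [[0] * 6 for _ in range(6)]
for a, b in EDGES:
    ADJACENCY[a][b] = ADJACENCY[b][a] = 1

print(quantum_walk_probabilities(ADJACENCY, start=0, steps=10))
```

Both the coin and the shift are unitary, so the printed probabilities always sum to 1. How such a distribution is turned into cluster labels is the chapter's contribution and is not reproduced here.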

Citations

Journal Article•DOI•

A machine learning approach to cluster destination image on Instagram

Veronika Arefieva¹, Roman Egger², Joanne Yu²•Institutions (2)

Johannes Kepler University of Linz¹, University of Salzburg²

01 Aug 2021-Tourism Management

TL;DR: This study constructed a novel methodological framework by evaluating different machine learning models to group textual information based on pictorial content to uncover the destination image based on Instagram photographs.

50 citations

Journal Article•DOI•

Infinite mixture models for operational modal analysis: An automated and principled approach

Prasad Cheema¹,², Mehrisadat Makki Alamdari³,⁴, Gareth A. Vio¹, Feng-Liang Zhang³, Chul-Woo Kim³•Institutions (4)

University of Sydney¹, Commonwealth Scientific and Industrial Research Organisation², Kyoto University³, University of New South Wales⁴

20 Jan 2021-Journal of Sound and Vibration

TL;DR: This study presents one of the first attempts of DP-GMM for full automation of Operational Modal Analysis (OMA), validated using the field test data from a large-scale operating cable-stayed bridge, which has two closely-spaced modes around 3 Hz.

16 citations

Journal Article•DOI•

Multivariate assessment of groundwater quality in the basement rocks of Osun State, Southwest, Nigeria

J. A. Awomeso¹, Syed Masood Ahmad², Adewale Matthew Taiwo¹•Institutions (2)

Federal University of Agriculture, Abeokuta¹, University of Bologna²

01 Mar 2020-Environmental Earth Sciences

TL;DR: In this paper, the authors assessed the possible contamination source to groundwater quality in the basement rocks of Osun State, South-Western Nigeria using multivariate analyses of Principal Component Analysis (PCA) and Cluster Analysis (CA).

Abstract: Groundwater is a major source of drinking water in many rural and urban areas of developing nations. Pollution of groundwater from diverse sources is an issue of concern due to inherent health problems. This study assessed the possible contamination source to groundwater quality in the basement rocks of Osun State, South-Western Nigeria using multivariate analyses of Principal Component Analysis (PCA) and Cluster Analysis (CA). The secondary data from 536 wells across the 30 Local Government Areas in the State were collected from the Rural Water and Environmental Sanitation Agency (RUWESA). The groundwater data include pH, temperature, turbidity, oxido-reduction potential, total dissolved solids, electrical conductivity, total alkalinity, magnesium hardness, calcium hardness, total hardness, free chlorine, total chlorine, chloride, fluoride, nitrate, nitrite, iron, manganese and zinc. The data were subjected to simple and inferential statistics using the Statistical Package for Social Sciences (SPSS vs. 21.0). The mean results of groundwater parameters such as nitrate and Mn were higher than the World Health Organisation (WHO) limits of 0.4 and 10 mg/L, respectively. The results of the PCA and CA revealed possible sources of pollutants to the groundwater quality as weathering of bedrocks, leachate from septic tanks and dumpsites, runoff of materials, hardness, nutrients from agricultural lands, and chlorine pollution.

15 citations

Cites methods from "A novel graph clustering algorithm ..."

...CA is the method based on comparing data structures (nodes) with one another based on their similarity (Roy and Chakrabarti 2017)....

Posted Content•DOI•

An aerosol classification scheme for global simulations using the K-means machine learning method

Jingmin Li¹, Johannes Hendricks¹, Mattia Righi¹, Christof Gerhard Beer¹•Institutions (1)

German Aerospace Center¹

23 Jul 2021-Geoscientific Model Development Discussions

TL;DR: In this paper, a machine learning K-means algorithm is applied to data of seven aerosol properties from a global aerosol simulation using EMAC-MADE3 to partition the aerosol properties across the global atmosphere into specific aerosol regimes.

Abstract: A machine learning K-means algorithm is applied to data of seven aerosol properties from a global aerosol simulation using EMAC-MADE3. The aim is to partition the aerosol properties across the global atmosphere in specific aerosol regimes. K-means is an unsupervised machine learning method with the advantage that an a priori definition of the aerosol classes is not required. Using K-means, we are able to quantitatively define global aerosol regimes, so-called aerosol clusters, and explain their internal properties as well as their location and extension. This analysis shows that aerosol regimes in the lower troposphere are strongly influenced by emissions. Key drivers of the clusters’ internal properties and spatial distribution are, for instance, pollutants from biomass burning/biogenic sources, mineral dust, anthropogenic pollution, as well as their mixing. Several continental clusters propagate into oceanic regions. The identified oceanic regimes show a higher degree of pollution in the northern hemisphere than over the southern oceans. With increasing altitude, the aerosol regimes propagate from emission-induced clusters in the lower troposphere to roughly zonally distributed regimes in the middle troposphere and in the tropopause region. Notably, three polluted clusters identified over Africa, India and eastern China cover the whole atmospheric column from the lower troposphere to the tropopause region. A markedly wide application potential of the classification procedure is identified and further aerosol studies are proposed which could benefit from this classification.

5 citations

Additional excerpts

...Model Dev., 15, 509–533, 2022 gupta, 2016; Roy and Chakrabarti, 2017)....

Proceedings Article•DOI•

Clustering and parallel indexing of big IoT data in the fog‐cloud computing level

Karima Khettabi, Zineddine Kouahla, Brahim Farou, Hamid Seridi, Mohamed Amine Ferrag

07 Mar 2022-European Transactions on Telecommunications

TL;DR: The experimental results showed that the combination of DBSCAN clustering and parallel indexing makes the B3CF‐trees outperform the latest real data indexing methods in terms of quality, and that the use of parallelism during kNN search significantly reduced the retrieval time of the similarity query search.

Abstract: In recent years, the large amount of heterogeneous data generated by the Internet of Things (IoT) sensors and devices made recording and research tasks much more difficult, and most of the state‐of‐the‐art methods have failed to deal with the new IoT requirements. This article proposes a new efficient method that simplifies data indexing and enhances the quality and velocity of the similarity query search in the IoT environment. In this method, the fog layer was divided into two levels. In the clustering fog level, the incremental density‐based spatial clustering of applications with noise (DBSCAN) algorithm was used to separate collected data into clusters in order to minimize data overlap during parallel index construction. Parallelism was also used in the indexing fog level to speed up the similarity‐based search process. The data in each cluster were indexed using our proposed structure called B3CF‐tree (binary tree based on containers at the cloud‐clusters fog computing level). The objects in the leaf nodes of the B3CF‐trees are, finally, stored in the cloud. Using this approach for computing multiple datasets, the retrieval time of the similarity search is significantly reduced. The experimental results showed that the combination of DBSCAN clustering and parallel indexing makes the B3CF‐trees outperform the latest real data indexing methods. For example, in terms of quality, the B3CF‐tree has the smallest number of nodes and leaf nodes. In addition, the use of parallelism during kNN search significantly reduced the retrieval time of the similarity query search.

4 citations

References

Journal Article•DOI•

Modularity and community structure in networks

Mark Newman¹•Institutions (1)

University of Michigan¹

06 Jun 2006-Proceedings of the National Academy of Sciences of the United States of America

TL;DR: In this article, the modularity of a network is expressed in terms of the eigenvectors of a characteristic matrix for the network, which is then used for community detection.

Abstract: Many networks of interest in the sciences, including social networks, computer networks, and metabolic and regulatory networks, are found to divide naturally into communities or modules. The problem of detecting and characterizing this community structure is one of the outstanding issues in the study of networked systems. One highly effective approach is the optimization of the quality function known as “modularity” over the possible divisions of a network. Here I show that the modularity can be expressed in terms of the eigenvectors of a characteristic matrix for the network, which I call the modularity matrix, and that this expression leads to a spectral algorithm for community detection that returns results of demonstrably higher quality than competing methods in shorter running times. I illustrate the method with applications to several published network data sets.

10,137 citations
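
The spectral idea in the abstract above can be sketched as a single bisection step: build the modularity matrix B with entries B_ij = A_ij - k_i k_j / (2m), find its leading eigenvector, and split the vertices by the sign of its entries. This is a simplified illustration and not the paper's full procedure, which also covers repeated subdivision and refinement. The shifted power iteration, the function names, and the two-triangle test graph are assumptions made for this example.

```python
from math import sqrt


def modularity_matrix(adjacency):
    """B[i][j] = A[i][j] - k_i * k_j / (2m) for an undirected graph."""
    n = len(adjacency)
    degree = [sum(row) for row in adjacency]
    two_m = sum(degree)
    return [[adjacency[i][j] - degree[i] * degree[j] / two_m for j in range(n)]
            for i in range(n)]


def leading_eigenvector(matrix, iterations=500):
    """Power iteration on (matrix + shift * I).

    The shift is at least the spectral radius, so every shifted eigenvalue
    is non-negative and the iteration converges to the eigenvector of the
    largest (most positive) eigenvalue of the original matrix.
    """
    n = len(matrix)
    shift = max(sum(abs(x) for x in row) for row in matrix)
    vector = [float(i + 1) for i in range(n)]  # arbitrary non-symmetric start
    for _ in range(iterations):
        product = [sum(matrix[i][j] * vector[j] for j in range(n)) + shift * vector[i]
                   for i in range(n)]
        norm = sqrt(sum(x * x for x in product))
        vector = [x / norm for x in product]
    return vector


def spectral_bisection(adjacency):
    """Split vertices into two groups by the sign of the leading eigenvector."""
    vector = leading_eigenvector(modularity_matrix(adjacency))
    positive = {i for i, x in enumerate(vector) if x > 0}
    return positive, set(range(len(vector))) - positive


# Hypothetical test graph: two triangles joined by the bridge 2-3.
EDGES = [(0, 1), (0, 2), (1, 2), (3, 4), (3, 5), (4, 5), (2, 3)]
ADJACENCY = [[0] * 6 for _ in range(6)]
for a, b in EDGES:
    ADJACENCY[a][b] = ADJACENCY[b][a] = 1

print(spectral_bisection(ADJACENCY))
```

On this graph the two groups are the two triangles. Which of them comes out as the "positive" side depends on the arbitrary sign of the eigenvector.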

Journal Article•DOI•

Multidimensional binary search trees used for associative searching

Jon Louis Bentley¹•Institutions (1)

Stanford University¹

01 Sep 1975-Communications of The ACM

TL;DR: The multidimensional binary search tree (or k-d tree) as a data structure for storage of information to be retrieved by associative searches is developed and it is shown to be quite efficient in its storage requirements.

Abstract: This paper develops the multidimensional binary search tree (or k-d tree, where k is the dimensionality of the search space) as a data structure for storage of information to be retrieved by associative searches. The k-d tree is defined and examples are given. It is shown to be quite efficient in its storage requirements. A significant advantage of this structure is that a single data structure can handle many types of queries very efficiently. Various utility algorithms are developed; their proven average running times in an n record file are: insertion, O(log n); deletion of the root, O(n^((k-1)/k)); deletion of a random node, O(log n); and optimization (guarantees logarithmic performance of searches), O(n log n). Search algorithms are given for partial match queries with t keys specified [proven maximum running time of O(n^((k-t)/k))] and for nearest neighbor queries [empirically observed average running time of O(log n)]. These performances far surpass the best currently known algorithms for these tasks. An algorithm is presented to handle any general intersection query. The main focus of this paper is theoretical. It is felt, however, that k-d trees could be quite useful in many applications, and examples of potential uses are given.

7,159 citations
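
As a rough illustration of the data structure described in the abstract above, the sketch below implements k-d tree insertion and a nearest neighbor query. It follows the textbook formulation (cycle through the k coordinates by depth, prune a subtree when the splitting plane is farther away than the best match so far) and is not a transcription of the paper's algorithms. The class and function names and the sample points are assumptions.

```python
class Node:
    """One k-d tree node holding a point and two subtrees."""

    def __init__(self, point):
        self.point = point
        self.left = None
        self.right = None


def insert(root, point, depth=0):
    """Insert `point`, discriminating on coordinate (depth mod k) at each level."""
    if root is None:
        return Node(point)
    axis = depth % len(point)
    if point[axis] < root.point[axis]:
        root.left = insert(root.left, point, depth + 1)
    else:
        root.right = insert(root.right, point, depth + 1)
    return root


def squared_distance(a, b):
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest(root, query, depth=0, best=None):
    """Return the stored point closest to `query` (Euclidean distance)."""
    if root is None:
        return best
    if best is None or squared_distance(root.point, query) < squared_distance(best, query):
        best = root.point
    axis = depth % len(query)
    gap = query[axis] - root.point[axis]
    near, far = (root.left, root.right) if gap < 0 else (root.right, root.left)
    best = nearest(near, query, depth + 1, best)
    # Visit the far side only if the splitting plane is closer than the best match.
    if gap * gap < squared_distance(best, query):
        best = nearest(far, query, depth + 1, best)
    return best


# Hypothetical sample data.
POINTS = [(2, 3), (5, 4), (9, 6), (4, 7), (8, 1), (7, 2)]
tree = None
for p in POINTS:
    tree = insert(tree, p)

print(nearest(tree, (9, 2)))  # (8, 1)
```

Plain insertion gives logarithmic depth only on average. The paper's "optimization" step, which guarantees balanced trees, is not shown here.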

Proceedings Article•DOI•

Algorithms for quantum computation: discrete logarithms and factoring

Peter W. Shor¹•Institutions (1)

Bell Labs¹

20 Nov 1994

TL;DR: This paper gives Las Vegas algorithms for finding discrete logarithms and factoring integers on a quantum computer that take a number of steps polynomial in the input size, e.g., the number of digits of the integer to be factored.

Abstract: A computer is generally considered to be a universal computational device; i.e., it is believed able to simulate any physical computational device with a cost in computation time of at most a polynomial factor. It is not clear whether this is still true when quantum mechanics is taken into consideration. Several researchers, starting with David Deutsch, have developed models for quantum mechanical computers and have investigated their computational properties. This paper gives Las Vegas algorithms for finding discrete logarithms and factoring integers on a quantum computer that take a number of steps which is polynomial in the input size, e.g., the number of digits of the integer to be factored. These two problems are generally considered hard on a classical computer and have been used as the basis of several proposed cryptosystems. We thus give the first examples of quantum cryptanalysis.

6,961 citations

Proceedings Article•DOI•

A fast quantum mechanical algorithm for database search

Lov K. Grover¹•Institutions (1)

Bell Labs¹

01 Jul 1996

TL;DR: This paper presents a quantum mechanical algorithm that finds a marked item in an unsorted database of N entries in O(√N) steps, whereas a classical search needs O(N) steps.

Abstract: Quantum mechanical computers were proposed in the early 1980's [Benioff80] and shown to be at least as powerful as classical computers, an important but not surprising result, since classical computers, at the deepest level, ultimately follow the laws of quantum mechanics. The description of quantum mechanical computers was formalized in the late 80's and early 90's [Deutsch85] [BB92] [BV93] [Yao93] and they were shown to be more powerful than classical computers on various specialized problems. In early 1994, [Shor94] demonstrated that a quantum mechanical computer could efficiently solve a well-known problem for which there was no known efficient algorithm using classical computers. This is the problem of integer factorization, i.e. testing whether or not a given integer, N, is prime, in a time which is a finite power of O(log N).

6,335 citations
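
The excerpt above stops before the database search algorithm named in the paper's title, so here is a small classical simulation of the standard amplitude-amplification loop as a hedged illustration. Each iteration flips the sign of the marked amplitude (the oracle) and then reflects all amplitudes about their mean. The function name `grover_probabilities` and the choice of 8 items are assumptions, and the simulation stores all N amplitudes explicitly, so it only demonstrates the mechanics and gives no speedup itself.

```python
from math import floor, pi, sqrt


def grover_probabilities(n_items, marked):
    """Measurement probabilities after about (pi/4) * sqrt(N) Grover iterations."""
    amplitude = [1.0 / sqrt(n_items)] * n_items  # uniform superposition
    iterations = floor(pi / 4 * sqrt(n_items))
    for _ in range(iterations):
        # Oracle: flip the sign of the marked item's amplitude.
        amplitude[marked] = -amplitude[marked]
        # Diffusion: reflect every amplitude about the mean amplitude.
        mean = sum(amplitude) / n_items
        amplitude = [2.0 * mean - a for a in amplitude]
    return [a * a for a in amplitude]


probabilities = grover_probabilities(n_items=8, marked=5)
print(max(range(8), key=lambda i: probabilities[i]))  # 5
```

With 8 items the loop runs twice, after which the marked item is measured with probability above 0.9.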

Journal Article•DOI•

An Information Flow Model for Conflict and Fission in Small Groups

Wayne W. Zachary

01 Dec 1977-Journal of Anthropological Research

TL;DR: In this paper, the authors used data from a voluntary association to construct a new formal model for a traditional anthropological problem, fission in small groups, where the process leading to fission is viewed as an unequal flow of sentiments and information across the ties in a social network.

Abstract: Data from a voluntary association are used to construct a new formal model for a traditional anthropological problem, fission in small groups. The process leading to fission is viewed as an unequal flow of sentiments and information across the ties in a social network. This flow is unequal because it is uniquely constrained by the contextual range and sensitivity of each relationship in the network. The subsequent differential sharing of sentiments leads to the formation of subgroups with more internal stability than the group as a whole, and results in fission. The Ford-Fulkerson labeling algorithm allows an accurate prediction of membership in the subgroups and of the locus of the fission to be made from measurements of the potential for information flow across each edge in the network. Methods for measurement of potential information flow are discussed, and it is shown that all appropriate techniques will generate the same predictions.

3,721 citations