scispace - formally typeset
Search or ask a question
Author

Vinh-Loc Dao

Bio: Vinh-Loc Dao is an academic researcher from Centre national de la recherche scientifique. The author has contributed to research in topics: Complex network & Network science. The author has an hindex of 5, co-authored 7 publications receiving 70 citations.

Papers
More filters
Journal ArticleDOI
TL;DR: This paper provides comprehensive analyses on computation time, community size distribution, a comparative evaluation of methods according to their optimization schemes as well as a comparison of their partitioning strategy through validation metrics, and proposes ways to classify community detection methods.
Abstract: Discovering community structure in complex networks is a mature field since a tremendous number of community detection methods have been introduced in the literature. Nevertheless, it is still very challenging for practitioners to determine which method would be suitable to get insights into the structural information of the networks they study. Many recent efforts have been devoted to investigating various quality scores of the community structure, but the problem of distinguishing between different types of communities is still open. In this paper, we propose a comparative, extensive, and empirical study to investigate what types of communities many state-of-the-art and well-known community detection methods are producing. Specifically, we provide comprehensive analyses on computation time, community size distribution, a comparative evaluation of methods according to their optimization schemes as well as a comparison of their partitioning strategy through validation metrics. We process our analyses on a very large corpus of hundreds of networks from five different network categories and propose ways to classify community detection methods, helping a potential user to navigate the complex landscape of community detection.

38 citations

Book ChapterDOI
11 Dec 2018
TL;DR: This paper proposes a novel approach to estimate the similarity between community detection methods using the size density distributions of communities that they detect and shows that there is a very clear distinction between the partitioning strategies of differentcommunity detection methods.
Abstract: Detecting community structure discloses tremendous information about complex networks and unlock promising applied perspectives. Accordingly, a numerous number of community detection methods have been proposed in the last two decades with many rewarding discoveries. Notwithstanding, it is still very challenging to determine a suitable method in order to get more insights into the mesoscopic structure of a network given an expected quality, especially on large scale networks. Many recent efforts have also been devoted to investigating various qualities of community structure associated with detection methods, but the answer to this question is still very far from being straightforward. In this paper, we propose a novel approach to estimate the similarity between community detection methods using the size density distributions of communities that they detect. We verify our solution on a very large corpus of networks consisting in more than a hundred networks of five different categories and deliver pairwise similarities of 16 state-of-the-art and well-known methods. Interestingly, our result shows that there is a very clear distinction between the partitioning strategies of different community detection methods. This distinction plays an important role in assisting network analysts to identify their rule-of-thumb solutions.

22 citations

Journal ArticleDOI
TL;DR: In this article, a comparative, extensive and empirical study to investigate what types of communities many state-of-the-art and well-known community detection methods are producing is presented.
Abstract: Discovering community structure in complex networks is a mature field since a tremendous number of community detection methods have been introduced in the literature. Nevertheless, it is still very challenging for practioners to determine which method would be suitable to get insights into the structural information of the networks they study. Many recent efforts have been devoted to investigating various quality scores of the community structure, but the problem of distinguishing between different types of communities is still open. In this paper, we propose a comparative, extensive and empirical study to investigate what types of communities many state-of-the-art and well-known community detection methods are producing. Specifically, we provide comprehensive analyses on computation time, community size distribution, a comparative evaluation of methods according to their optimisation schemes as well as a comparison of their partioning strategy through validation metrics. We process our analyses on a very large corpus of hundreds of networks from five different network categories and propose ways to classify community detection methods, helping a potential user to navigate the complex landscape of community detection.

13 citations

Book ChapterDOI
15 Jan 2017
TL;DR: This work proposes a methodology that allows one to expose structural information of clusters in a network partition in a comprehensive way, thus eventually helps one to compare communities identified by different community detection methods and helps to clarify the composition of communities in real-world networks.
Abstract: Evaluating a network partition just only via conventional quality metrics - such as modularity, conductance or normalized mutual of information - is usually insufficient. Indeed, global quality scores of a network partition or its clusters do not provide many ideas about their structural characteristics. Furthermore, quality metrics often fail to reach an agreement especially in networks whose modular structures are not very obvious. Evaluating the goodness of network partitions in function of desired structural properties is still a challenge. Here, we propose a methodology that allows one to expose structural information of clusters in a network partition in a comprehensive way, thus eventually helps one to compare communities identified by different community detection methods. This descriptive approach also helps to clarify the composition of communities in real-world networks. The methodology hence bring us a step closer to the understanding of modular structures in complex networks.

8 citations

Proceedings ArticleDOI
31 Jul 2017
TL;DR: Most common methods in the literature for community detection are brought together in a comparative approach and their performances in both real-world networks and synthetic networks are revealed.
Abstract: Community detection emerged as an important exploratory task in complex networks analysis across many scientific domains. Many methods have been proposed to solve this problem, each one with its own mechanism and sometimes with a different notion of community. In this article, we bring most common methods in the literature together in a comparative approach and reveal their performances in both real-world networks and synthetic networks. Surprisingly, many of those methods discovered better communities than the declared ground-truth communities in terms of some topological goodness features, even on benchmarking networks with built-in communities. We illustrate different structural characteristics that these methods could identify in order to support users to choose an appropriate method according to their specific requirements on different structural qualities.

7 citations


Cited by
More filters
Proceedings ArticleDOI
22 Jan 2006
TL;DR: Some of the major results in random graphs and some of the more challenging open problems are reviewed, including those related to the WWW.
Abstract: We will review some of the major results in random graphs and some of the more challenging open problems. We will cover algorithmic and structural questions. We will touch on newer models, including those related to the WWW.

7,116 citations

Journal ArticleDOI
TL;DR: The aim of CDlib is to allow easy and standardized access to a wide variety of network clustering algorithms, to evaluate and compare the results they provide, and to visualize them.
Abstract: Community Discovery is among the most studied problems in complex network analysis. During the last decade, many algorithms have been proposed to address such task; however, only a few of them have been integrated into a common framework, making it hard to use and compare different solutions. To support developers, researchers and practitioners, in this paper we introduce a python library - namely CDlib - designed to serve this need. The aim of CDlib is to allow easy and standardized access to a wide variety of network clustering algorithms, to evaluate and compare the results they provide, and to visualize them. It notably provides the largest available collection of community detection implementations, with a total of 39 algorithms.

83 citations

Journal ArticleDOI
TL;DR: Experiments show that the proposed community detection algorithm based on influential nodes (LGIEM) is able to detect communities efficiently, and achieves better performance compared to other recent methods.

77 citations

Journal ArticleDOI
TL;DR: A method to identify typologies of structure in large collections of personal networks by applying standard cluster analysis techniques to the three variables and finding show that personal network structure can be effectively summarized using just three measures of cohesive subgroup characteristics.
Abstract: A recurrent finding in personal network research is that individual and social outcomes are influenced not just by the kind of people one knows, but also by how those people are connected to each other: that is, by the structure of one's personal network. The different ways in which a person's social contacts know and interact with each other reflect broader variations in personal communities and social structures, and shape patterns and processes of social capital, support, and isolation. This article proposes a method to identify typologies of network structure in large collections of personal networks. The method is illustrated with an application to six datasets collected in widely different circumstances and using various survey instruments. Results are compared with those from another recently introduced method to extract structural typologies of egocentric networks. Findings show that personal network structure can be effectively summarized using just three measures describing results of the Girvan-Newman algorithm for cohesive subgroup detection. Structural typologies can then be extracted through cluster analysis on the three variables, using well-known clustering quality statistics to select the optimal typology. Both typology detection methods considered in the article capture significant variation in personal network structures, but substantial levels of disagreement and cross-classification emerge between them. I discuss differences and similarities between the methods, and potential applications of the proposed typologies to substantive research on a variety of topics, including structures and transformations of personal communities, social support, and social capital.

52 citations