Journal ArticleDOI

Community discovery using nonnegative matrix factorization

01 May 2011-Data Mining and Knowledge Discovery (Springer US)-Vol. 22, Iss: 3, pp 493-521
TL;DR: This paper investigates another important issue in network analysis, community discovery, and chooses Nonnegative Matrix Factorization (NMF) as the tool for finding communities because of its strong interpretability and close relationship with clustering methods.
Abstract: Complex networks exist in a wide range of real world systems, such as social networks, technological networks, and biological networks. During the last decades, many researchers have concentrated on exploring properties common to those large networks, including the small-world property, power-law degree distributions, and network connectivity. In this paper, we investigate another important issue, community discovery, in network analysis. We choose Nonnegative Matrix Factorization (NMF) as our tool to find the communities because of its powerful interpretability and close relationship with clustering methods. Targeting different types of networks (undirected, directed and compound), we propose three NMF techniques (Symmetric NMF, Asymmetric NMF and Joint NMF). The correctness and convergence properties of those algorithms are also studied. Finally, experiments on real world networks are presented to show the effectiveness of the proposed methods.
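To make the symmetric variant concrete, below is a minimal sketch of a commonly used multiplicative update for factorizing an undirected network's adjacency matrix A as H H^T with H nonnegative. It illustrates the general technique rather than the paper's exact Symmetric/Asymmetric/Joint NMF updates; the damping constant, iteration count, and function names are assumptions.

```python
import numpy as np

def symmetric_nmf(A, k, n_iter=300, seed=0, eps=1e-12):
    """Sketch: approximate a symmetric nonnegative adjacency matrix A by H @ H.T with H >= 0."""
    rng = np.random.default_rng(seed)
    H = rng.random((A.shape[0], k))
    for _ in range(n_iter):
        # Damped multiplicative update commonly used for min ||A - H H^T||_F^2 subject to H >= 0
        H *= 0.5 * (1.0 + (A @ H) / (H @ (H.T @ H) + eps))
    return H

# Hypothetical usage: assign each node to the community whose column of H has the largest entry.
# labels = symmetric_nmf(adjacency, k=3).argmax(axis=1)
```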


Citations
Proceedings Article
04 Feb 2017
TL;DR: A novel Modularized Nonnegative Matrix Factorization (M-NMF) model is proposed to incorporate the community structure into network embedding, jointly optimizing an NMF-based representation learning model and a modularity-based community detection model in a unified framework, which enables the learned node representations to preserve both the microscopic structure and the community structure.
Abstract: Network embedding, aiming to learn the low-dimensional representations of nodes in networks, is of paramount importance in many real applications. One basic requirement of network embedding is to preserve the structure and inherent properties of the networks. While previous network embedding methods primarily preserve the microscopic structure, such as the first- and second-order proximities of nodes, the mesoscopic community structure, which is one of the most prominent features of networks, is largely ignored. In this paper, we propose a novel Modularized Nonnegative Matrix Factorization (M-NMF) model to incorporate the community structure into network embedding. We exploit the consensus relationship between the representations of nodes and the community structure, and then jointly optimize the NMF-based representation learning model and the modularity-based community detection model in a unified framework, which enables the learned node representations to preserve both the microscopic and community structures. We also provide efficient updating rules to infer the parameters of our model, together with correctness and convergence guarantees. Extensive experimental results on a variety of real-world networks show the superior performance of the proposed method over the state of the art.
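M-NMF's community term builds on Newman's modularity, Q = tr(H^T B H)/(2m), with modularity matrix B = A - d d^T/(2m). A minimal sketch of those two ingredients follows (not the full joint M-NMF optimizer; function names are illustrative):

```python
import numpy as np

def modularity_matrix(A):
    """B = A - d d^T / (2m) for an undirected graph with adjacency A and degree vector d."""
    d = A.sum(axis=1)
    return A - np.outer(d, d) / d.sum()

def modularity(A, labels):
    """Newman modularity Q = tr(H^T B H) / (2m) for a hard community assignment `labels`."""
    labels = np.asarray(labels)
    H = np.eye(labels.max() + 1)[labels]          # one-hot community indicator matrix
    return np.trace(H.T @ modularity_matrix(A) @ H) / A.sum()
```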

756 citations


Cites background from "Community discovery using nonnegati..."

  • ...(20) By lemma 2 in (Wang et al. 2011), we have $-2\alpha\,\mathrm{tr}(HCU^T) \le -2\alpha\,\mathrm{tr}(CU^T Z) - 2\alpha\,\mathrm{tr}(CU^T H')$....

    [...]

  • ...(19) By lemma 6 in (Wang et al. 2011), we have $\beta\,\mathrm{tr}(H^T B_1 H) \le \tfrac{1}{2}\beta\,\mathrm{tr}(Y^T B_1 H') + \tfrac{1}{2}\beta\,\mathrm{tr}(H'^T B_1 Y)$....

    [...]

  • ...(21) By lemmas 6 and 7 in (Wang et al. 2011), we have...

    [...]

  • ...By lemma 4 in (Wang et al. 2011), we have $-\beta\,\mathrm{tr}(H^T A H) \le -\beta\,\mathrm{tr}(H'^T A Z) - \beta\,\mathrm{tr}(Z^T A H') - \beta\,\mathrm{tr}(H'^T A H')$, (18) and $-(2\lambda-\alpha)\,\mathrm{tr}(H^T H) \le -(2\lambda-\alpha)\,\mathrm{tr}(H'^T Z) - (2\lambda-\alpha)\,\mathrm{tr}(Z^T H') - (2\lambda-\alpha)\,\mathrm{tr}(H'^T H')$....

    [...]

  • ...(21) By lemmas 6 and 7 in (Wang et al. 2011), we have $\lambda\,\mathrm{tr}(H^T H H^T H) \le \lambda\,\mathrm{tr}(P H'^T H') \le \lambda\,\mathrm{tr}(R H'^T H' H'^T)$, (22) where $P_{ij} = (U^T U)^2_{ij} / (H'^T H')_{ij}$ and $R_{ij} = H^4_{ij} / H'^3_{ij}$....

    [...]
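The trace inequalities quoted above are the building blocks of an auxiliary-function (majorize-minimize) convergence argument for the multiplicative updates. As a reminder of the general device, stated here in its standard form rather than as a restatement of the cited lemmas:

```latex
\text{If } g(H,H') \ge f(H) \text{ for all } H,H' \text{ and } g(H,H) = f(H),
\text{ then the update } H^{(t+1)} = \arg\min_{H} g(H, H^{(t)}) \text{ satisfies}
\quad f(H^{(t+1)}) \le g(H^{(t+1)}, H^{(t)}) \le g(H^{(t)}, H^{(t)}) = f(H^{(t)}),
\text{ so the objective is non-increasing along the iterates.}
```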

Journal ArticleDOI
TL;DR: This work describes a method for finding overlapping communities based on a principled statistical approach using generative network models and shows how the method can be implemented using a fast, closed-form expectation-maximization algorithm that allows networks of millions of nodes to be analyzed in reasonable running times.
Abstract: A fundamental problem in the analysis of network data is the detection of network communities, groups of densely interconnected nodes, which may be overlapping or disjoint. Here we describe a method for finding overlapping communities based on a principled statistical approach using generative network models. We show how the method can be implemented using a fast, closed-form expectation-maximization algorithm that allows us to analyze networks of millions of nodes in reasonable running times. We test the method both on real-world networks and on synthetic benchmarks and find that it gives results competitive with previous methods. We also show that the same approach can be used to extract nonoverlapping community divisions via a relaxation method, and demonstrate that the algorithm is competitively fast and accurate for the nonoverlapping problem.
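As an illustration of the kind of closed-form EM this describes, here is a hedged sketch of an expectation-maximization loop for a Poisson generative model in which A_ij ~ Poisson(sum_z theta[i,z] * theta[j,z]) and theta[i,z] measures node i's affinity for community z. The paper's exact model and updates may differ; all names and the iteration budget below are illustrative.

```python
import numpy as np

def overlapping_communities_em(A, k, n_iter=100, seed=0, eps=1e-12):
    """EM sketch for A_ij ~ Poisson(sum_z theta[i, z] * theta[j, z])."""
    rng = np.random.default_rng(seed)
    theta = rng.random((A.shape[0], k)) + eps
    for _ in range(n_iter):
        denom = theta @ theta.T + eps                          # expected edge intensity for every pair
        new_theta = np.empty_like(theta)
        for z in range(k):
            q_z = np.outer(theta[:, z], theta[:, z]) / denom   # E-step: responsibility of community z for edge (i, j)
            expected = A * q_z                                 # expected number of edges attributed to community z
            new_theta[:, z] = expected.sum(axis=1) / (np.sqrt(expected.sum()) + eps)  # M-step
        theta = new_theta + eps
    return theta   # overlapping memberships; thresholding theta yields (possibly overlapping) communities
```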

412 citations

Posted Content
TL;DR: Experimental results on the tasks of graph classification and molecular property prediction show that InfoGraph is superior to state-of-the-art baselines and InfoGraph* can achieve performance competitive with state-of-the-art semi-supervised models.
Abstract: This paper studies learning the representations of whole graphs in both unsupervised and semi-supervised scenarios. Graph-level representations are critical in a variety of real-world applications such as predicting the properties of molecules and community analysis in social networks. Traditional graph kernel based methods are simple, yet effective for obtaining fixed-length representations for graphs, but they suffer from poor generalization due to hand-crafted designs. There are also some recent methods based on language models (e.g. graph2vec), but they tend to only consider certain substructures (e.g. subtrees) as graph representatives. Inspired by recent progress in unsupervised representation learning, in this paper we propose a novel method called InfoGraph for learning graph-level representations. We maximize the mutual information between the graph-level representation and the representations of substructures of different scales (e.g., nodes, edges, triangles). By doing so, the graph-level representations encode aspects of the data that are shared across different scales of substructures. We further propose InfoGraph*, an extension of InfoGraph for semi-supervised scenarios. InfoGraph* maximizes the mutual information between unsupervised graph representations learned by InfoGraph and the representations learned by existing supervised methods. As a result, the supervised encoder learns from unlabeled data while preserving the latent semantic space favored by the current supervised task. Experimental results on the tasks of graph classification and molecular property prediction show that InfoGraph is superior to state-of-the-art baselines and InfoGraph* can achieve performance competitive with state-of-the-art semi-supervised models.
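The core quantity InfoGraph optimizes is a mutual-information lower bound between graph-level and substructure (e.g. node) representations. Below is a minimal numpy sketch of a Jensen-Shannon style MI objective evaluated on given embeddings; the actual method trains GNN encoders end to end, and the bilinear discriminator and names here are assumptions for illustration only.

```python
import numpy as np

def softplus(x):
    # numerically stable log(1 + exp(x))
    return np.log1p(np.exp(-np.abs(x))) + np.maximum(x, 0.0)

def js_mi_objective(node_emb, graph_emb, graph_ids):
    """node_emb: (N, d) node representations; graph_emb: (G, d) graph representations;
    graph_ids: (N,) index of the graph each node belongs to. Returns a JS-style MI lower bound."""
    scores = node_emb @ graph_emb.T                       # discriminator scores for all (node, graph) pairs
    pos = np.zeros(scores.shape, dtype=bool)
    pos[np.arange(len(graph_ids)), graph_ids] = True      # positive pairs: each node with its own graph
    return -softplus(-scores[pos]).mean() - softplus(scores[~pos]).mean()   # quantity to maximize
```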

394 citations


Cites background from "Community discovery using nonnegati..."

  • ...There has been a significant amount of previous work done studying many aspects of graphs including link prediction [13, 57] and node prediction [2]....

    [...]

Posted Content
TL;DR: A recent subclass of NMF problems, referred to as near-separable NMF, is presented that can be solved efficiently (that is, in polynomial time), even in the presence of noise.
Abstract: Nonnegative matrix factorization (NMF) has become a widely used tool for the analysis of high-dimensional data as it automatically extracts sparse and meaningful features from a set of nonnegative data vectors. We first illustrate this property of NMF on three applications, in image processing, text mining and hyperspectral imaging --this is the why. Then we address the problem of solving NMF, which is NP-hard in general. We review some standard NMF algorithms, and also present a recent subclass of NMF problems, referred to as near-separable NMF, that can be solved efficiently (that is, in polynomial time), even in the presence of noise --this is the how. Finally, we briefly describe some problems in mathematics and computer science closely related to NMF via the nonnegative rank.
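One of the best-known algorithms for the near-separable case is the successive projection algorithm (SPA), which greedily selects columns of the data matrix that act as "pure" factors. A minimal sketch under the (near-)separability assumption; parameter names are illustrative:

```python
import numpy as np

def spa(M, r):
    """Successive projection algorithm: pick r columns of M believed to span its conical hull."""
    R = M.astype(float).copy()
    selected = []
    for _ in range(r):
        j = int(np.argmax((R ** 2).sum(axis=0)))   # column with the largest residual norm
        selected.append(j)
        u = R[:, j] / (np.linalg.norm(R[:, j]) + 1e-12)
        R -= np.outer(u, u @ R)                    # project the residual away from the chosen column
    return selected   # W = M[:, selected]; H then follows from nonnegative least squares
```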

330 citations


Cites background from "Community discovery using nonnegati..."

  • ...Other applications include air emission control [97], computational biology [34], blind source separation [22], single-channel source separation [82], clustering [35], music analysis [42], collaborative filtering [92], and community detection [106]....

    [...]

References
Journal ArticleDOI
04 Jun 1998-Nature
TL;DR: Simple models of networks that can be tuned through this middle ground, regular networks 'rewired' to introduce increasing amounts of disorder, are explored; these systems can be highly clustered, like regular lattices, yet have small characteristic path lengths, like random graphs.
Abstract: Networks of coupled dynamical systems have been used to model biological oscillators, Josephson junction arrays, excitable media, neural networks, spatial games, genetic control networks and many other self-organizing systems. Ordinarily, the connection topology is assumed to be either completely regular or completely random. But many biological, technological and social networks lie somewhere between these two extremes. Here we explore simple models of networks that can be tuned through this middle ground: regular networks 'rewired' to introduce increasing amounts of disorder. We find that these systems can be highly clustered, like regular lattices, yet have small characteristic path lengths, like random graphs. We call them 'small-world' networks, by analogy with the small-world phenomenon (popularly known as six degrees of separation). The neural network of the worm Caenorhabditis elegans, the power grid of the western United States, and the collaboration graph of film actors are shown to be small-world networks. Models of dynamical systems with small-world coupling display enhanced signal-propagation speed, computational power, and synchronizability. In particular, infectious diseases spread more easily in small-world networks than in regular lattices.
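The small-world effect is easy to reproduce numerically: as the rewiring probability p grows, the average path length collapses long before the clustering coefficient does. A short sketch, assuming the networkx library is available (graph sizes and probabilities here are arbitrary choices):

```python
import networkx as nx

# Ring lattice of 1000 nodes, 10 neighbours each, edges rewired with probability p
for p in (0.0, 0.01, 0.1, 1.0):
    G = nx.connected_watts_strogatz_graph(n=1000, k=10, p=p, seed=0)
    print(f"p={p}: clustering={nx.average_clustering(G):.3f}, "
          f"avg path length={nx.average_shortest_path_length(G):.2f}")
```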

39,297 citations


"Community discovery using nonnegati..." refers background in this paper

  • ...Nowadays, complex networks exist in a wide variety of systems in different areas, such as social networks (Scott 2000; Wasserman and Faust 1994), technological networks (Amaral et al. 2000; Watts and Strogatz 1998), biological networks (Sharan 2005; Watts and Strogatz 1998) and information networks (Albert et al. 1999; Faloutsos et al.)....

    [...]


Journal ArticleDOI
TL;DR: This article proposes a method for detecting communities, built around the idea of using centrality indices to find community boundaries, and tests it on computer-generated and real-world graphs whose community structure is already known and finds that the method detects this known structure with high sensitivity and reliability.
Abstract: A number of recent studies have focused on the statistical properties of networked systems such as social networks and the Worldwide Web. Researchers have concentrated particularly on a few properties that seem to be common to many networks: the small-world property, power-law degree distributions, and network transitivity. In this article, we highlight another property that is found in many networks, the property of community structure, in which network nodes are joined together in tightly knit groups, between which there are only looser connections. We propose a method for detecting such communities, built around the idea of using centrality indices to find community boundaries. We test our method on computer-generated and real-world graphs whose community structure is already known and find that the method detects this known structure with high sensitivity and reliability. We also apply the method to two networks whose community structure is not well known—a collaboration network and a food web—and find that it detects significant and informative community divisions in both cases.
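The centrality-based idea can be sketched in a few lines: repeatedly recompute edge betweenness and remove the highest-scoring edge until the graph splits. A minimal sketch assuming networkx is available (the full method keeps removing edges and tracks the best division found):

```python
import networkx as nx

def girvan_newman_split(G):
    """Remove the highest edge-betweenness edge until the graph breaks into more components."""
    g = G.copy()
    start = nx.number_connected_components(g)
    while nx.number_connected_components(g) == start:
        betweenness = nx.edge_betweenness_centrality(g)
        g.remove_edge(*max(betweenness, key=betweenness.get))
    return [sorted(c) for c in nx.connected_components(g)]

# Example: the first split of Zachary's karate club network
print(girvan_newman_split(nx.karate_club_graph()))
```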

14,429 citations


"Community discovery using nonnegati..." refers background in this paper

  • ...Besides that, most real world networks demonstrate that the nodes (or units) contained in their certain parts are densely connected to each other (Palla et al. 2005), which are usually called clusters or communities (Girvan and Newman 2002)....

    [...]

Journal ArticleDOI
21 Oct 1999-Nature
TL;DR: An algorithm for non-negative matrix factorization is demonstrated that is able to learn parts of faces and semantic features of text and is in contrast to other methods that learn holistic, not parts-based, representations.
Abstract: Is perception of the whole based on perception of its parts? There is psychological and physiological evidence for parts-based representations in the brain, and certain computational theories of object recognition rely on such representations. But little is known about how brains or computers might learn the parts of objects. Here we demonstrate an algorithm for non-negative matrix factorization that is able to learn parts of faces and semantic features of text. This is in contrast to other methods, such as principal components analysis and vector quantization, that learn holistic, not parts-based, representations. Non-negative matrix factorization is distinguished from the other methods by its use of non-negativity constraints. These constraints lead to a parts-based representation because they allow only additive, not subtractive, combinations. When non-negative matrix factorization is implemented as a neural network, parts-based representations emerge by virtue of two properties: the firing rates of neurons are never negative and synaptic strengths do not change sign.
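The multiplicative updates popularized by this line of work are short enough to state directly; a minimal sketch of the Frobenius-norm variant (iteration count and initialization are arbitrary choices):

```python
import numpy as np

def nmf(V, r, n_iter=500, seed=0, eps=1e-12):
    """Multiplicative updates for min ||V - W H||_F^2 with W, H >= 0 (Lee-Seung style)."""
    rng = np.random.default_rng(seed)
    n, m = V.shape
    W, H = rng.random((n, r)), rng.random((r, m))
    for _ in range(n_iter):
        H *= (W.T @ V) / (W.T @ W @ H + eps)   # update H with W fixed
        W *= (V @ H.T) / (W @ H @ H.T + eps)   # update W with H fixed
    return W, H
```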

11,500 citations


"Community discovery using nonnegati..." refers methods in this paper

  • ...It was originally proposed as a method for finding matrix factors with parts-of-whole interpretations (Lee and Seung 1999)....

    [...]

Journal ArticleDOI
TL;DR: In this article, the authors present the most common spectral clustering algorithms, derive them from scratch by several different approaches, and discuss their advantages and disadvantages.
Abstract: In recent years, spectral clustering has become one of the most popular modern clustering algorithms. It is simple to implement, can be solved efficiently by standard linear algebra software, and very often outperforms traditional clustering algorithms such as the k-means algorithm. At first glance, spectral clustering appears slightly mysterious, and it is not obvious why it works at all and what it really does. The goal of this tutorial is to give some intuition on those questions. We describe different graph Laplacians and their basic properties, present the most common spectral clustering algorithms, and derive those algorithms from scratch by several different approaches. Advantages and disadvantages of the different spectral clustering algorithms are discussed.
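Among the variants the tutorial derives, the normalized (Ng-Jordan-Weiss style) algorithm is probably the most common; a minimal sketch, assuming scipy and scikit-learn are available:

```python
import numpy as np
from scipy.linalg import eigh
from sklearn.cluster import KMeans

def spectral_clustering(A, k):
    """Embed nodes with the k smallest eigenvectors of the symmetric normalized Laplacian,
    row-normalize, then cluster with k-means."""
    d = A.sum(axis=1)
    d_inv_sqrt = 1.0 / np.sqrt(d + 1e-12)
    L_sym = np.eye(A.shape[0]) - (d_inv_sqrt[:, None] * A) * d_inv_sqrt[None, :]
    _, vecs = eigh(L_sym)                                  # eigenvalues returned in ascending order
    U = vecs[:, :k]
    U = U / (np.linalg.norm(U, axis=1, keepdims=True) + 1e-12)
    return KMeans(n_clusters=k, n_init=10, random_state=0).fit_predict(U)
```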

9,141 citations