Graph Regularized Nonnegative Matrix
Factorization for Data Representation
Deng Cai, Member, IEEE, Xiaofei He, Senior Member, IEEE,
Jiawei Han, Fellow, IEEE, and Thomas S. Huang, Fellow, IEEE
Abstract—Matrix factorization techniques have been frequently applied in information retrieval, computer vision, and pattern
recognition. Among them, Nonnegative Matrix Factorization (NMF) has received considerable attention due to its psychological and
physiological interpretation of naturally occurring data whose representation may be parts based in the human brain. On the other
hand, from the geometric perspective, the data is usually sampled from a low-dimensional manifold embedded in a high-dimensional
ambient space. One then hopes to find a compact representation, which uncovers the hidden semantics and simultaneously respects
the intrinsic geometric structure. In this paper, we propose a novel algorithm, called Graph Regularized Nonnegative Matrix
Factorization (GNMF), for this purpose. In GNMF, an affinity graph is constructed to encode the geometrical information and we seek a
matrix factorization, which respects the graph structure. Our empirical study shows encouraging results of the proposed algorithm in
comparison to the state-of-the-art algorithms on real-world problems.
Index Terms—Nonnegative matrix factorization, graph Laplacian, manifold regularization, clustering.
1 INTRODUCTION
The techniques for matrix factorization have become
popular in recent years for data representation. In many
problems in information retrieval, computer vision, and
pattern recognition, the input data matrix is of very high
dimension. This makes learning from example infeasible [15].
One then hopes to find two or more lower dimensional
matrices whose product provides a good approximation to
the original one. The canonical matrix factorization techni-
ques include LU decomposition, QR decomposition, vector
quantization, and Singular Value Decomposition (SVD).
SVD is one of the most frequently used matrix factorization techniques. A singular value decomposition of an $M \times N$ matrix $X$ has the following form:

$$X = U \Sigma V^T,$$

where $U$ is an $M \times M$ orthogonal matrix, $V$ is an $N \times N$ orthogonal matrix, and $\Sigma$ is an $M \times N$ diagonal matrix with $\Sigma_{ij} = 0$ if $i \neq j$ and $\Sigma_{ii} \geq 0$. The quantities $\Sigma_{ii}$ are called the
singular values of X, and the columns of U and V are called
left and right singular vectors, respectively. By removing
those singular vectors corresponding to sufficiently small
singular values, we get a low-rank approximation to the
original matrix. This approximation is optimal in terms of
the reconstruction error, and thus optimal for data
representation when euclidean structure is concerned. For
this reason, SVD has been applied to various real-world
applications such as face recognition (eigenface, [40]) and
document representation (latent semantic indexing, [11]).
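As a small illustration of this low-rank idea (not part of the original paper), the following NumPy sketch keeps only the $r$ largest singular values to form the best rank-$r$ approximation in the Frobenius norm; the matrix sizes and the rank are arbitrary choices for the example.

```python
import numpy as np

# Hypothetical example: a random "data" matrix and a target rank r.
X = np.random.rand(100, 50)
r = 10

# Full SVD: X = U @ diag(s) @ Vt, with singular values s in decreasing order.
U, s, Vt = np.linalg.svd(X, full_matrices=False)

# Keep only the r largest singular values/vectors.
X_r = U[:, :r] @ np.diag(s[:r]) @ Vt[:r, :]

print("rank-r reconstruction error:", np.linalg.norm(X - X_r, "fro"))
```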
Previous studies have shown that there is psychological
and physiological evidence for parts-based representation
in the human brain [34], [41], [31]. The Nonnegative Matrix
Factorization (NMF) algorithm is proposed to learn the
parts of objects like human faces and text documents [33],
[26]. NMF aims to find two nonnegative matrices whose
product provides a good approximation to the original
matrix. The nonnegative constraints lead to a parts-based
representation because they allow only additive, not
subtractive, combinations. NMF has been shown to be
superior to SVD in face recognition [29] and document
clustering [42]. It is optimal for learning the parts of objects.
Recently, various researchers (see [39], [35], [1], [36], [2])
have considered the case when the data is drawn from
sampling a probability distribution that has support on or
near to a submanifold of the ambient space. Here, a
$d$-dimensional submanifold of a euclidean space $\mathbb{R}^M$ is a subset $\mathcal{M}^d \subset \mathbb{R}^M$, which locally looks like a flat $d$-dimensional euclidean space [28]. In order to detect the underlying manifold structure, many manifold learning algorithms
have been proposed, such as Locally Linear Embedding
(LLE) [35], ISOMAP [39], and Laplacian Eigenmap [1]. All
of these algorithms use the so-called locally invariant idea
[18], i.e., the nearby points are likely to have similar
embeddings. It has been shown that learning performance
can be significantly enhanced if the geometrical structure is
exploited and the local invariance is considered.

Motivated by recent progress in matrix factorization and
manifold learning [2], [5], [6], [7], in this paper we propose a
novel algorithm, called Graph regularized Nonnegative
Matrix Factorization (GNMF), which explicitly considers
the local invariance. We encode the geometrical information
of the data space by constructing a nearest neighbor graph.
Our goal is to find a parts-based representation space in
which two data points are sufficiently close to each other if
they are connected in the graph. To achieve this, we design
a new matrix factorization objective function and incorpo-
rate the graph structure into it. We also develop an
optimization scheme to solve the objective function based
on iterative updates of the two factor matrices. This leads to
a new parts-based data representation which respects the
geometrical structure of the data space. The convergence
proof of our optimization scheme is provided.
It is worthwhile to highlight several aspects of the
proposed approach here:
1. While the standard NMF fits the data in a euclidean
space, our algorithm exploits the intrinsic geometry
of the data distribution and incorporates it as an
additional regularization term. Hence, our algorithm
is particularly applicable when the data are sampled
from a submanifold which is embedded in high-
dimensional ambient space.
2. Our algorithm constructs a nearest neighbor graph
to model the manifold structure. The weight matrix
of the graph is highly sparse. Therefore, the multi-
plicative update rules for GNMF are very efficient.
By preserving the graph structure, our algorithm can
have more discriminating power than the standard
NMF algorithm.
3. Recent studies [17], [13] show that NMF is closely related to Probabilistic Latent Semantic Analysis (PLSA) [21]. The latter is one of the most popular topic modeling algorithms. Specifically, NMF with KL-divergence formulation is equivalent to PLSA [13]. From this viewpoint, the proposed GNMF approach also provides a principled way for incorporating the geometrical structure into topic modeling.
4. The proposed framework is a general one that can
leverage the power of both NMF and graph Laplacian
regularization. Besides the nearest neighbor informa-
tion, other knowledge (e.g., label information, social
network structure) about the data can also be used to
construct the graph. This naturally leads to other
extensions (e.g., semi-supervised NMF).
The rest of the paper is organized as follows: In Section 2,
we give a brief review of NMF. Section 3 introduces our
algorithm and provides a convergence proof of our
optimization scheme. Extensive experimental results on
clustering are presented in Section 4. Finally, we provide
some concluding remarks and suggestions for future work
in Section 5.
2 A BRIEF REVIEW OF NMF
NMF [26] is a matrix factorization algorithm that focuses
on the analysis of data matrices whose elements are
nonnegative.
Given a data matrix $X = [x_1, \ldots, x_N] \in \mathbb{R}^{M \times N}$, each column of $X$ is a sample vector. NMF aims to find two nonnegative matrices $U = [u_{ik}] \in \mathbb{R}^{M \times K}$ and $V = [v_{jk}] \in \mathbb{R}^{N \times K}$ whose product can well approximate the original matrix $X$:

$$X \approx U V^T.$$
There are two commonly used cost functions that quantify
the quality of the approximation. The first one is the square of
the euclidean distance between two matrices (the square of the Frobenius norm of the difference of the two matrices) [33]:
$$O_1 = \|X - UV^T\|^2 = \sum_{i,j} \Big( x_{ij} - \sum_{k=1}^{K} u_{ik} v_{jk} \Big)^2. \qquad (1)$$
The second one is the “divergence” between two
matrices [27]:
$$O_2 = D(X \,\|\, UV^T) = \sum_{i,j} \Big( x_{ij} \log \frac{x_{ij}}{y_{ij}} - x_{ij} + y_{ij} \Big), \qquad (2)$$

where $Y = [y_{ij}] = UV^T$. This cost function is referred to as the "divergence" of $X$ from $Y$ instead of the "distance" between $X$ and $Y$ because it is not symmetric. In other words, $D(X\|Y) \neq D(Y\|X)$. It reduces to the Kullback-Leibler divergence, or relative entropy, when $\sum_{ij} x_{ij} = \sum_{ij} y_{ij} = 1$, so that $X$ and $Y$ can be regarded as normalized probability distributions. We will refer to $O_1$ as the F-norm formulation and to $O_2$ as the divergence formulation in the rest of the paper.
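For concreteness, both cost functions can be evaluated with a few lines of NumPy. The sketch below is only illustrative: the function names are ours, and the small constant added inside the logarithm to guard against zero entries is a numerical safeguard not discussed in the paper.

```python
import numpy as np

def nmf_fnorm_cost(X, U, V):
    """F-norm cost O_1 = ||X - U V^T||^2, as in (1)."""
    R = X - U @ V.T
    return np.sum(R ** 2)

def nmf_divergence_cost(X, U, V, eps=1e-12):
    """Divergence cost O_2 = D(X || U V^T), as in (2).

    eps guards the logarithm and division when entries of X or Y are zero;
    this safeguard is our addition, not part of the paper.
    """
    Y = U @ V.T
    return np.sum(X * np.log((X + eps) / (Y + eps)) - X + Y)
```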
Although the objective functions $O_1$ in (1) and $O_2$ in (2) are convex in $U$ only or $V$ only, they are not convex in both variables together. Therefore, it is unrealistic to expect an algorithm to find the global minimum of $O_1$ (or $O_2$). Lee and Seung [27] presented two iterative update algorithms. The algorithm minimizing the objective function $O_1$ in (1) is as follows:
$$u_{ik} \leftarrow u_{ik} \frac{(XV)_{ik}}{(UV^T V)_{ik}}, \qquad v_{jk} \leftarrow v_{jk} \frac{(X^T U)_{jk}}{(VU^T U)_{jk}}.$$
The algorithm minimizing the objective function $O_2$ in (2) is

$$u_{ik} \leftarrow u_{ik} \frac{\sum_j \big( x_{ij} v_{jk} / \sum_k u_{ik} v_{jk} \big)}{\sum_j v_{jk}}, \qquad
v_{jk} \leftarrow v_{jk} \frac{\sum_i \big( x_{ij} u_{ik} / \sum_k u_{ik} v_{jk} \big)}{\sum_i u_{ik}}.$$
It is proven that the above two algorithms will find local minima of the objective functions $O_1$ and $O_2$ [27].
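As a reference point for the GNMF updates derived later, here is a minimal NumPy sketch of the multiplicative updates for the F-norm formulation; the initialization, iteration count, and the eps added to the denominators are our own choices, not specified by the paper.

```python
import numpy as np

def nmf_multiplicative(X, K, n_iter=200, eps=1e-10, seed=0):
    """Lee-Seung style multiplicative updates minimizing ||X - U V^T||^2."""
    rng = np.random.default_rng(seed)
    M, N = X.shape
    U = rng.random((M, K))
    V = rng.random((N, K))
    for _ in range(n_iter):
        U *= (X @ V) / (U @ (V.T @ V) + eps)    # u_ik <- u_ik (XV)_ik / (U V^T V)_ik
        V *= (X.T @ U) / (V @ (U.T @ U) + eps)  # v_jk <- v_jk (X^T U)_jk / (V U^T U)_jk
    return U, V
```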
In reality, we have $K \ll M$ and $K \ll N$. Thus, NMF essentially tries to find a compressed approximation of the original data matrix. We can view this approximation column by column as

$$x_j \approx \sum_{k=1}^{K} u_k v_{jk}, \qquad (3)$$

where $u_k$ is the $k$th column vector of $U$. Thus, each data vector $x_j$ is approximated by a linear combination of the columns of $U$, weighted by the components of $V$. Therefore, $U$ can be regarded as containing a basis, that is, optimized
for the linear approximation of the data in $X$. Let $z_j^T$ denote the $j$th row of $V$, $z_j = [v_{j1}, \ldots, v_{jK}]^T$. $z_j$ can be regarded as the new representation of the $j$th data point with respect to the new basis $U$. Since relatively few basis vectors are used to represent many data vectors, a good approximation can only be achieved if the basis vectors discover structure that is latent in the data [27].
The nonnegative constraints on U and V only allow
additive combinations among different bases. This is the
most significant difference between NMF and the other
matrix factorization methods, e.g., SVD. Unlike SVD, no
subtractions can occur in NMF. For this reason, it is
believed that NMF can learn a parts-based representation
[26]. The advantages of this parts-based representation have
been observed in many real-world problems such as face
analysis [29], document clustering [42], and DNA gene
expression analysis [3].
3 GRAPH REGULARIZED NONNEGATIVE MATRIX FACTORIZATION
By using the nonnegative constraints, NMF can learn a
parts-based representation. However, NMF performs this
learning in the euclidean space. It fails to discover the
intrinsic geometrical and discriminating structure of the
data space, which is essential for real-world applications.
In this section, we introduce our GNMF algorithm, which
avoids this limitation by incorporating a geometrically
based regularizer.
3.1 NMF with Manifold Regularization
Recall that NMF tries to find a set of basis vectors that can
be used to best approximate the data. One might further
hope that the basis vectors can respect the intrinsic
Riemannian structure, rather than the ambient euclidean structure. A natural assumption here could be that if two data points $x_j, x_l$ are close in the intrinsic geometry of the data distribution, then $z_j$ and $z_l$, the representations of these two points with respect to the new basis, are also close to each other. This assumption is usually referred to as the local invariance assumption [1], [19], [7], which plays an essential role in the development of various kinds of algorithms, including dimensionality reduction algorithms [1] and semi-supervised learning algorithms [2], [46], [45].
Recent studies in spectral graph theory [9] and manifold
learning theory [1] have demonstrated that the local
geometric structure can be effectively modeled through a
nearest neighbor graph on a scatter of data points. Consider
a graph with N vertices, where each vertex corresponds to a
data point. For each data point $x_j$, we find its $p$ nearest neighbors and put edges between $x_j$ and its neighbors.
There are many choices to define the weight matrix W on
the graph. Three of the most commonly used are as follows:
1. 0-1 Weighting. $W_{jl} = 1$ if and only if nodes $j$ and $l$ are connected by an edge. This is the simplest weighting method and is very easy to compute.

2. Heat Kernel Weighting. If nodes $j$ and $l$ are connected, put

$$W_{jl} = e^{-\frac{\|x_j - x_l\|^2}{\sigma}}.$$

Heat kernel weighting has an intrinsic connection to the Laplace-Beltrami operator on differentiable functions on a manifold [1].

3. Dot-Product Weighting. If nodes $j$ and $l$ are connected, put

$$W_{jl} = x_j^T x_l.$$

Note that if $x$ is normalized to 1, the dot product of two vectors is equivalent to the cosine similarity of the two vectors.
The $W_{jl}$ is used to measure the closeness of two points $x_j$ and $x_l$. The different similarity measures are suitable for different situations. For example, the cosine similarity (dot-product weighting) is very popular in the IR community (for processing documents), while for image data, the heat kernel weight may be a better choice. Since $W_{jl}$ in our paper is only for measuring the closeness, we do not treat the different weighting schemes separately.
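To make the graph construction concrete, the sketch below builds a p-nearest-neighbor graph with heat kernel weights and the corresponding degree and Laplacian matrices $D$ and $L = D - W$. The symmetrization step (keeping an edge if either point is among the other's p nearest neighbors) is one common convention and is our assumption here, not prescribed by the paper.

```python
import numpy as np

def knn_heat_kernel_graph(X, p=5, sigma=1.0):
    """Build W (heat kernel weights on a p-NN graph), D, and L = D - W.

    X: data matrix with one sample per column (M x N), as in the paper.
    """
    N = X.shape[1]
    # Pairwise squared euclidean distances between columns of X.
    sq_norms = np.sum(X ** 2, axis=0)
    dist2 = sq_norms[:, None] + sq_norms[None, :] - 2 * X.T @ X
    np.fill_diagonal(dist2, np.inf)           # exclude self-edges

    W = np.zeros((N, N))
    for j in range(N):
        neighbors = np.argsort(dist2[j])[:p]  # p nearest neighbors of x_j
        W[j, neighbors] = np.exp(-dist2[j, neighbors] / sigma)
    W = np.maximum(W, W.T)                    # symmetrize the graph

    D = np.diag(W.sum(axis=1))                # degree matrix, D_jj = sum_l W_jl
    L = D - W                                 # graph Laplacian
    return W, D, L
```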
The low-dimensional representation of $x_j$ with respect to the new basis is $z_j = [v_{j1}, \ldots, v_{jK}]^T$. Again, we can use either euclidean distance

$$d(z_j, z_l) = \|z_j - z_l\|^2,$$

or divergence

$$D(z_j \,\|\, z_l) = \sum_{k=1}^{K} \Big( v_{jk} \log \frac{v_{jk}}{v_{lk}} - v_{jk} + v_{lk} \Big),$$

to measure the "dissimilarity" between the low-dimensional representations of two data points with respect to the new basis.
With the above defined weight matrix $W$, we can use the following two terms to measure the smoothness of the low-dimensional representation:

$$R_2 = \frac{1}{2} \sum_{j,l=1}^{N} \big( D(z_j \| z_l) + D(z_l \| z_j) \big) W_{jl}
     = \frac{1}{2} \sum_{j,l=1}^{N} \sum_{k=1}^{K} \Big( v_{jk} \log \frac{v_{jk}}{v_{lk}} + v_{lk} \log \frac{v_{lk}}{v_{jk}} \Big) W_{jl}, \qquad (4)$$

and
$$R_1 = \frac{1}{2} \sum_{j,l=1}^{N} \|z_j - z_l\|^2 W_{jl}
     = \sum_{j=1}^{N} z_j^T z_j D_{jj} - \sum_{j,l=1}^{N} z_j^T z_l W_{jl}
     = \mathrm{Tr}(V^T D V) - \mathrm{Tr}(V^T W V) = \mathrm{Tr}(V^T L V), \qquad (5)$$

where $\mathrm{Tr}(\cdot)$ denotes the trace of a matrix and $D$ is a diagonal matrix whose entries are column (or row, since $W$ is symmetric) sums of $W$, $D_{jj} = \sum_l W_{jl}$. $L = D - W$, which is called the graph Laplacian [9].
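A quick numerical check of the identity in (5) can be done with arbitrary random data; the snippet below is purely illustrative and uses our own variable names.

```python
import numpy as np

rng = np.random.default_rng(0)
N, K = 6, 3
W = rng.random((N, N))
W = (W + W.T) / 2                      # symmetric weight matrix
D = np.diag(W.sum(axis=1))
L = D - W                              # graph Laplacian
V = rng.random((N, K))                 # rows of V are the representations z_j

# Left-hand side of (5): pairwise smoothness penalty.
R1 = 0.5 * sum(np.sum((V[j] - V[l]) ** 2) * W[j, l]
               for j in range(N) for l in range(N))

# Right-hand side of (5): Tr(V^T L V).
print(np.isclose(R1, np.trace(V.T @ L @ V)))   # prints True
```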
By minimizing $R_1$ (or $R_2$), we expect that if two data points $x_j$ and $x_l$ are close (i.e., $W_{jl}$ is big), $z_j$ and $z_l$ are also close to each other. Combining this geometrically based regularizer with the original NMF objective function leads to our GNMF.
Given a data matrix $X = [x_{ij}] \in \mathbb{R}^{M \times N}$, our GNMF aims to find two nonnegative matrices $U = [u_{ik}] \in \mathbb{R}^{M \times K}$ and $V = [v_{jk}] \in \mathbb{R}^{N \times K}$. Similarly to NMF, we can also use two "distance" measures here. If the euclidean distance is used, GNMF minimizes the objective function as follows:

$$O_1 = \|X - UV^T\|^2 + \lambda \, \mathrm{Tr}(V^T L V). \qquad (6)$$
If the divergence is used, GNMF minimizes

$$O_2 = \sum_{i=1}^{M} \sum_{j=1}^{N} \Big( x_{ij} \log \frac{x_{ij}}{\sum_{k=1}^{K} u_{ik} v_{jk}} - x_{ij} + \sum_{k=1}^{K} u_{ik} v_{jk} \Big)
+ \frac{\lambda}{2} \sum_{j=1}^{N} \sum_{l=1}^{N} \sum_{k=1}^{K} \Big( v_{jk} \log \frac{v_{jk}}{v_{lk}} + v_{lk} \log \frac{v_{lk}}{v_{jk}} \Big) W_{jl}, \qquad (7)$$

where the regularization parameter $\lambda \geq 0$ controls the smoothness of the new representation.
3.2 Updating Rules Minimizing (6)
The objective functions $O_1$ and $O_2$ of GNMF in (6) and (7) are not convex in both $U$ and $V$ together. Therefore, it is unrealistic to expect an algorithm to find the global minima. In the following, we introduce two iterative algorithms which can achieve local minima.
We first discuss how to minimize the objective function $O_1$, which can be rewritten as

$$O_1 = \mathrm{Tr}\big( (X - UV^T)(X - UV^T)^T \big) + \lambda \, \mathrm{Tr}(V^T L V)
     = \mathrm{Tr}(XX^T) - 2\,\mathrm{Tr}(XVU^T) + \mathrm{Tr}(UV^T V U^T) + \lambda \, \mathrm{Tr}(V^T L V), \qquad (8)$$
where the second equality applies the matrix properties $\mathrm{Tr}(AB) = \mathrm{Tr}(BA)$ and $\mathrm{Tr}(A) = \mathrm{Tr}(A^T)$. Let $\psi_{ik}$ and $\phi_{jk}$ be the Lagrange multipliers for the constraints $u_{ik} \geq 0$ and $v_{jk} \geq 0$, respectively, and let $\Psi = [\psi_{ik}]$, $\Phi = [\phi_{jk}]$. The Lagrangian $\mathcal{L}$ is

$$\mathcal{L} = \mathrm{Tr}(XX^T) - 2\,\mathrm{Tr}(XVU^T) + \mathrm{Tr}(UV^T V U^T) + \lambda \, \mathrm{Tr}(V^T L V) + \mathrm{Tr}(\Psi U^T) + \mathrm{Tr}(\Phi V^T). \qquad (9)$$
The partial derivatives of $\mathcal{L}$ with respect to $U$ and $V$ are

$$\frac{\partial \mathcal{L}}{\partial U} = -2XV + 2UV^T V + \Psi, \qquad (10)$$

$$\frac{\partial \mathcal{L}}{\partial V} = -2X^T U + 2VU^T U + 2\lambda L V + \Phi. \qquad (11)$$

Using the KKT conditions $\psi_{ik} u_{ik} = 0$ and $\phi_{jk} v_{jk} = 0$, we get the following equations for $u_{ik}$ and $v_{jk}$:

$$-(XV)_{ik} u_{ik} + (UV^T V)_{ik} u_{ik} = 0, \qquad (12)$$

$$-(X^T U)_{jk} v_{jk} + (VU^T U)_{jk} v_{jk} + \lambda (LV)_{jk} v_{jk} = 0. \qquad (13)$$
These equations lead to the following updating rules:

$$u_{ik} \leftarrow u_{ik} \frac{(XV)_{ik}}{(UV^T V)_{ik}}, \qquad (14)$$

$$v_{jk} \leftarrow v_{jk} \frac{(X^T U + \lambda W V)_{jk}}{(VU^T U + \lambda D V)_{jk}}. \qquad (15)$$
Regarding these two updating rules, we have the following theorem:

Theorem 1. The objective function $O_1$ in (6) is nonincreasing under the updating rules in (14) and (15).
Please see the Appendix for a detailed proof for the
above theorem. Our proof essentially follows the idea in the
proof of Lee and Seung’s [27] paper for the original NMF.
Recent studies [8], [30] show that Lee and Seung’s [27]
multiplicative algorithm cannot guarantee the convergence
to a stationary point. Particularly, Lin [30] suggests minor
modifications on Lee and Seung’s algorithm, which can
converge. Our updating rules in (14) and (15) are essentially
similar to the updating rules for NMF, and therefore, Lin’s
modifications can also be applied.
When $\lambda = 0$, it is easy to check that the updating rules in (14) and (15) reduce to the updating rules of the original NMF.
For the objective function of NMF, it is easy to check that if $U$ and $V$ are a solution, then $UD$, $VD^{-1}$ will also form a solution for any positive diagonal matrix $D$. To eliminate this uncertainty, in practice people further require that the euclidean length of each column vector in matrix $U$ (or $V$) be 1 [42]. The matrix $V$ (or $U$) will be adjusted accordingly so that $UV^T$ does not change. This can be achieved by

$$u_{ik} \leftarrow \frac{u_{ik}}{\sqrt{\sum_i u_{ik}^2}}, \qquad v_{jk} \leftarrow v_{jk} \sqrt{\sum_i u_{ik}^2}. \qquad (16)$$

Our GNMF also adopts this strategy. After the multiplicative updating procedure converges, we set the euclidean length of each column vector in matrix $U$ to 1 and adjust the matrix $V$ so that $UV^T$ does not change.
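Putting the pieces together, a minimal sketch of the GNMF multiplicative updates (14)-(15) followed by the normalization in (16) might look as follows. The iteration count, initialization, default value of lam, and the eps safeguard in the denominators are our own choices for illustration.

```python
import numpy as np

def gnmf_fnorm(X, W, K, lam=100.0, n_iter=200, eps=1e-10, seed=0):
    """GNMF with the F-norm objective (6): ||X - U V^T||^2 + lam * Tr(V^T L V)."""
    rng = np.random.default_rng(seed)
    M, N = X.shape
    D = np.diag(W.sum(axis=1))
    U = rng.random((M, K))
    V = rng.random((N, K))
    for _ in range(n_iter):
        # (14): u_ik <- u_ik (XV)_ik / (U V^T V)_ik
        U *= (X @ V) / (U @ (V.T @ V) + eps)
        # (15): v_jk <- v_jk (X^T U + lam W V)_jk / (V U^T U + lam D V)_jk
        V *= (X.T @ U + lam * W @ V) / (V @ (U.T @ U) + lam * D @ V + eps)
    # (16): normalize columns of U to unit euclidean length and rescale V
    # so that U V^T is unchanged.
    norms = np.sqrt(np.sum(U ** 2, axis=0))
    U /= norms
    V *= norms
    return U, V
```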
3.3 Connection to Gradient Descent Method
Another general algorithm for minimizing the objective
function of GNMF in (6) is gradient descent [25]. For our
problem, gradient descent leads to the following additive
update rules:
$$u_{ik} \leftarrow u_{ik} + \eta_{ik} \frac{\partial O_1}{\partial u_{ik}}, \qquad v_{jk} \leftarrow v_{jk} + \delta_{jk} \frac{\partial O_1}{\partial v_{jk}}. \qquad (17)$$

The $\eta_{ik}$ and $\delta_{jk}$ are usually referred to as step size parameters. As long as $\eta_{ik}$ and $\delta_{jk}$ are sufficiently small, the above updates should reduce $O_1$ unless $U$ and $V$ are at a stationary point.
Generally speaking, it is relatively difficult to set these step size parameters while still maintaining the nonnegativity of $u_{ik}$ and $v_{jk}$. However, with the special form of the partial derivatives, we can use some tricks to set the step size parameters automatically. Let $\eta_{ik} = -u_{ik} / 2(UV^T V)_{ik}$; we have

$$u_{ik} + \eta_{ik} \frac{\partial O_1}{\partial u_{ik}}
= u_{ik} - \frac{u_{ik}}{2(UV^T V)_{ik}} \frac{\partial O_1}{\partial u_{ik}}
= u_{ik} - \frac{u_{ik}}{2(UV^T V)_{ik}} \big( -2(XV)_{ik} + 2(UV^T V)_{ik} \big)
= u_{ik} \frac{(XV)_{ik}}{(UV^T V)_{ik}}. \qquad (18)$$
Similarly, letting $\delta_{jk} = -v_{jk} / 2(VU^T U + \lambda D V)_{jk}$, we have

$$v_{jk} + \delta_{jk} \frac{\partial O_1}{\partial v_{jk}}
= v_{jk} - \frac{v_{jk}}{2(VU^T U + \lambda D V)_{jk}} \frac{\partial O_1}{\partial v_{jk}}
= v_{jk} - \frac{v_{jk}}{2(VU^T U + \lambda D V)_{jk}} \big( -2(X^T U)_{jk} + 2(VU^T U)_{jk} + 2\lambda (LV)_{jk} \big)
= v_{jk} \frac{(X^T U + \lambda W V)_{jk}}{(VU^T U + \lambda D V)_{jk}}. \qquad (19)$$
Now, it is clear that the multiplicative updating rules in (14)
and (15) are special cases of gradient descent with an
automatic step parameter selection. The advantage of
multiplicative updating rules is the guarantee of nonnega-
tivity of U and V. Theorem 1 also guarantees that the
multiplicative updating rules in (14) and (15) converge to a
local optimum.
3.4 Updating Rules Minimizing (7)
For the divergence formulation of GNMF, we also have two
updating rules, which can achieve a local minimum of (7):
$$u_{ik} \leftarrow u_{ik} \frac{\sum_j \big( x_{ij} v_{jk} / \sum_k u_{ik} v_{jk} \big)}{\sum_j v_{jk}}, \qquad (20)$$

$$\mathbf{v}_k \leftarrow \Big( \sum_i u_{ik} I + \lambda L \Big)^{-1}
\begin{bmatrix}
v_{1k} \sum_i \big( x_{i1} u_{ik} / \sum_k u_{ik} v_{1k} \big) \\
v_{2k} \sum_i \big( x_{i2} u_{ik} / \sum_k u_{ik} v_{2k} \big) \\
\vdots \\
v_{Nk} \sum_i \big( x_{iN} u_{ik} / \sum_k u_{ik} v_{Nk} \big)
\end{bmatrix}, \qquad (21)$$

where $\mathbf{v}_k$ is the $k$th column of $V$ and $I$ is an $N \times N$ identity matrix.
Similarly, we have the following theorem:

Theorem 2. The objective function $O_2$ in (7) is nonincreasing with the updating rules in (20) and (21). The objective function is invariant under these updates if and only if $U$ and $V$ are at a stationary point.
Please see the Appendix for a detailed proof. The
updating rules in this section (minimizing the divergence
formulation of (7)) are different from the updating rules in
Section 3.2 (minimizing the F-norm formulation). For the
divergence formulation of NMF, previous studies [16]
successfully analyzed the convergence property of the
multiplicative algorithm [27] from EM algorithm’s max-
imum likelihood point of view. Such an analysis is also
valid in the GNMF case.
When $\lambda = 0$, it is easy to check that the updating rules in (20) and (21) reduce to the updating rules of the original NMF.
3.5 Computational Complexity Analysis
In this section, we discuss the extra computational cost of our
proposed algorithm in comparison to standard NMF.
Specifically, we provide the computational complexity
analysis of GNMF for both the F-Norm and KL-Divergence
formulations.
The common way to express the complexity of one
algorithm is using big O notation [10]. However, this is not
precise enough to differentiate between the complexities of
GNMF and NMF. Thus, we count the arithmetic operations
for each algorithm.
Based on the updating rules, it is not hard to count the
arithmetic operations of each iteration in NMF. We
summarize the result in Table 1. For GNMF, it is important
to note that W is a sparse matrix. If we use a p-nearest neighbor graph, the average number of nonzero elements in each row of W is p. Thus, we only need NpK flam (a floating-point
addition and multiplication) to compute WV. We also
summarize the arithmetic operations for GNMF in Table 1.
The updating rule (21) in GNMF with the divergence formulation involves inverting a large matrix $\sum_i u_{ik} I + \lambda L$.
In reality, there is no need to actually compute the inversion. We only need to solve the following linear system of equations:

$$\Big( \sum_i u_{ik} I + \lambda L \Big) \mathbf{v}_k =
\begin{bmatrix}
v_{1k} \sum_i \big( x_{i1} u_{ik} / \sum_k u_{ik} v_{1k} \big) \\
v_{2k} \sum_i \big( x_{i2} u_{ik} / \sum_k u_{ik} v_{2k} \big) \\
\vdots \\
v_{Nk} \sum_i \big( x_{iN} u_{ik} / \sum_k u_{ik} v_{Nk} \big)
\end{bmatrix}.$$
Since the matrix $\sum_i u_{ik} I + \lambda L$ is symmetric, positive definite, and sparse, we can use the iterative Conjugate Gradient (CG) algorithm [20] to solve this linear system of equations very efficiently. In each iteration, CG needs to compute matrix-vector products of the form $(\sum_i u_{ik} I + \lambda L)\mathbf{p}$. The remaining workload of CG in each iteration is 4N flam. Thus, the time cost of CG in
TABLE 1
Computational Operation Counts for Each Iteration in NMF and GNMF
fladd: a floating-point addition, flmlt: a floating-point multiplication, fldiv: a floating-point division.
N: the number of sample points, M: the number of features, K: the number of factors.
p: the number of nearest neighbors, q: the number of iterations in Conjugate Gradient (CG).
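To illustrate how the V-update in (21) can be carried out without forming the matrix inverse, the sketch below solves the sparse linear system for one column v_k with SciPy's conjugate gradient routine. The function and variable names, the eps safeguard, and the convergence tolerance are our own choices; the paper only specifies the system to be solved.

```python
import numpy as np
from scipy.sparse import identity
from scipy.sparse.linalg import cg

def update_vk_divergence(X, U, V, L_sparse, lam, k, eps=1e-12):
    """One divergence-formulation update of the kth column of V, cf. (21)."""
    M, N = X.shape
    Y = U @ V.T + eps                          # current approximation, y_ij
    # Right-hand side entries: b_j = v_jk * sum_i x_ij u_ik / sum_k u_ik v_jk.
    b = V[:, k] * ((X / Y).T @ U[:, k])
    # System matrix: (sum_i u_ik) I + lam * L, symmetric positive definite and sparse.
    A = U[:, k].sum() * identity(N, format="csr") + lam * L_sparse
    v_k, info = cg(A, b, atol=1e-8)
    if info == 0:                              # CG converged
        V[:, k] = v_k
    return V
```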
