scispace - formally typeset
Open AccessBook ChapterDOI

Within-network classification using local structure similarity

TLDR
This paper presents a collective classification method, based on relaxation labeling, that classifies entities of a network using their local structure using a marginalized similarity kernel that compares the local structure of two nodes with random walks in the network.
Abstract
Within-network classification, where the goal is to classify the nodes of a partly labeled network, is a semi-supervised learning problem that has applications in several important domains like image processing, the classification of documents, and the detection of malicious activities. While most methods for this problem infer the missing labels collectively based on the hypothesis that linked or nearby nodes are likely to have the same labels, there are many types of networks for which this assumption fails, e.g., molecular graphs, trading networks, etc. In this paper, we present a collective classification method, based on relaxation labeling, that classifies entities of a network using their local structure. This method uses a marginalized similarity kernel that compares the local structure of two nodes with random walks in the network. Through experimentation on different datasets, we show our method to be more accurate than several state-of-the-art approaches for this problem.

read more

Content maybe subject to copyright    Report

Citations
More filters
Journal ArticleDOI

Propagation kernels: efficient graph kernels from propagated information

TL;DR: It is shown that if the graphs at hand have a regular structure, one can exploit this regularity to scale the kernel computation to large databases of graphs with thousands of nodes, and can be considerably faster than state-of-the-art approaches without sacrificing predictive performance.
Journal ArticleDOI

Label-dependent node classification in the network

TL;DR: A new approach of sampling algorithm-LDGibbs, used in the context of collective classification with application of label-dependent features, is proposed in the paper in order to provide more accurate generalization for sparse datasets.
Journal ArticleDOI

Multi-type clustering and classification from heterogeneous networks

TL;DR: The algorithm HENPC is proposed, which is able to work on heterogeneous networks with an arbitrary structure, and extracts possibly overlapping and hierarchically-organized heterogeneous clusters and exploits them for predictive purposes.
Book ChapterDOI

Node Classification in Social Network via a Factor Graph Model

TL;DR: This work uses a factor graph model with partially-labeled data to solve the task of node classification in social networks and shows that the model works much better than the traditional classification algorithms.
Proceedings ArticleDOI

Mining frequent neighborhood patterns in a large labeled graph

TL;DR: This paper proposes a new class of patterns called frequent neighborhood patterns, which are free from the "DCP-intuitiveness" dilemma of mining frequent subgraphs in a single graph, and shows that the new patterns not only maintain DCP, but also have equally significant interpretations as subgraph patterns.
References
More filters
Journal ArticleDOI

Stochastic Relaxation, Gibbs Distributions, and the Bayesian Restoration of Images

TL;DR: The analogy between images and statistical mechanics systems is made and the analogous operation under the posterior distribution yields the maximum a posteriori (MAP) estimate of the image given the degraded observations, creating a highly parallel ``relaxation'' algorithm for MAP estimation.
Proceedings Article

Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data

TL;DR: This work presents iterative parameter estimation algorithms for conditional random fields and compares the performance of the resulting models to HMMs and MEMMs on synthetic and natural-language data.
Journal ArticleDOI

On the statistical analysis of dirty pictures

TL;DR: In this paper, the authors proposed an iterative method for scene reconstruction based on a non-degenerate Markov Random Field (MRF) model, where the local characteristics of the original scene can be represented by a nondegenerate MRF and the reconstruction can be estimated according to standard criteria.
Book

Graphical Models, Exponential Families, and Variational Inference

TL;DR: The variational approach provides a complementary alternative to Markov chain Monte Carlo as a general source of approximation methods for inference in large-scale statistical models.
Related Papers (5)