Author

Yuanlong Shao

Other affiliations: Ohio State University

Bio: Yuanlong Shao is an academic researcher from Zhejiang University. The author has contributed to research on automatic image annotation and semi-supervised learning. The author has an h-index of 5 and has co-authored 8 publications receiving 269 citations. Previous affiliations of Yuanlong Shao include Ohio State University.

Papers

Journal Article

Laplacian Regularized Gaussian Mixture Model for Data Clustering

Xiaofei He¹, Deng Cai¹, Yuanlong Shao¹, Hujun Bao¹, Jiawei Han²

Zhejiang University¹, University of Illinois at Urbana–Champaign²

01 Sep 2011-IEEE Transactions on Knowledge and Data Engineering

TL;DR: This paper introduces a regularized probabilistic model for data clustering based on manifold structure, called the Laplacian regularized Gaussian Mixture Model (LapGMM). The data manifold is modeled by a nearest neighbor graph, and the graph structure is incorporated into the maximum likelihood objective function.

Abstract: Gaussian Mixture Models (GMMs) are among the most statistically mature methods for clustering. Each cluster is represented by a Gaussian distribution. The clustering process thereby turns to estimate the parameters of the Gaussian mixture, usually by the Expectation-Maximization algorithm. In this paper, we consider the case where the probability distribution that generates the data is supported on a submanifold of the ambient space. It is natural to assume that if two points are close in the intrinsic geometry of the probability distribution, then their conditional probability distributions are similar. Specifically, we introduce a regularized probabilistic model based on manifold structure for data clustering, called Laplacian regularized Gaussian Mixture Model (LapGMM). The data manifold is modeled by a nearest neighbor graph, and the graph structure is incorporated in the maximum likelihood objective function. As a result, the obtained conditional probability distribution varies smoothly along the geodesics of the data manifold. Experimental results on real data sets demonstrate the effectiveness of the proposed approach.
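
As a rough illustration of the idea in the abstract above, the sketch below builds a symmetric k-nearest-neighbor graph and evaluates a graph-Laplacian smoothness penalty on per-point cluster posteriors. The function names (`knn_graph`, `laplacian_penalty`), the 0/1 edge weights, and the quadratic form of the penalty are assumptions made for illustration only; the paper's actual regularizer and its EM updates are not given on this page and may differ.

```python
import numpy as np

def knn_graph(X, k):
    """Symmetric 0/1 adjacency matrix of a k-nearest-neighbor graph (hypothetical helper)."""
    n = X.shape[0]
    # Pairwise squared Euclidean distances; exclude self-matches.
    d2 = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=2)
    np.fill_diagonal(d2, np.inf)
    nbrs = np.argsort(d2, axis=1)[:, :k]
    W = np.zeros((n, n))
    W[np.repeat(np.arange(n), k), nbrs.ravel()] = 1.0
    # Keep an edge if either endpoint lists the other as a neighbor.
    return np.maximum(W, W.T)

def laplacian_penalty(W, F):
    """trace(F^T L F) with L = D - W, i.e. 0.5 * sum_ij W_ij * ||F_i - F_j||^2.

    F holds one row of cluster posteriors per data point. The value is small
    when neighboring points have similar posteriors.
    """
    L = np.diag(W.sum(axis=1)) - W
    return float(np.trace(F.T @ L @ F))
```

In a regularized likelihood of this kind, a term like `laplacian_penalty(W, F)` would be subtracted from the log-likelihood with some weight, so that posteriors which vary sharply between graph neighbors are discouraged.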

209 citations

Journal Article

Video stabilization based on a 3D perspective camera model

Guofeng Zhang¹, Wei Hua¹, Xueying Qin¹, Yuanlong Shao¹, Hujun Bao¹

Zhejiang University¹

01 Oct 2009-The Visual Computer

TL;DR: This paper presents a novel approach to stabilizing video sequences based on a 3D perspective camera model. It uses an approximate geometry representation and analyzes the resulting warping errors, showing that by appropriately constraining the warping error, visually plausible results can be achieved even using planar structures.

Abstract: This paper presents a novel approach to stabilize video sequences based on a 3D perspective camera model. Compared to previous methods which are based on simplified models, our stabilization system can work in situations where significant depth variations exist in the scenes and the camera undergoes large translational movement. We formulate the stabilization problem as a quadratic cost function on smoothness and similarity constraints. This allows us to precisely control the smoothness by solving a sparse linear system of equations. By taking advantage of the sparseness, our optimization process is very efficient. Instead of recovering dense depths, we use approximate geometry representation and analyze the resulting warping errors. We show that by appropriately constraining warping error, visually plausible results can be achieved even using planar structures. A variety of experiments have been implemented, which demonstrates the robustness and efficiency of our approach.
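
The "quadratic cost on smoothness and similarity, solved as a linear system" idea in the abstract above can be sketched on a toy one-dimensional camera path. This is only an assumed simplification, not the paper's formulation: the function name `smooth_path`, the second-difference smoothness term, and the weight `lam` are hypothetical, and a dense solver is used for brevity where a real system would exploit the banded, sparse structure of the matrix.

```python
import numpy as np

def smooth_path(c, lam):
    """Minimize ||p - c||^2 + lam * ||D p||^2 over p (toy sketch).

    c   : original camera path, one sample per frame (needs at least 3 frames)
    lam : weight of the smoothness term relative to the similarity term
    D   : second-difference operator, so D p approximates acceleration
    """
    c = np.asarray(c, dtype=float)
    n = c.shape[0]
    D = np.zeros((n - 2, n))
    for t in range(n - 2):
        D[t, t:t + 3] = [1.0, -2.0, 1.0]
    # Setting the gradient to zero gives the normal equations
    # (I + lam * D^T D) p = c, a banded (pentadiagonal) system.
    A = np.eye(n) + lam * (D.T @ D)
    return np.linalg.solve(A, c)
```

Because the cost is quadratic, the smoothed path is obtained in a single solve, and `lam` directly controls how strongly jitter is suppressed relative to staying close to the original path.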

77 citations

Proceedings Article

Semi-supervised topic modeling for image annotation

Yuanlong Shao¹, Yuan Zhou¹, Xiaofei He¹, Deng Cai¹, Hujun Bao¹

Zhejiang University¹

19 Oct 2009

TL;DR: A novel technique for semi-supervised image annotation is proposed which introduces a harmonic regularizer based on the graph Laplacian of the data into the probabilistic semantic model for learning latent topics of the images.

Abstract: We propose a novel technique for semi-supervised image annotation which introduces a harmonic regularizer based on the graph Laplacian of the data into the probabilistic semantic model for learning latent topics of the images. By using a probabilistic semantic model, we connect visual features and textual annotations of images by their latent topics. Meanwhile, we incorporate the manifold assumption into the model to say that the probabilities of latent topics of images are drawn from a manifold, so that for images sharing similar visual features or the same annotations, their probability distribution of latent topics should also be similar. We create a nearest neighbor graph to model the manifold and propose a regularized EM algorithm to simultaneously learn a generative model and assign probability density of latent topics to images discriminatively. In this way, databases with very few labeled images can be annotated better than previous works.

16 citations

Posted Content

A Growing Long-term Episodic & Semantic Memory

Chris Tar, Marc Pickett, Rami Eid, Yuanlong Shao

20 Oct 2016-arXiv: Artificial Intelligence

TL;DR: This paper describes a lifelong learning system that leverages a fast, though non-differentiable, content-addressable memory, which can be exploited to encode both a long history of sequential episodic knowledge and semantic knowledge over many episodes for an unbounded number of domains.

Abstract: The long-term memory of most connectionist systems lies entirely in the weights of the system. Since the number of weights is typically fixed, this bounds the total amount of knowledge that can be learned and stored. Though this is not normally a problem for a neural network designed for a specific task, such a bound is undesirable for a system that continually learns over an open range of domains. To address this, we describe a lifelong learning system that leverages a fast, though non-differentiable, content-addressable memory which can be exploited to encode both a long history of sequential episodic knowledge and semantic knowledge over many episodes for an unbounded number of domains. This opens the door for investigation into transfer learning, and leveraging prior knowledge that has been learned over a lifetime of experiences to new domains.

11 citations

Journal Article

Variational inference with graph regularization for image annotation

Yuanlong Shao¹, Yuan Zhou¹, Deng Cai¹

Zhejiang University¹

24 Feb 2011-ACM Transactions on Intelligent Systems and Technology

TL;DR: The graph regularization approaches are extended to a more general case where the regularization is imposed on the factorized variational distributions, instead of posterior distributions implicitly involved in EM-like algorithms.

Abstract: Image annotation is a typical area where there are multiple types of attributes associated with each individual image. In order to achieve better performance, it is important to develop effective modeling by utilizing prior knowledge. In this article, we extend the graph regularization approaches to a more general case where the regularization is imposed on the factorized variational distributions, instead of posterior distributions implicitly involved in EM-like algorithms. In this way, the problem modeling can be more flexible, and we can choose any factor in the problem domain to impose graph regularization wherever there are similarity constraints among the instances. We formulate the problem formally and show its geometrical background in manifold learning. We also design two practically effective algorithms and analyze their properties such as the convergence. Finally, we apply our approach to image annotation and show the performance improvement of our algorithm.

6 citations

Cited by

Posted Content

Learning without Forgetting

Zhizhong Li¹, Derek Hoiem¹

University of Illinois at Urbana–Champaign¹

29 Jun 2016-arXiv: Computer Vision and Pattern Recognition

TL;DR: This work proposes the Learning without Forgetting method, which uses only new task data to train the network while preserving the original capabilities, and performs favorably compared to commonly used feature extraction and fine-tuning adaption techniques.

Abstract: When building a unified vision system or gradually adding new capabilities to a system, the usual assumption is that training data for all tasks is always available. However, as the number of tasks grows, storing and retraining on such data becomes infeasible. A new problem arises where we add new capabilities to a Convolutional Neural Network (CNN), but the training data for its existing capabilities are unavailable. We propose our Learning without Forgetting method, which uses only new task data to train the network while preserving the original capabilities. Our method performs favorably compared to commonly used feature extraction and fine-tuning adaption techniques and performs similarly to multitask learning that uses original task data we assume unavailable. A more surprising observation is that Learning without Forgetting may be able to replace fine-tuning with similar old and new task datasets for improved new task performance.

1,037 citations

Posted Content

Variational Deep Embedding: An Unsupervised and Generative Approach to Clustering

Zhuxi Jiang¹, Yin Zheng², Huachun Tan¹, Bangsheng Tang, Hanning Zhou

Beijing Institute of Technology¹, Tencent²

16 Nov 2016-arXiv: Computer Vision and Pattern Recognition

TL;DR: In this article, the authors proposed Variational Deep Embedding (VaDE), a novel unsupervised generative clustering approach within the framework of Variational Auto-Encoder (VAE).

Abstract: Clustering is among the most fundamental tasks in computer vision and machine learning. In this paper, we propose Variational Deep Embedding (VaDE), a novel unsupervised generative clustering approach within the framework of Variational Auto-Encoder (VAE). Specifically, VaDE models the data generative procedure with a Gaussian Mixture Model (GMM) and a deep neural network (DNN): 1) the GMM picks a cluster; 2) from which a latent embedding is generated; 3) then the DNN decodes the latent embedding into observables. Inference in VaDE is done in a variational way: a different DNN is used to encode observables to latent embeddings, so that the evidence lower bound (ELBO) can be optimized using Stochastic Gradient Variational Bayes (SGVB) estimator and the reparameterization trick. Quantitative comparisons with strong baselines are included in this paper, and experimental results show that VaDE significantly outperforms the state-of-the-art clustering methods on 4 benchmarks from various modalities. Moreover, by VaDE's generative nature, we show its capability of generating highly realistic samples for any specified cluster, without using supervised information during training. Lastly, VaDE is a flexible and extensible framework for unsupervised generative clustering, more general mixture models than GMM can be easily plugged in.

401 citations

Journal Article

Laplacian Regularized Low-Rank Representation and Its Applications

Ming Yin¹, Junbin Gao², Zhouchen Lin³

Guangdong University of Technology¹, Charles Sturt University², Peking University³

01 Mar 2016-IEEE Transactions on Pattern Analysis and Machine Intelligence

TL;DR: The proposed general Laplacian regularized low-rank representation framework takes advantage of the graph regularizer to not only represent the global low-dimensional structures but also capture the intrinsic non-linear geometric information in data.

Abstract: Low-rank representation (LRR) has recently attracted a great deal of attention due to its pleasing efficacy in exploring low-dimensional subspace structures embedded in data. For a given set of observed data corrupted with sparse errors, LRR aims at learning a lowest-rank representation of all data jointly. LRR has broad applications in pattern recognition, computer vision and signal processing. In the real world, data often reside on low-dimensional manifolds embedded in a high-dimensional ambient space. However, the LRR method does not take into account the non-linear geometric structures within data, thus the locality and similarity information among data may be missing in the learning process. To improve LRR in this regard, we propose a general Laplacian regularized low-rank representation framework for data representation where a hypergraph Laplacian regularizer can be readily introduced into, i.e., a Non-negative Sparse Hyper-Laplacian regularized LRR model (NSHLRR). By taking advantage of the graph regularizer, our proposed method not only can represent the global low-dimensional structures, but also capture the intrinsic non-linear geometric information in data. The extensive experimental results on image clustering, semi-supervised image classification and dimensionality reduction tasks demonstrate the effectiveness of the proposed method.

367 citations

Proceedings Article

Learning Community Embedding with Community Detection and Node Embedding on Graphs

Sandro Cavallari¹, Vincent W. Zheng², Hongyun Cai², Kevin Chen-Chuan Chang³, Erik Cambria¹

Nanyang Technological University¹, Agency for Science, Technology and Research², University of Illinois at Urbana–Champaign³

06 Nov 2017

TL;DR: This paper studies an important yet largely under-explored setting of graph embedding, i.e., embedding communities instead of individual nodes, and proposes a novel framework that jointly solves community embedding, community detection and node embedding.

Abstract: In this paper, we study an important yet largely under-explored setting of graph embedding, i.e., embedding communities instead of each individual nodes. We find that community embedding is not only useful for community-level applications such as graph visualization, but also beneficial to both community detection and node classification. To learn such embedding, our insight hinges upon a closed loop among community embedding, community detection and node embedding. On the one hand, node embedding can help improve community detection, which outputs good communities for fitting better community embedding. On the other hand, community embedding can be used to optimize the node embedding by introducing a community-aware high-order proximity. Guided by this insight, we propose a novel community embedding framework that jointly solves the three tasks together. We evaluate such a framework on multiple real-world datasets, and show that it improves graph visualization and outperforms state-of-the-art baselines in various application tasks, e.g., community detection and node classification.

345 citations

Journal Article

Subspace video stabilization

Feng Liu¹, Michael Gleicher², Jue Wang³, Hailin Jin³, Aseem Agarwala³

Portland State University¹, University of Wisconsin-Madison², Adobe Systems³

02 Feb 2011-ACM Transactions on Graphics

TL;DR: This article focuses on the problem of transforming a set of input 2D motion trajectories so that they are both smooth and resemble visually plausible views of the imaged scene, and offers the first method that both achieves high-quality video stabilization and is practical enough for consumer applications.

Abstract: We present a robust and efficient approach to video stabilization that achieves high-quality camera motion for a wide range of videos. In this article, we focus on the problem of transforming a set of input 2D motion trajectories so that they are both smooth and resemble visually plausible views of the imaged scene; our key insight is that we can achieve this goal by enforcing subspace constraints on feature trajectories while smoothing them. Our approach assembles tracked features in the video into a trajectory matrix, factors it into two low-rank matrices, and performs filtering or curve fitting in a low-dimensional linear space. In order to process long videos, we propose a moving factorization that is both efficient and streamable. Our experiments confirm that our approach can efficiently provide stabilization results comparable with prior 3D methods in cases where those methods succeed, but also provides smooth camera motions in cases where such approaches often fail, such as videos that lack parallax. The presented approach offers the first method that both achieves high-quality video stabilization and is practical enough for consumer applications.
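
The pipeline described in the abstract above (trajectory matrix, low-rank factorization, filtering in the low-dimensional space) can be sketched as follows. This is a simplified illustration under stated assumptions, not the paper's method: it assumes a complete trajectory matrix with no missing tracks, uses a plain truncated SVD in place of the moving factorization, and uses a box filter; the function name `smooth_trajectories` and its parameters are hypothetical.

```python
import numpy as np

def smooth_trajectories(M, rank, window):
    """Low-rank trajectory smoothing (toy sketch).

    M      : (2 * n_features, n_frames) matrix of stacked x/y feature tracks
    rank   : number of basis trajectories kept
    window : odd length of the moving-average filter, in frames
    """
    # Factor M ~= C @ E: C holds per-feature coefficients,
    # E holds `rank` basis trajectories over time.
    U, s, Vt = np.linalg.svd(M, full_matrices=False)
    C = U[:, :rank] * s[:rank]
    E = Vt[:rank, :]
    # Smooth only the low-dimensional temporal basis, padding the ends
    # by edge replication so the output keeps one sample per frame.
    pad = window // 2
    kernel = np.ones(window) / window
    E_pad = np.pad(E, ((0, 0), (pad, pad)), mode="edge")
    E_smooth = np.stack([np.convolve(row, kernel, mode="valid") for row in E_pad])
    # Recombine: every feature track is smoothed consistently.
    return C @ E_smooth
```

Filtering the shared basis rather than each track independently keeps the smoothed tracks inside the same low-dimensional subspace, which is the constraint the abstract identifies as the key insight.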

318 citations
