scispace - formally typeset
Search or ask a question
Author

Jiangchuan Zheng

Bio: Jiangchuan Zheng is an academic researcher from Hong Kong University of Science and Technology. The author has contributed to research in topics: Collaborative filtering & Population. The author has an hindex of 9, co-authored 10 publications receiving 307 citations.

Papers
More filters
Proceedings Article
14 Jul 2013
TL;DR: This paper unify multiple different trajectory regression problems in a multi-task framework by introducing a novel crosstask regularization which encourages temporal smoothness on the change of road travel costs and proposes an efficient block coordinate descent method to solve the resulting problem.
Abstract: Road travel costs are important knowledge hidden in large-scale GPS trajectory data sets, the discovery of which can benefit many applications such as intelligent route planning and automatic driving navigation. While there are previous studies which tackled this task by modeling it as a regression problem with spatial smoothness taken into account, they unreasonably assumed that the latent cost of each road remains unchanged over time. Other works on route planning and recommendation that have considered temporal factors simply assumed that the temporal dynamics be known in advance as a parametric function over time, which is not faithful to reality. To overcome these limitations, in this paper, we propose an extension to a previous static trajectory regression framework by learning the temporal dynamics of road travel costs in an innovative non-parametric manner which can effectively overcome the temporal sparsity problem. In particular, we unify multiple different trajectory regression problems in a multi-task framework by introducing a novel crosstask regularization which encourages temporal smoothness on the change of road travel costs. We then propose an efficient block coordinate descent method to solve the resulting problem by exploiting its separable structures and prove its convergence to global optimum. Experiments conducted on both synthetic and real data sets demonstrate the effectiveness of our method and its improved accuracy on travel time prediction.

87 citations

Proceedings ArticleDOI
05 Sep 2012
TL;DR: A probabilistic framework which can automatically learn characteristic behavior patterns in users' daily lives from mass amount of mobile data in unsupervised setting and exploit it to predict user activities is proposed.
Abstract: Human behavior understanding is a fundamental problem in many ubiquitous applications It aims to automatically uncover and quantify characteristic behavior patterns in users' daily lives as well as disclose behavior clustering structure among multiple users The key challenge is how to define a naturally interpreted representation for users' daily behavior patterns, which can be easily exploited to not only uncover the behavior similarity among multiple users but also predict users' future activities In this paper, we define such a representation, and propose a probabilistic framework which can automatically learn it from mass amount of mobile data in unsupervised setting and exploit it to predict user activities By an appropriate information sharing among multiple users, this framework overcomes single-user data sparsity problem and effectively identifies behavior clustering structures in a set of users Experiments conducted on a public reality mining data set demonstrate the effectiveness and accuracy of our methods

68 citations

Proceedings Article
27 Jul 2014
TL;DR: This paper develops a robust IRL framework that can accurately estimate the reward function in the presence of behavior noise, and introduces a novel latent variable characterizing the reliability of each expert action and uses Laplace distribution as its prior.
Abstract: Inverse reinforcement learning (IRL) aims to recover the reward function underlying a Markov Decision Process from behaviors of experts in support of decision-making. Most recent work on IRL assumes the same level of trustworthiness of all expert behaviors, and frames IRL as a process of seeking reward function that makes those behaviors appear (near)- optimal. However, it is common in reality that noisy expert behaviors disobeying the optimal policy exist, which may degrade the IRL performance significantly. To address this issue, in this paper, we develop a robust IRL framework that can accurately estimate the reward function in the presence of behavior noise. In particular, we focus on a special type of behavior noise referred to as sparse noise due to its wide popularity in real-world behavior data. To model such noise, we introduce a novel latent variable characterizing the reliability of each expert action and use Laplace distribution as its prior. We then devise an EM algorithm with a novel variational inference procedure in the E-step, which can automatically identify and remove behavior noise in reward learning. Experiments on both synthetic data and real vehicle routing data with noticeable behavior noise show significant improvement of our method over previous approaches in learning accuracy, and also show its power in de-noising behavior data.

63 citations

Proceedings ArticleDOI
08 Sep 2013
TL;DR: A Bayesian model of mobility in populations (i.e., groups without spatial or social interconnections) that intelligently shares temporal parameters between people, but keeps the spatial parameters specific to individuals is presented.
Abstract: In recent years, researchers have sought to capture the daily life location behaviour of groups of people for exploratory, inference, and predictive purposes. However, development of such approaches has been limited by the requirement of personal semantic labels for locations or social/spatial overlap between individuals in the group. To address this shortcoming, we present a Bayesian model of mobility in populations (i.e., groups without spatial or social interconnections) that is not subject to any of these requirements. The model intelligently shares temporal parameters between people, but keeps the spatial parameters specific to individuals. To illustrate the advantages of population modelling, we apply our model to the difficult problem of overcoming data sparsity in location prediction systems, using the Nokia dataset comprising 38 individuals, and find a factor of 2.4 improvement in location prediction performance against a state-of-the-art model when training on only 20 hours of observations.

44 citations

Proceedings ArticleDOI
18 Mar 2013
TL;DR: Although a single user's mobile data is far from sufficient to reveal his characteristic behavior, it is shown that when exploiting a large number of users' mobile data in a principled collaborative way which facilitate similar users' data to complement each other, representative routine patterns can be revealed and each user can be characterized properly.
Abstract: Recognizing and classifying users' routine behavior patterns from sensor data has been a hot topic in pervasive computing. Its objective is to automatically discover recurrent routine patterns in a user's daily life by leveraging the multimodal data generated from wearable sensors such as mobile phones. This kind of knowledge can be utilized in many ways such as identifying similar users in terms of their behaviors, providing behavior contexts to enable advanced human-centered applications, etc. While numerous works have been done in this area, most of them rely on densely sampled mobile data collected from specially-programmed sensors that can “follow” people throughout the day. In this paper, we study how to achieve the same objective when the mobile data presented is much sparser, such as traditional mobile phone data where a user's location is reported only when he makes a call. Although a single user's mobile data is far from sufficient to reveal his characteristic behavior, we show that when exploiting a large number of users' mobile data in a principled collaborative way which facilitate similar users' data to complement each other, representative routine patterns can be revealed and each user can be characterized properly. Experiments on synthetic and real mobile phone data set demonstrate the effectiveness of our methods, and also show our model's ability in predicting human activity using the patterns learned.

22 citations


Cited by
More filters
Christopher M. Bishop1
01 Jan 2006
TL;DR: Probability distributions of linear models for regression and classification are given in this article, along with a discussion of combining models and combining models in the context of machine learning and classification.
Abstract: Probability Distributions.- Linear Models for Regression.- Linear Models for Classification.- Neural Networks.- Kernel Methods.- Sparse Kernel Machines.- Graphical Models.- Mixture Models and EM.- Approximate Inference.- Sampling Methods.- Continuous Latent Variables.- Sequential Data.- Combining Models.

10,141 citations

Posted Content
TL;DR: Multi-task learning (MTL) as mentioned in this paper is a learning paradigm in machine learning and its aim is to leverage useful information contained in multiple related tasks to help improve the generalization performance of all the tasks.
Abstract: Multi-Task Learning (MTL) is a learning paradigm in machine learning and its aim is to leverage useful information contained in multiple related tasks to help improve the generalization performance of all the tasks. In this paper, we give a survey for MTL from the perspective of algorithmic modeling, applications and theoretical analyses. For algorithmic modeling, we give a definition of MTL and then classify different MTL algorithms into five categories, including feature learning approach, low-rank approach, task clustering approach, task relation learning approach and decomposition approach as well as discussing the characteristics of each approach. In order to improve the performance of learning tasks further, MTL can be combined with other learning paradigms including semi-supervised learning, active learning, unsupervised learning, reinforcement learning, multi-view learning and graphical models. When the number of tasks is large or the data dimensionality is high, we review online, parallel and distributed MTL models as well as dimensionality reduction and feature hashing to reveal their computational and storage advantages. Many real-world applications use MTL to boost their performance and we review representative works in this paper. Finally, we present theoretical analyses and discuss several future directions for MTL.

1,202 citations

Posted Content
TL;DR: Wang et al. as discussed by the authors proposed a Deep Multi-View Spatial-Temporal Network (DMVST-Net) framework to model both spatial and temporal relations, which can help the city pre-allocate resources to meet travel demand and to reduce empty taxis on streets which waste energy and worsen the traffic congestion.
Abstract: Taxi demand prediction is an important building block to enabling intelligent transportation systems in a smart city. An accurate prediction model can help the city pre-allocate resources to meet travel demand and to reduce empty taxis on streets which waste energy and worsen the traffic congestion. With the increasing popularity of taxi requesting services such as Uber and Didi Chuxing (in China), we are able to collect large-scale taxi demand data continuously. How to utilize such big data to improve the demand prediction is an interesting and critical real-world problem. Traditional demand prediction methods mostly rely on time series forecasting techniques, which fail to model the complex non-linear spatial and temporal relations. Recent advances in deep learning have shown superior performance on traditionally challenging tasks such as image classification by learning the complex features and correlations from large-scale data. This breakthrough has inspired researchers to explore deep learning techniques on traffic prediction problems. However, existing methods on traffic prediction have only considered spatial relation (e.g., using CNN) or temporal relation (e.g., using LSTM) independently. We propose a Deep Multi-View Spatial-Temporal Network (DMVST-Net) framework to model both spatial and temporal relations. Specifically, our proposed model consists of three views: temporal view (modeling correlations between future demand values with near time points via LSTM), spatial view (modeling local spatial correlation via local CNN), and semantic view (modeling correlations among regions sharing similar temporal patterns). Experiments on large-scale real taxi demand data demonstrate effectiveness of our approach over state-of-the-art methods.

302 citations

Proceedings ArticleDOI
13 Sep 2014
TL;DR: The results show that the proposed solution significantly outperforms three baseline algorithms by selecting 10.0% -- 73.5% fewer participants on average under the same probabilistic coverage constraint.
Abstract: This paper proposes a novel participant selection framework, named CrowdRecruiter, for mobile crowdsensing. CrowdRecruiter operates on top of energy-efficient Piggyback Crowdsensing (PCS) task model and minimizes incentive payments by selecting a small number of participants while still satisfying probabilistic coverage constraint. In order to achieve the objective when piggybacking crowdsensing tasks with phone calls, CrowdRecruiter first predicts the call and coverage probability of each mobile user based on historical records. It then efficiently computes the joint coverage probability of multiple users as a combined set and selects the near-minimal set of participants, which meets coverage ratio requirement in each sensing cycle of the PCS task. We evaluated CrowdRecruiter extensively using a large-scale real-world dataset and the results show that the proposed solution significantly outperforms three baseline algorithms by selecting 10.0% -- 73.5% fewer participants on average under the same probabilistic coverage constraint.

248 citations

Proceedings Article
12 Feb 2016
TL;DR: This work scientifically and systematically study the feasibility of career path prediction from social network data and seamlessly fuse information from multiple social networks to comprehensively describe a user and characterize progressive properties of his or her career path.
Abstract: People go to fortune tellers in hopes of learning things about their future. A future career path is one of the topics most frequently discussed. But rather than rely on "black arts" to make predictions, in this work we scientifically and systematically study the feasibility of career path prediction from social network data. In particular, we seamlessly fuse information from multiple social networks to comprehensively describe a user and characterize progressive properties of his or her career path. This is accomplished via a multi-source learning framework with fused lasso penalty, which jointly regularizes the source and career-stage relatedness. Extensive experiments on real-world data confirm the accuracy of our model.

234 citations