Proceedings ArticleDOI
User identification in cyber-physical space: a case study on mobile query logs and trajectories
Tianyi Hao,Jingbo Zhou,Yunsheng Cheng,Longbo Huang,Haishan Wu +4 more
- pp 71
TLDR
A novel framework for user identification in cyber-physical space is proposed, which consists of three key steps: modeling the location distribution of each IP address; computing the co-occurrence with an inverted index to reduce the space and time cost; and a learning-to-rank tactic to fuse user's features shared in both spaces to improve the accuracy.Abstract:
User identification across domains draws lots of research effort in recent years. Although most of existing works focus on user identification in a single space, in this paper, we first try to identify users by fusing their activities in cyber space and physical space, which helps us obtain a comprehensive understanding about users' online behaviours as well as offline visitation. Out profound insight to tackle this problem is that we can build a connection between the cyber space and the physical space with the stable location distribution of IP addresses. Thus, we propose a novel framework for user identification in cyber-physical space, which consists of three key steps: 1) modeling the location distribution of each IP address; 2) computing the co-occurrence with an inverted index to reduce the space and time cost; and 3) a learning-to-rank tactic to fuse user's features shared in both spaces to improve the accuracy. We conduct experiments to identify individual users from mobile query logs (generated in cyber space) and trajectory data (generated in physical space) to demonstrate the efficiency and effectiveness of our framework.read more
Citations
More filters
Journal ArticleDOI
Human-in-the-Loop Mobile Networks: A Survey of Recent Advancements
TL;DR: The goal of this survey is to summarize recent results that focus on understanding and exploiting the human factor in mobile networks and to discuss novelties of these formulations, adopted methodologies, and interesting results.
Journal ArticleDOI
High-performance spatiotemporal trajectory matching across heterogeneous data sources
TL;DR: The experimental results illustrate that the proposed approach outperforms other similarity metrics in accuracy, and the Spark-based framework greatly improves the efficiency in spatiotemporal trajectory matching.
Journal ArticleDOI
A Survey of Across Social Networks User Identification
TL;DR: This paper first systematically introduces the application of ASNUI in the field of social computing, then states its applications and challenges, and reviews the adopted models, frameworks, and performance comparison state-of-the-art techniques used inASNUI.
Journal ArticleDOI
User Identification across Asynchronous Mobility Trajectories
TL;DR: This paper advocates an identification resolution method based on the most frequently distributed TOP-N regions regarding user trajectories, which is substantially effective and efficiency for user identification.
Proceedings Article
Redefining the Offline Retail Experience: Designing Product Recommendation Systems for Fashion Stores
TL;DR: Preliminary analyses indicate that sensor information regarding garment and user identification, as well as further context data help to improve product recommendations in fashion stores.
References
More filters
Journal ArticleDOI
MapReduce: simplified data processing on large clusters
Jeffrey Dean,Sanjay Ghemawat +1 more
TL;DR: This paper presents the implementation of MapReduce, a programming model and an associated implementation for processing and generating large data sets that runs on a large cluster of commodity machines and is highly scalable.
Journal ArticleDOI
MapReduce: simplified data processing on large clusters
Jeffrey Dean,Sanjay Ghemawat +1 more
TL;DR: This presentation explains how the underlying runtime system automatically parallelizes the computation across large-scale clusters of machines, handles machine failures, and schedules inter-machine communication to make efficient use of the network and disks.
Proceedings Article
A density-based algorithm for discovering clusters a density-based algorithm for discovering clusters in large spatial databases with noise
TL;DR: In this paper, a density-based notion of clusters is proposed to discover clusters of arbitrary shape, which can be used for class identification in large spatial databases and is shown to be more efficient than the well-known algorithm CLAR-ANS.
Proceedings ArticleDOI
XGBoost: A Scalable Tree Boosting System
Tianqi Chen,Carlos Guestrin +1 more
TL;DR: XGBoost as discussed by the authors proposes a sparsity-aware algorithm for sparse data and weighted quantile sketch for approximate tree learning to achieve state-of-the-art results on many machine learning challenges.
Proceedings Article
A density-based algorithm for discovering clusters in large spatial Databases with Noise
TL;DR: DBSCAN, a new clustering algorithm relying on a density-based notion of clusters which is designed to discover clusters of arbitrary shape, is presented which requires only one input parameter and supports the user in determining an appropriate value for it.