scispace - formally typeset
Proceedings ArticleDOI

User identification in cyber-physical space: a case study on mobile query logs and trajectories

TLDR
A novel framework for user identification in cyber-physical space is proposed, which consists of three key steps: modeling the location distribution of each IP address; computing the co-occurrence with an inverted index to reduce the space and time cost; and a learning-to-rank tactic to fuse user's features shared in both spaces to improve the accuracy.
Abstract
User identification across domains draws lots of research effort in recent years. Although most of existing works focus on user identification in a single space, in this paper, we first try to identify users by fusing their activities in cyber space and physical space, which helps us obtain a comprehensive understanding about users' online behaviours as well as offline visitation. Out profound insight to tackle this problem is that we can build a connection between the cyber space and the physical space with the stable location distribution of IP addresses. Thus, we propose a novel framework for user identification in cyber-physical space, which consists of three key steps: 1) modeling the location distribution of each IP address; 2) computing the co-occurrence with an inverted index to reduce the space and time cost; and 3) a learning-to-rank tactic to fuse user's features shared in both spaces to improve the accuracy. We conduct experiments to identify individual users from mobile query logs (generated in cyber space) and trajectory data (generated in physical space) to demonstrate the efficiency and effectiveness of our framework.

read more

Citations
More filters
Journal ArticleDOI

Human-in-the-Loop Mobile Networks: A Survey of Recent Advancements

TL;DR: The goal of this survey is to summarize recent results that focus on understanding and exploiting the human factor in mobile networks and to discuss novelties of these formulations, adopted methodologies, and interesting results.
Journal ArticleDOI

High-performance spatiotemporal trajectory matching across heterogeneous data sources

TL;DR: The experimental results illustrate that the proposed approach outperforms other similarity metrics in accuracy, and the Spark-based framework greatly improves the efficiency in spatiotemporal trajectory matching.
Journal ArticleDOI

A Survey of Across Social Networks User Identification

TL;DR: This paper first systematically introduces the application of ASNUI in the field of social computing, then states its applications and challenges, and reviews the adopted models, frameworks, and performance comparison state-of-the-art techniques used inASNUI.
Journal ArticleDOI

User Identification across Asynchronous Mobility Trajectories

TL;DR: This paper advocates an identification resolution method based on the most frequently distributed TOP-N regions regarding user trajectories, which is substantially effective and efficiency for user identification.
Proceedings Article

Redefining the Offline Retail Experience: Designing Product Recommendation Systems for Fashion Stores

TL;DR: Preliminary analyses indicate that sensor information regarding garment and user identification, as well as further context data help to improve product recommendations in fashion stores.
References
More filters
Journal ArticleDOI

MapReduce: simplified data processing on large clusters

TL;DR: This paper presents the implementation of MapReduce, a programming model and an associated implementation for processing and generating large data sets that runs on a large cluster of commodity machines and is highly scalable.
Journal ArticleDOI

MapReduce: simplified data processing on large clusters

TL;DR: This presentation explains how the underlying runtime system automatically parallelizes the computation across large-scale clusters of machines, handles machine failures, and schedules inter-machine communication to make efficient use of the network and disks.
Proceedings Article

A density-based algorithm for discovering clusters a density-based algorithm for discovering clusters in large spatial databases with noise

TL;DR: In this paper, a density-based notion of clusters is proposed to discover clusters of arbitrary shape, which can be used for class identification in large spatial databases and is shown to be more efficient than the well-known algorithm CLAR-ANS.
Proceedings ArticleDOI

XGBoost: A Scalable Tree Boosting System

TL;DR: XGBoost as discussed by the authors proposes a sparsity-aware algorithm for sparse data and weighted quantile sketch for approximate tree learning to achieve state-of-the-art results on many machine learning challenges.
Proceedings Article

A density-based algorithm for discovering clusters in large spatial Databases with Noise

TL;DR: DBSCAN, a new clustering algorithm relying on a density-based notion of clusters which is designed to discover clusters of arbitrary shape, is presented which requires only one input parameter and supports the user in determining an appropriate value for it.
Related Papers (5)
Trending Questions (1)
Can Cyber Cell track your IP address?

Out profound insight to tackle this problem is that we can build a connection between the cyber space and the physical space with the stable location distribution of IP addresses.