scispace - formally typeset
Open AccessJournal ArticleDOI

Identifying Different Transportation Modes from Trajectory Data Using Tree-Based Ensemble Classifiers

Zhibin Xiao, +3 more
- 22 Feb 2017 - 
- Vol. 6, Iss: 2, pp 57
TLDR
An approach based on ensemble learning is proposed to infer hybrid transportation modes using only Global Position System (GPS) data and tree-based ensemble models were used instead of traditional methods to classify the different transportation modes.
Abstract
Recognition of transportation modes can be used in different applications including human behavior research, transport management and traffic control. Previous work on transportation mode recognition has often relied on using multiple sensors or matching Geographic Information System (GIS) information, which is not possible in many cases. In this paper, an approach based on ensemble learning is proposed to infer hybrid transportation modes using only Global Position System (GPS) data. First, in order to distinguish between different transportation modes, we used a statistical method to generate global features and extract several local features from sub-trajectories after trajectory segmentation, before these features were combined in the classification stage. Second, to obtain a better performance, we used tree-based ensemble models (Random Forest, Gradient Boosting Decision Tree, and XGBoost) instead of traditional methods (K-Nearest Neighbor, Decision Tree, and Support Vector Machines) to classify the different transportation modes. The experiment results on the later have shown the efficacy of our proposed approach. Among them, the XGBoost model produced the best performance with a classification accuracy of 90.77% obtained on the GEOLIFE dataset, and we used a tree-based ensemble method to ensure accurate feature selection to reduce the model complexity.

read more

Citations
More filters
Journal ArticleDOI

Data fusion and multiple classifier systems for human activity detection and health monitoring: Review and open research directions

TL;DR: The focus of this review is to provide in-depth and comprehensive analysis of data fusion and multiple classifier systems techniques for human activity recognition with emphasis on mobile and wearable devices.
Journal ArticleDOI

Inferring transportation modes from GPS trajectories using a convolutional neural network

TL;DR: This research contrasts the methodology with traditional machine learning algorithms as well as the seminal and most related studies to demonstrate the superiority of the CNN framework.
Journal ArticleDOI

Exploring Human Mobility Patterns in Urban Scenarios: A Trajectory Data Perspective

TL;DR: An integrated computing method to rescale heterogeneous traffic trajectory data, which leverages MLE and BIC is proposed and several important human mobility patterns are obtained and quite a few interesting phenomena are discovered, which lay a solid foundation for future research.
Journal ArticleDOI

Enabling Reproducible Research in Sensor-Based Transportation Mode Recognition With the Sussex-Huawei Dataset

TL;DR: A systematic study of the relevance of statistical and frequency features based on the information theoretical criteria to inform recognition systems and systematically reports the reference performance obtained on all the identified recognition scenarios using a machine-learning recognition pipeline.
Journal ArticleDOI

Real-time accident detection: Coping with imbalanced data.

TL;DR: This study compares the performance of two popular machine learning models, Support Vector Machine (SVM) and Probabilistic Neural Network (PNN), to detect the occurrence of accidents on the Eisenhower expressway in Chicago, and shows that although SVM achieves overall higher accuracy, PNN outperforms SVM regarding the Detection Rate.
References
More filters
Journal ArticleDOI

Greedy function approximation: A gradient boosting machine.

TL;DR: A general gradient descent boosting paradigm is developed for additive expansions based on any fitting criterion, and specific algorithms are presented for least-squares, least absolute deviation, and Huber-M loss functions for regression, and multiclass logistic likelihood for classification.
Proceedings ArticleDOI

XGBoost: A Scalable Tree Boosting System

TL;DR: XGBoost as discussed by the authors proposes a sparsity-aware algorithm for sparse data and weighted quantile sketch for approximate tree learning to achieve state-of-the-art results on many machine learning challenges.

Classification and Regression by randomForest

TL;DR: random forests are proposed, which add an additional layer of randomness to bagging and are robust against overfitting, and the randomForest package provides an R interface to the Fortran programs by Breiman and Cutler.
Proceedings ArticleDOI

XGBoost: A Scalable Tree Boosting System

TL;DR: This paper proposes a novel sparsity-aware algorithm for sparse data and weighted quantile sketch for approximate tree learning and provides insights on cache access patterns, data compression and sharding to build a scalable tree boosting system called XGBoost.
Journal ArticleDOI

Neural network ensembles

TL;DR: It is shown that the remaining residual generalization error can be reduced by invoking ensembles of similar networks, which helps improve the performance and training of neural networks for classification.
Related Papers (5)