Showing papers by "Xiaolei Ma published in 2016"

PDF

Open Access

Journal Article•DOI•

Predicting Short-Term Subway Ridership and Prioritizing Its Influential Factors Using Gradient Boosting Decision Trees

[...]

Chuan Ding¹, Chuan Ding², Donggen Wang, Xiaolei Ma¹, Haiying Li² - Show less +1 more•Institutions (2)

Beihang University¹, Beijing Jiaotong University²

28 Oct 2016-Sustainability

TL;DR: In this paper, a relatively recent data mining approach called gradient boosting decision trees (GBDT) is applied to short-term subway ridership prediction and used to capture the associations with the independent variables.

...read moreread less

Abstract: Understanding the relationship between short-term subway ridership and its influential factors is crucial to improving the accuracy of short-term subway ridership prediction Although there has been a growing body of studies on short-term ridership prediction approaches, limited effort is made to investigate the short-term subway ridership prediction considering bus transfer activities and temporal features To fill this gap, a relatively recent data mining approach called gradient boosting decision trees (GBDT) is applied to short-term subway ridership prediction and used to capture the associations with the independent variables Taking three subway stations in Beijing as the cases, the short-term subway ridership and alighting passengers from its adjacent bus stops are obtained based on transit smart card data To optimize the model performance with different combinations of regularization parameters, a series of GBDT models are built with various learning rates and tree complexities by fitting a maximum of trees The optimal model performance confirms that the gradient boosting approach can incorporate different types of predictors, fit complex nonlinear relationships, and automatically handle the multicollinearity effect with high accuracy In contrast to other machine learning methods—or “black-box” procedures—the GBDT model can identify and rank the relative influences of bus transfer activities and temporal features on short-term subway ridership These findings suggest that the GBDT model has considerable advantages in improving short-term subway ridership prediction in a multimodal public transportation system

...read moreread less

123 citations

Journal Article•DOI•

Headway-based bus bunching prediction using transit smart card data

[...]

Haiyang Yu¹, Chen Dongwei¹, Wu Zhihai¹, Xiaolei Ma¹, Yunpeng Wang¹ - Show less +1 more•Institutions (1)

Beihang University¹

01 Nov 2016-Transportation Research Part C-emerging Technologies

TL;DR: A predictive framework to capture the stop-level headway irregularity based on transit smart card data can provide timely and accurate information for potential bus bunching prevention and inform passengers when the next bus will arrive and will greatly increase transit ridership and reduce operating costs for transit authorities.

...read moreread less

Abstract: Bus bunching severely deteriorates the quality of transit service with poor on-time performance and excessive waiting time. To mitigate bus bunching, this paper presents a predictive framework to capture the stop-level headway irregularity based on transit smart card data. Historical headway, passenger demands, and travel time are utilized to model the headway fluctuation at the following stops. A Least Squares Support Vector Machine regression is established to detect bus bunching with the predicted headway pattern. An empirical experiment with two bus routes in Beijing is conducted to demonstrate the effectiveness of the proposed approach. The predictive method can successfully identify more than 95% of bus bunching occurrences in comparison with other well-established prediction algorithms. Moreover, the detection accuracy does not significantly deteriorate as the prediction lead time increases. Instead of regularizing the headways at all costs by adopting certain correction actions, the proposed framework can provide timely and accurate information for potential bus bunching prevention and inform passengers when the next bus will arrive. This feature will greatly increase transit ridership and reduce operating costs for transit authorities.

...read moreread less

64 citations

Journal Article•DOI•

Understanding Freight Trip-Chaining Behavior Using a Spatial Data-Mining Approach with GPS Data:

[...]

Xiaolei Ma¹, Yong Wang², Edward McCormack³, Yinhai Wang³•Institutions (3)

Beihang University¹, Chongqing Jiaotong University², University of Washington³

11 Aug 2016-Transportation Research Record

TL;DR: In this paper, a series of data-mining algorithms to extract an individual truck's trip-chaining information from multiday GPS data was presented, which showed that 51% of the trucks in the data set had at least one trip chain.

...read moreread less

Abstract: Freight systems are a critical yet complex component of the transportation domain. Understanding the dynamic of freight movements will help in better management of freight demand and eventually improve freight system efficiency. This paper presents a series of data-mining algorithms to extract an individual truck’s trip-chaining information from multiday GPS data. Individual trucks’ anchor points were identified with the spatial clustering algorithm for density-based spatial clustering of applications with noise. The anchor points were linked to construct individual trucks’ trip chains with 3-day GPS data, which showed that 51% of the trucks in the data set had at least one trip chain. A partitioning around medoids nonhierarchical clustering algorithm was applied to group trucks with similar trip-chaining characteristics. Four clusters were generated and validated by visual inspection when the trip-chaining statistics were distinct from each other. This study sheds light on modeling freight-chaining behav...

...read moreread less

22 citations

Journal Article•DOI•

A Two-Stage Algorithm for Origin-Destination Matrices Estimation Considering Dynamic Dispersion Parameter for Route Choice

[...]

Yong Wang¹, Yong Wang², Xiaolei Ma³, Yong Liu¹, Ke Gong¹, Kristian Henrickson², Maozeng Xu¹, Yinhai Wang² - Show less +4 more•Institutions (3)

Chongqing Jiaotong University¹, University of Washington², Beihang University³

13 Jan 2016-PLOS ONE

TL;DR: An innovative computational approach to accurately estimate OD matrices using link-level traffic flow data is proposed, and useful insight for optimal parameter selection in modeling travelers’ route choice behavior is provided.

...read moreread less

Abstract: This paper proposes a two-stage algorithm to simultaneously estimate origin-destination (OD) matrix, link choice proportion, and dispersion parameter using partial traffic counts in a congested network. A non-linear optimization model is developed which incorporates a dynamic dispersion parameter, followed by a two-stage algorithm in which Generalized Least Squares (GLS) estimation and a Stochastic User Equilibrium (SUE) assignment model are iteratively applied until the convergence is reached. To evaluate the performance of the algorithm, the proposed approach is implemented in a hypothetical network using input data with high error, and tested under a range of variation coefficients. The root mean squared error (RMSE) of the estimated OD demand and link flows are used to evaluate the model estimation results. The results indicate that the estimated dispersion parameter theta is insensitive to the choice of variation coefficients. The proposed approach is shown to outperform two established OD estimation methods and produce parameter estimates that are close to the ground truth. In addition, the proposed approach is applied to an empirical network in Seattle, WA to validate the robustness and practicality of this methodology. In summary, this study proposes and evaluates an innovative computational approach to accurately estimate OD matrices using link-level traffic flow data, and provides useful insight for optimal parameter selection in modeling travelers' route choice behavior.

...read moreread less

18 citations