scispace - formally typeset
Proceedings ArticleDOI

Effects of Clustering Feature Vectors on Bus Travel Time Prediction: A Case Study

TLDR
In this article, the authors analyzed the use of different feature vectors for clustering and the effect on travel time predictions and showed that the prediction accuracy is highest when only travel times are used as a clustering feature vector.
Abstract
Improving the accuracy of travel time predictions depends on providing the correct inputs as well as the prediction algorithm used. Clustering algorithms can be used to identify the patterns in the data, which can improve the inputs to the prediction algorithm. The feature vectors used for clustering greatly affect the clusters formed and, ultimately, the prediction performance. Clustering being an unsupervised learning technique, the accuracy or correctness of the cluster formed can not be evaluated directly. A possible solution for this would be to link the problem with prediction accuracy and choose the feature vector combination with maximum prediction accuracy. The present study analyses the use of different feature vectors for clustering and the effect on travel time predictions. Here, three cases, namely, travel time alone, travel time along with features such as time of the day, section index, and day of the week as numerical features and as a mix of categorical and numerical feature vectors, are studied. The effects of using each of these cases as clustering feature vectors on travel time predictions are evaluated. It is observed that the prediction accuracy is the highest when only travel times are used as a clustering feature vector. The study demonstrates the importance of choosing the correct feature vectors for clustering and its effect on a final application, namely, travel time prediction.

read more

References
More filters
Journal ArticleDOI

The Cosine-Haversine Formula

Proceedings ArticleDOI

VAT: a tool for visual assessment of (cluster) tendency

TL;DR: A method is given for visually assessing the cluster tendency of a set of Objects O when they are represented either as object vectors or by numerical pairwise dissimilarity values.
Journal ArticleDOI

Day-to-Day Travel-Time Trends and Travel-Time Prediction from Loop-Detector Data:

TL;DR: An approach is presented for estimating future travel times on a freeway using flow and occupancy data from single-loop detectors and historical travel-time information and linear regression, with the stepwise-variable-selection method and more advanced tree-based methods.
Book

Digital Design and Computer Architecture

TL;DR: Digital Design and Computer Architecture as discussed by the authors provides practical examples of how to interface with peripherals using RS232, SPI, motor control, interrupts, wireless, and analog-to-digital conversion.
Related Papers (5)