scispace - formally typeset
Search or ask a question
Journal ArticleDOI

Classifying travelers' driving style using basic safety messages generated by connected vehicles: application of unsupervised machine learning

01 Jan 2021-Transportation Research Part C-emerging Technologies (Elsevier Publishing)-Vol. 122, pp 102917
TL;DR: A framework that harnesses Basic Safety Messages generated by connected vehicles to quantify instantaneous driving behavior and classify driving styles in different spatial contexts using unsupervised machine learning methods is developed.
Abstract: Driving style can substantially impact mobility, safety, energy consumption, and vehicle emissions. While a range of methods has been used in the past for driving style classification, the emergence of connected vehicles equipped with communication devices provides a new opportunity to classify driving style using high-resolution (10 Hz) microscopic real-world data. In this study, location-based big data and machine learning are used to classify driving styles ranging from aggressive to calm. This classification can be used to customize driver assistance systems, assess mobility, crash risk, fuel consumption, and emissions. This study’s main objective is to develop a framework that harnesses Basic Safety Messages (BSMs) generated by connected vehicles to quantify instantaneous driving behavior and classify driving styles in different spatial contexts using unsupervised machine learning methods. To this end, a subset of the Safety Pilot Model Deployment (SPMD) with more than 27 million BSM observations generated by more than 1300 individuals making trips on diverse roadways and through several neighborhoods in Ann Arbor, Michigan, were processed and analyzed. To quantify driving style, the concept of temporal driving volatility, as a surrogate safety measure of unsafe driving behavior, was utilized and applied to vehicle kinematics, i.e., observed speeds and longitudinal/lateral accelerations. Specifically, six volatility measures are extracted and used for classifying drivers. K-means and K-medoids methods are applied for grouping drivers in aggressive, normal, and calm clusters. Clustering results indicate that not only does driving style vary among drivers, but the thresholds for aggressive and calm driving vary across different roadway types due to variations in environment and road conditions. The proportion of aggressive driving styles was also higher on commercial streets than on highways and residential streets. Notably, we propose a Driving Score to measure driving performance consistently across drivers.
Citations
More filters
Journal ArticleDOI
TL;DR: In this article, the authors investigated the effects of AVs on the behavior of a following human-driver in mixed traffic streams and found that a driver that follows an AV exhibits lower driving volatility in terms of speed and acceleration, which represents more stable traffic flow behavior and lower crash risk.

35 citations

Journal ArticleDOI
TL;DR: In this article, the authors evaluated the impact of various AV Market Penetration Rates (MPR) on the safety and operation of urban arterials in proximity of a driveway under different traffic levels of service (LOS).

31 citations

Journal ArticleDOI
TL;DR: Real-time risk assessment studies have investigated a limited length of corridors, however, the necessity of assessing the safety performance of Connected Vehicles (CVs) requires looking into an en...
Abstract: Real-time risk assessment studies have investigated a limited length of corridors. However, the necessity of assessing the safety performance of Connected Vehicles (CVs) requires looking into an en...

28 citations

Journal ArticleDOI
TL;DR: In this article, a 1D-Convolutional neural network (1D-CNN), LSTM and 1DCNN-LSTM were used to predict the occurrence of safety critical events and generate appropriate feedback to drivers and surrounding vehicles.

26 citations

Journal ArticleDOI
TL;DR: In this article, the authors applied a new methodology to capture variation in crashes in both space and time by using Geographically and Temporally Weighted Regression (GTWR) models for the localization of SPFs.

18 citations

References
More filters
Journal ArticleDOI
01 Jun 2010
TL;DR: A brief overview of clustering is provided, well known clustering methods are summarized, the major challenges and key issues in designing clustering algorithms are discussed, and some of the emerging and useful research directions are pointed out.
Abstract: Organizing data into sensible groupings is one of the most fundamental modes of understanding and learning. As an example, a common scheme of scientific classification puts organisms into a system of ranked taxa: domain, kingdom, phylum, class, etc. Cluster analysis is the formal study of methods and algorithms for grouping, or clustering, objects according to measured or perceived intrinsic characteristics or similarity. Cluster analysis does not use category labels that tag objects with prior identifiers, i.e., class labels. The absence of category information distinguishes data clustering (unsupervised learning) from classification or discriminant analysis (supervised learning). The aim of clustering is to find structure in data and is therefore exploratory in nature. Clustering has a long and rich history in a variety of scientific fields. One of the most popular and simple clustering algorithms, K-means, was first published in 1955. In spite of the fact that K-means was proposed over 50 years ago and thousands of clustering algorithms have been published since then, K-means is still widely used. This speaks to the difficulty in designing a general purpose clustering algorithm and the ill-posed problem of clustering. We provide a brief overview of clustering, summarize well known clustering methods, discuss the major challenges and key issues in designing clustering algorithms, and point out some of the emerging and useful research directions, including semi-supervised clustering, ensemble clustering, simultaneous feature selection during data clustering, and large scale data clustering.

6,601 citations

Journal ArticleDOI
TL;DR: Experimental results show that the proposed algorithm takes a significantly reduced time in computation with comparable performance against the partitioning around medoids.
Abstract: This paper proposes a new algorithm for K-medoids clustering which runs like the K-means algorithm and tests several methods for selecting initial medoids. The proposed algorithm calculates the distance matrix once and uses it for finding new medoids at every iterative step. To evaluate the proposed algorithm, we use some real and artificial data sets and compare with the results of other algorithms in terms of the adjusted Rand index. Experimental results show that the proposed algorithm takes a significantly reduced time in computation with comparable performance against the partitioning around medoids.

1,629 citations

Journal ArticleDOI
TL;DR: Qualitative issues relevant to the study of differential crash involvement and the findings of research in this area are considered and the ways in which research in the area might usefully proceed are reviewed.
Abstract: This article considers methodological issues relevant to the study of differential crash involvement and reviews the findings of research in this area. Aspects of both driving skill and driving style appear to contribute to crash risk. Of the former, hazard-perception latency appears to play an important role, and this may be attributable to generalized abilities to identify visual targets in a complex background and to switch attention rapidly. Of the latter, faster driving speed and willingness to commit driving violations increase crash risk, and these factors may be explicable in terms of personality and antisocial motivation. The article concludes with an examination of the practical implications and of the ways in which research in this area might usefully proceed.

698 citations

Journal ArticleDOI
TL;DR: The behavioral validation of an advanced driving simulator for its use in evaluating speeding countermeasures was performed for mean speed, with participants generally drove faster in the instrumented car than the simulator, resulting in absolute validity not being established.

473 citations