scispace - formally typeset
Journal ArticleDOI

An effective spatial-temporal attention based neural network for traffic flow prediction

TLDR
A deep learning based traffic flow predictor with spatial and temporal attentions (STANN) is proposed, which is demonstrated to have potential for improving the understanding of spatial-temporal correlations in a traffic network.
Abstract
Due to its importance in Intelligent Transport Systems (ITS), traffic flow prediction has been the focus of many studies in the last few decades. Existing traffic flow prediction models mainly extract static spatial-temporal correlations, although these correlations are known to be dynamic in traffic networks. Attention-based models have emerged in recent years, mostly in the field of natural language processing, and have resulted in major progresses in terms of both accuracy and interpretability. This inspires us to introduce the application of attentions for traffic flow prediction. In this study, a deep learning based traffic flow predictor with spatial and temporal attentions (STANN) is proposed. The spatial and temporal attentions are used to exploit the spatial dependencies between road segments and temporal dependencies between time steps respectively. Experiment results with a real-world traffic dataset demonstrate the superior performance of the proposed model. The results also show that the utilization of multiple data resolutions could help improve prediction accuracy. Furthermore, the proposed model is demonstrated to have potential for improving the understanding of spatial-temporal correlations in a traffic network.

read more

Citations
More filters
Repository

Forecasting: theory and practice

Fotios Petropoulos, +84 more
- 04 Dec 2020 - 
TL;DR: A non-systematic review of the theory and the practice of forecasting, offering a wide range of theoretical, state-of-the-art models, methods, principles, and approaches to prepare, produce, organise, and evaluate forecasts.
Journal ArticleDOI

A Survey on Modern Deep Neural Network for Traffic Prediction: Trends, Methods and Challenges

TL;DR: A detailed explanation of popular deep neural network architectures commonly used in the traffic flow prediction literatures, categorize and describe the literatures themselves, and present an overview of the commonalities and differences among different works are presented.
Journal ArticleDOI

Forecasting: theory and practice

TL;DR: In this paper , the authors provide an overview of a wide range of theoretical, state-of-the-art models, methods, principles, and approaches to prepare, produce, organize, and evaluate forecasts.
Journal ArticleDOI

Machine Learning-based traffic prediction models for Intelligent Transportation Systems

TL;DR: A clear and thorough review of different ML models is built up, and the advantages and disadvantages of these ML models are analyzed, based on the ML theory they use, to have a macro overview of what types of ML methods are good at what type of prediction tasks according to their unique model features.
Journal ArticleDOI

Spatial-Temporal Graph Attention Networks: A Deep Learning Approach for Traffic Forecasting

TL;DR: A novel deep learning framework, Spatial-Temporal Graph Attention Networks (ST-GAT), a graph attention mechanism is adopted to extract the spatial dependencies among road segments and a LSTM network is introduced to extract temporal domain features.
References
More filters
Proceedings Article

Adam: A Method for Stochastic Optimization

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.
Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.
Posted Content

Adam: A Method for Stochastic Optimization

TL;DR: In this article, the adaptive estimates of lower-order moments are used for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimate of lowerorder moments.
Proceedings ArticleDOI

Learning Phrase Representations using RNN Encoder--Decoder for Statistical Machine Translation

TL;DR: In this paper, the encoder and decoder of the RNN Encoder-Decoder model are jointly trained to maximize the conditional probability of a target sequence given a source sequence.
Posted Content

Sequence to Sequence Learning with Neural Networks

TL;DR: This paper presents a general end-to-end approach to sequence learning that makes minimal assumptions on the sequence structure, and finds that reversing the order of the words in all source sentences improved the LSTM's performance markedly, because doing so introduced many short term dependencies between the source and the target sentence which made the optimization problem easier.
Related Papers (5)