An effective spatial-temporal attention based neural network for traffic flow prediction

doi:10.1016/J.TRC.2019.09.008

Journal ArticleDOI

An effective spatial-temporal attention based neural network for traffic flow prediction

Loan N.N. Do, +4 more

- 01 Nov 2019 -

Transportation Research Part C-emerging ...

- Vol. 108, pp 12-28

TLDR

A deep learning based traffic flow predictor with spatial and temporal attentions (STANN) is proposed, which is demonstrated to have potential for improving the understanding of spatial-temporal correlations in a traffic network.

Abstract:

Due to its importance in Intelligent Transport Systems (ITS), traffic flow prediction has been the focus of many studies in the last few decades. Existing traffic flow prediction models mainly extract static spatial-temporal correlations, although these correlations are known to be dynamic in traffic networks. Attention-based models have emerged in recent years, mostly in the field of natural language processing, and have resulted in major progresses in terms of both accuracy and interpretability. This inspires us to introduce the application of attentions for traffic flow prediction. In this study, a deep learning based traffic flow predictor with spatial and temporal attentions (STANN) is proposed. The spatial and temporal attentions are used to exploit the spatial dependencies between road segments and temporal dependencies between time steps respectively. Experiment results with a real-world traffic dataset demonstrate the superior performance of the proposed model. The results also show that the utilization of multiple data resolutions could help improve prediction accuracy. Furthermore, the proposed model is demonstrated to have potential for improving the understanding of spatial-temporal correlations in a traffic network.

Citations

PDF

Open Access

More filters

Repository

Forecasting: theory and practice

Fotios Petropoulos, +84 more

- 04 Dec 2020 -

arXiv: Applications

TL;DR: A non-systematic review of the theory and the practice of forecasting, offering a wide range of theoretical, state-of-the-art models, methods, principles, and approaches to prepare, produce, organise, and evaluate forecasts.

...read moreread less

Journal ArticleDOI

A Survey on Modern Deep Neural Network for Traffic Prediction: Trends, Methods and Challenges

David Alexander Tedjopurnomo, +4 more

- 09 Jun 2020 -

IEEE Transactions on Knowledge and Data ...

TL;DR: A detailed explanation of popular deep neural network architectures commonly used in the traffic flow prediction literatures, categorize and describe the literatures themselves, and present an overview of the commonalities and differences among different works are presented.

...read moreread less

Journal ArticleDOI

Forecasting: theory and practice

None Sebastian

- 01 Jul 2022 -

International Journal of Forecasting

TL;DR: In this paper , the authors provide an overview of a wide range of theoretical, state-of-the-art models, methods, principles, and approaches to prepare, produce, organize, and evaluate forecasts.

...read moreread less

Journal ArticleDOI

Machine Learning-based traffic prediction models for Intelligent Transportation Systems

Azzedine Boukerche, +1 more

- 09 Nov 2020 -

Computer Networks

TL;DR: A clear and thorough review of different ML models is built up, and the advantages and disadvantages of these ML models are analyzed, based on the ML theory they use, to have a macro overview of what types of ML methods are good at what type of prediction tasks according to their unique model features.

...read moreread less

Journal ArticleDOI

Spatial-Temporal Graph Attention Networks: A Deep Learning Approach for Traffic Forecasting

Chenhan Zhang, +2 more

- 18 Nov 2019 -

IEEE Access

TL;DR: A novel deep learning framework, Spatial-Temporal Graph Attention Networks (ST-GAT), a graph attention mechanism is adopted to extract the spatial dependencies among road segments and a LSTM network is introduced to extract temporal domain features.

...read moreread less

Collapse

References

PDF

Open Access

More filters

Proceedings Article

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

TL;DR: This work introduces Adam, an algorithm for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimates of lower-order moments, and provides a regret bound on the convergence rate that is comparable to the best known results under the online convex optimization framework.

...read moreread less

Proceedings Article

ImageNet Classification with Deep Convolutional Neural Networks

Alex Krizhevsky, +2 more

TL;DR: The state-of-the-art performance of CNNs was achieved by Deep Convolutional Neural Networks (DCNNs) as discussed by the authors, which consists of five convolutional layers, some of which are followed by max-pooling layers, and three fully-connected layers with a final 1000-way softmax.

...read moreread less

Posted Content

Adam: A Method for Stochastic Optimization

Diederik P. Kingma, +1 more

- 22 Dec 2014 -

arXiv: Learning

TL;DR: In this article, the adaptive estimates of lower-order moments are used for first-order gradient-based optimization of stochastic objective functions, based on adaptive estimate of lowerorder moments.

...read moreread less

Proceedings ArticleDOI

Learning Phrase Representations using RNN Encoder--Decoder for Statistical Machine Translation

Kyunghyun Cho, +8 more

TL;DR: In this paper, the encoder and decoder of the RNN Encoder-Decoder model are jointly trained to maximize the conditional probability of a target sequence given a source sequence.

...read moreread less

Posted Content

Sequence to Sequence Learning with Neural Networks

Ilya Sutskever, +2 more

- 10 Sep 2014 -

arXiv: Computation and Language

TL;DR: This paper presents a general end-to-end approach to sequence learning that makes minimal assumptions on the sequence structure, and finds that reversing the order of the words in all source sentences improved the LSTM's performance markedly, because doing so introduced many short term dependencies between the source and the target sentence which made the optimization problem easier.

...read moreread less