Benchmarking Travel Time and Demand Prediction Methods Using Large-scale Metro Smart Card Data

Iyad Zimmo; Daniel Hörcher; Ramandeep Singh; Daniel J. Graham

doi:10.3311/PPtr.22252

Authors

Iyad Zimmo

Affiliation

Transport Strategy Centre, Department of Civil and Environmental Engineering, Faculty of Engineering, Imperial College London, South Kensington Campus, London SW7 2AZ, United Kingdom
Daniel Hörcher

Affiliation

Transport Strategy Centre, Department of Civil and Environmental Engineering, Faculty of Engineering, Imperial College London, South Kensington Campus, London SW7 2AZ, United Kingdom
Ramandeep Singh

Affiliation

Transport Strategy Centre, Department of Civil and Environmental Engineering, Faculty of Engineering, Imperial College London, South Kensington Campus, London SW7 2AZ, United Kingdom
Daniel J. Graham

Affiliation

Transport Strategy Centre, Department of Civil and Environmental Engineering, Faculty of Engineering, Imperial College London, South Kensington Campus, London SW7 2AZ, United Kingdom

Abstract

Urban mass transit systems generate large volumes of data via automated systems established for ticketing, signalling, and other operational processes. This study is motivated by the observation that despite the availability of sophisticated quantitative methods, most public transport operators are constrained in exploiting the information their datasets contain. This paper intends to address this gap in the context of real-time demand and travel time prediction with smart card data. We comparatively benchmark the predictive performance of four quantitative prediction methods: multivariate linear regression (MVLR) and semiparametric regression (SPR) widely used in the econometric literature, and random forest regression (RFR) and support vector machine regression (SVMR) from machine learning. We find that the SVMR and RFR methods are the most accurate in travel flow and travel time prediction, respectively. However, we also find that the SPR technique offers lower computation time at the expense of minor inefficiency in predictive power in comparison with the two machine learning methods.

Keywords:

machine learning, urban public transport, demand prediction, smart card data

Citation data from Crossref and Scopus

Benchmarking Travel Time and Demand Prediction Methods Using Large-scale Metro Smart Card Data

Authors

Abstract

Keywords:

Citation data from Crossref and Scopus

Published Online

How to Cite

Issue

Section

Make a Submission