Predictions on public transport ridership are beneficial as they allow for sufficient and cost-efficient deployment of vehicles. At an operational level, this relates to short-term predictions with lead times of less than an hour. Where conventional data sources on ridership, such as Automatic Fare Collection (AFC) data, may have longer lag times, in contrast, trip planner data is often available in (near) real-time. This paper analyzes how such data from a trip planner app can be utilized for short-term bus ridership predictions. This is combined with AFC data (in this case smart card data) to construct a ground-truth on actual ridership. The trip planner data is studied using correlation analysis to select informative variables, that are then used to develop 4 supervised machine learning models (linear, k-nearest neighbors, random forest, and gradient boosting decision tree). The best performing model relies on random forest regression and reduces the error by approximately half compared to a baseline model based on the weekly trend. We show that this model performance is maintained even for prediction lead times up to 30 minutes ahead, and for different periods of the day.
|Title of host publication||Proceedings of the 15th International Conference on Advanced Systems in Public Transport (CASPT2022)|
|Number of pages||24|
|Publication status||Published - 2022|
|Event||CASPT2022: 15th International Conference on Advanced Systems in Public Transport - Tel Aviv, Israel|
Duration: 6 Nov 2022 → 10 Nov 2022
|Conference||CASPT2022: 15th International Conference on Advanced Systems in Public Transport|
|Period||6/11/22 → 10/11/22|
Bibliographical noteGreen Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care
Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.
- Public Transport
- Trip Planner
- Bus Ridership Prediction
- Machine Learning