Increasing availability of large-scale datasets for automatic vehicle location (AVL) in public transportation (PT) encouraged researchers to investigate data-driven punctuality prediction models (PPMs). PPMs promise to accelerate the mobility transition through more accurate prediction delays, increased customer service levels, and more efficient and forward-looking planning by mobility providers. While several PPMs show promising results for buses and long-distance trains, a comprehensive study on external factors' effect on tram services is missing. Therefore, we implement four machine learning (ML) models to predict departure delays and elaborate on the performance increase by adding real-world weather and holiday data for three consecutive years. For our best model (XGBoost) the average MAE performance increased by 17.33 % compared to the average model performance when only trained on AVL data enriched by timetable characteristics. The results provide strong evidence that adding information-bearing features improves the forecast quality of PPMs.

Paper Number



Track 20: Prescriptive analytics, mobility & logistics