Abstract
Urban bus transit agencies need reliable, network-wide delay predictions to provide accurate arrival information to passengers and support real-time operational control. Accurate predictions help passengers plan their trips, reduce waiting time, and allow operations staff to adjust headways, dispatch additional vehicles, and manage disruptions. Although real-time feeds such as GTFS-RT are now widely available, most existing delay prediction systems handle only a few routes, rely on hand-crafted features, and offer little guidance on designing a scalable, reusable architecture. We present a city-scale prediction pipeline that combines multi-resolution feature engineering, dimensionality reduction, and deep learning. The framework systematically generates spatiotemporal features by exploring aggregation combinations over spatial regions (using hexagonal hierarchical indexing), routes, segments, and temporal patterns, then compresses them using Adaptive PCA while preserving 95 % of the variance. To avoid the “giant cluster” problem that occurs when dense urban areas fall into a single spatial region, we introduce a hybrid clustering method that combines geographic and network topology information to yield balanced route clusters and enable efficient distributed training. We compare five model architectures on six months of bus operations from the Société de transport de Montréal (STM) network in Montréal. A global LSTM with cluster-aware features achieves the best trade-off between accuracy and efficiency (R2=0.7121 at the elementary level), outperforming XGBoost by 9.3 %, xLSTM by 5.3 %, and Autoformer by 43 % in terms of R2, while achieving comparable accuracy to PatchTST (R2=0.7043) with 77× fewer parameters. LSTM's compact architecture (31,000 parameters) effectively captures short-term temporal dependencies in the compressed feature space, making it more suitable than transformer models, which are overparameterized for this task. We also report multi-level evaluation at the elementary segment, segment, and trip level using walk-forward validation and latency analysis, showing that the proposed pipeline is suitable for real-time, city-scale deployment and can be reused for other networks with limited adaptation.
| Original language | English |
|---|---|
| Article number | 103811 |
| Journal | Journal of Systems Architecture |
| Volume | 176 |
| DOIs | |
| Publication status | Published - Jul 2026 |
!!!Keywords
- Feature engineering
- GTFS-realtime
- H3 geospatial indexing
- Intelligent transportation systems
- Scalability
- Transit delay prediction
- Urban mobility
Fingerprint
Dive into the research topics of 'Scalable transit delay prediction at city scale: A systematic approach with multi-resolution feature engineering and deep learning'. These topics are generated from the title and abstract of the publication. Together, they form a unique fingerprint.Cite this
- APA
- Author
- BIBTEX
- Harvard
- Standard
- RIS
- Vancouver