Wang, Y., Zhai, S., Wu, C. et al. Dynamic ensemble deep learning with multi-source data for robust influenza forecasting in Yangzhou. BMC Public Health (2025)
Background: Traditional influenza surveillance often suffers from reporting delays, hindering timely public health response. This study aimed to mitigate this limitation by developing an accurate deep learning framework for influenza prediction.
Methods: We constructed a 13-year (652-week) multi-source dataset for Yangzhou City, China, integrating influenza-like illness (ILI) surveillance data with Baidu search indices and meteorological variables. Six state-of-the-art deep learning models-GRU, Transformer, LSTM, TFT (Temporal Fusion Transformer), TCN (Temporal Convolutional Network), and N-BEATS-were systematically compared under 1-, 5-, and 9-week sliding time windows. Based on their robust performance across windows, GRU, TCN, and Transformer were selected as core base learners. Using these models, we designed several ensemble strategies, culminating in a dynamic weighted ensemble with seasonal residual adjustment (DWE + SRA).
Results: Multi-source data integration improved predictive accuracy relative to using surveillance data alone. Forecasting performance varied with the sliding time window: under a 1-week window, all models showed comparable accuracy, with GRU achieving the lowest RMSE and highest R2 and Transformer the lowest MAE; at 5 weeks, TCN achieved the best overall performance, while Transformer, and GRU also maintained relatively good accuracy; at 9 weeks, GRU, TCN, and Transformer remained competitive, whereas TFT and N-BEATS degraded. Overall, GRU, TCN, and Transformer exhibited the most robust performance across window lengths. The proposed DWE + SRA strategy, built on these three base learners, further enhanced forecasting accuracy and stability, reducing test-set RMSE and MAE by approximately 28% and 17%, respectively, compared with the best single model (GRU), and closely tracking observed ILI dynamics during both peak and off-peak periods.
Conclusion: This study presents a multi-source deep learning framework that effectively integrates heterogeneous data to compensate for surveillance delays. Key contributions include: (i) a systematic sliding-window comparison that reveals the temporal strengths of different architectures; and (ii) a DWE + SRA ensemble strategy that dynamically adjusts model weights and corrects seasonal biases to substantially improve prediction stability. This work provides a scalable, data-driven paradigm for localized influenza forecasting and early warning.
Methods: We constructed a 13-year (652-week) multi-source dataset for Yangzhou City, China, integrating influenza-like illness (ILI) surveillance data with Baidu search indices and meteorological variables. Six state-of-the-art deep learning models-GRU, Transformer, LSTM, TFT (Temporal Fusion Transformer), TCN (Temporal Convolutional Network), and N-BEATS-were systematically compared under 1-, 5-, and 9-week sliding time windows. Based on their robust performance across windows, GRU, TCN, and Transformer were selected as core base learners. Using these models, we designed several ensemble strategies, culminating in a dynamic weighted ensemble with seasonal residual adjustment (DWE + SRA).
Results: Multi-source data integration improved predictive accuracy relative to using surveillance data alone. Forecasting performance varied with the sliding time window: under a 1-week window, all models showed comparable accuracy, with GRU achieving the lowest RMSE and highest R2 and Transformer the lowest MAE; at 5 weeks, TCN achieved the best overall performance, while Transformer, and GRU also maintained relatively good accuracy; at 9 weeks, GRU, TCN, and Transformer remained competitive, whereas TFT and N-BEATS degraded. Overall, GRU, TCN, and Transformer exhibited the most robust performance across window lengths. The proposed DWE + SRA strategy, built on these three base learners, further enhanced forecasting accuracy and stability, reducing test-set RMSE and MAE by approximately 28% and 17%, respectively, compared with the best single model (GRU), and closely tracking observed ILI dynamics during both peak and off-peak periods.
Conclusion: This study presents a multi-source deep learning framework that effectively integrates heterogeneous data to compensate for surveillance delays. Key contributions include: (i) a systematic sliding-window comparison that reveals the temporal strengths of different architectures; and (ii) a DWE + SRA ensemble strategy that dynamically adjusts model weights and corrects seasonal biases to substantially improve prediction stability. This work provides a scalable, data-driven paradigm for localized influenza forecasting and early warning.
See Also:
Latest articles in those days:
- [preprint]Emergence and antigenic characterisation of influenza A(H3N2) viruses with hemagglutinin substitutions N158K and K189R during the 2024/25 influenza season 16 hours ago
- Epitope specificity shapes the CD4+ T cell response to influenza virus infection in mice 16 hours ago
- Vaccination against H5 HP avian influenza virus leads to persistent immune response in wild king penguins 1 days ago
- Molecular Epidemiology and Genetic Diversity of Influenza B Viruses Based on Whole-Genome Analysis in Japan and Myanmar, 2016-2020 1 days ago
- Assessing HPAI-H5 transmission risk across wild bird migratory flyways in the United States 1 days ago
[Go Top] [Close Window]


