Xinyue Zhang, Xinyi Sang, Beibei Liu, Quanyu Wang,. Methods and Applications: Developing Machine Learning Prediction Model for Daily Influenza Reported Cases Using Multichannel Surveillance Data - A City, Hubei Province, China, 2023~2025. China CDC Weekly, 2025, 7(44): 1396-1402
Introduction
Public health surveillance is crucial for decision-making. Given the significant threat of influenza to public health, developing predictive models using multichannel surveillance systems is imperative.
Methods
Data were collected from multichannel surveillance systems, including hospitals, search engines, and climatological and air pollutant surveillance systems, in a southern Chinese city from January 2023 to January 2025. Spearman’s correlation analysis assessed the relationships between variables and reported influenza cases. Several machine learning models were used to predict trends in reported cases.
Results
Correlation analysis showed that all four surveillance systems were related to influenza, with 27 variables correlated with daily reported cases. The Long Short-Term Memory model, established based on variables with the highest lagged correlations (5-day to 7-day lag) through combined surveillance systems, outperformed other models for 5-day forecasts (R2=0.92; mean absolute error=156.92; mean absolute percentage error=79.95%; root Mean Squared Error=292.33).
Conclusions
Data from various surveillance systems effectively track influenza epidemics. The model shows potential for infectious disease surveillance and epidemic preparedness.
Public health surveillance is crucial for decision-making. Given the significant threat of influenza to public health, developing predictive models using multichannel surveillance systems is imperative.
Methods
Data were collected from multichannel surveillance systems, including hospitals, search engines, and climatological and air pollutant surveillance systems, in a southern Chinese city from January 2023 to January 2025. Spearman’s correlation analysis assessed the relationships between variables and reported influenza cases. Several machine learning models were used to predict trends in reported cases.
Results
Correlation analysis showed that all four surveillance systems were related to influenza, with 27 variables correlated with daily reported cases. The Long Short-Term Memory model, established based on variables with the highest lagged correlations (5-day to 7-day lag) through combined surveillance systems, outperformed other models for 5-day forecasts (R2=0.92; mean absolute error=156.92; mean absolute percentage error=79.95%; root Mean Squared Error=292.33).
Conclusions
Data from various surveillance systems effectively track influenza epidemics. The model shows potential for infectious disease surveillance and epidemic preparedness.
See Also:
Latest articles in those days:
- Analysis of the gene sequences of two cases of human infection with avian influenza H9N2 in Guizhou province in 2024 1 hours ago
- Influenza hijacks myeloid cells to inflict type-I interferon-fueled damage in the heart 2 hours ago
- MHC class II functions as a host-specific entry receptor for representative human and swine H3N2 influenza A viruses 2 hours ago
- Longitudinal Surveillance of Influenza A Virus Exposure in Wild Boars (Sus scrofa) in Spain (2015-2023): Serologic and Virologic Evidence of Subtype Infections and H5N1 Spillover Risk 2 hours ago
- [preprint]Emergence and antigenic characterisation of influenza A(H3N2) viruses with hemagglutinin substitutions N158K and K189R during the 2024/25 influenza season 21 hours ago
[Go Top] [Close Window]


