Rongye Ye, Lun Li, Shuhui Song. [preprint] Influ-BERT: An Interpretable Model for Enhancing Low-Frequency Influenza A virus Subtype Recognition. https://doi.org/10.1101/2025.07.31.667841
Influenza A Virus (IAV) poses a continuous threat to global public health due to its wide host adaptability, high-frequency antigenic variation, and potential for cross-species transmission. Accurate recognition of IAV subtypes is crucial for early pandemic warning. Here, we propose Influ-BERT, a domain-adaptive pretraining model based on the transformer architecture. To adapt DNABERT-2 to influenza, we constructed a dedicated corpus of approximately 900,000 influenza genome sequences, developed a custom Byte Pair Encoding (BPE) tokenizer, and employed a two-stage training strategy involving domain-adaptive pretraining followed by task-specific fine-tuning. This approach significantly enhanced recognition performance for low-frequency subtypes. Experimental results demonstrate that Influ-BERT outperforms traditional machine learning methods and general genomic language models (DNABERT-2, MegaDNA) in subtype recognition, achieving a substantial improvement in F1-score, particularly for subtypes H5N8, H5N1, H7N9, and H9N2. Furthermore, sliding window perturbation analysis revealed the model's specific focus on key regions of the IAV genome, providing interpretable evidence supporting the observed performance gains.
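The abstract does not describe how the sliding window perturbation analysis was implemented, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes the perturbation is masking a window with "N" and that importance is measured as the drop in a model score; the function names, the window and stride values, and the toy scoring function (a stand-in for a classifier's confidence in the true subtype) are all hypothetical.

```python
from typing import Callable, List, Tuple


def sliding_window_perturbation(
    seq: str,
    score_fn: Callable[[str], float],
    window: int = 4,
    stride: int = 2,
    mask_char: str = "N",
) -> List[Tuple[int, float]]:
    """Return (window start, score drop) for each masked window.

    A larger drop suggests the scorer relies more on that region.
    """
    baseline = score_fn(seq)
    drops = []
    for start in range(0, len(seq) - window + 1, stride):
        # Replace one window with the mask character, keep the rest intact.
        perturbed = seq[:start] + mask_char * window + seq[start + window:]
        drops.append((start, baseline - score_fn(perturbed)))
    return drops


def toy_score(seq: str) -> float:
    # Hypothetical stand-in for a trained model: it is "confident" (1.0)
    # only when an arbitrary motif is present in the sequence.
    return 1.0 if "GATTACA" in seq else 0.0


# Toy sequence with the motif at positions 4 to 10.
example = "AAAA" + "GATTACA" + "CCCC"
for start, drop in sliding_window_perturbation(example, toy_score):
    print(start, drop)
```

With a real classifier, `score_fn` would wrap tokenization and a forward pass, and the drops could be plotted along the genome segment to highlight the regions the model depends on.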


