Borkenhagen LK, Runstadler JA. Examining the Influenza A Virus Sialic Acid Binding Preference Predictions of a Sequence-Based Convolutional Neural Network. Influenza Other Respir Viruses. 2024 Dec;18(12):e7
Background: Though receptor binding specificity is well established as a contributor to host tropism and spillover potential of influenza A viruses, determining receptor binding preference of a specific virus still requires expensive and time-consuming laboratory analyses. In this study, we pilot a machine learning approach for prediction of binding preference.
Methods: We trained a convolutional neural network to predict the α2,6-linked sialic acid preference of influenza A viruses given the hemagglutinin amino acid sequence. The model was evaluated with an independent test dataset to assess the standard performance metrics, the impact of missing data in the test sequences, and the prediction performance on novel subtypes. Further, features found to be important to the generation of predictions were tested via targeted mutagenesis of H9 and H16 proteins expressed on pseudoviruses.
Results: The final model developed in this study produced predictions on a test dataset correctly 94% of the time and an area under the receiver operating characteristic curve of 0.93. The model tolerated about 10% missing test data without compromising accurate prediction performance. Predictions on novel subtypes revealed that the model can extrapolate feature relationships between subtypes when generating binding predictions. Finally, evaluation of the features important for model predictions helped identify positions that alter the sialic acid conformation preference of hemagglutinin proteins in practice.
Conclusions: Ultimately, our results provide support to this in silico approach to hemagglutinin receptor binding preference prediction. This work emphasizes the need for ongoing research efforts to produce tools that may aid future pandemic risk assessment.
Methods: We trained a convolutional neural network to predict the α2,6-linked sialic acid preference of influenza A viruses given the hemagglutinin amino acid sequence. The model was evaluated with an independent test dataset to assess the standard performance metrics, the impact of missing data in the test sequences, and the prediction performance on novel subtypes. Further, features found to be important to the generation of predictions were tested via targeted mutagenesis of H9 and H16 proteins expressed on pseudoviruses.
Results: The final model developed in this study produced predictions on a test dataset correctly 94% of the time and an area under the receiver operating characteristic curve of 0.93. The model tolerated about 10% missing test data without compromising accurate prediction performance. Predictions on novel subtypes revealed that the model can extrapolate feature relationships between subtypes when generating binding predictions. Finally, evaluation of the features important for model predictions helped identify positions that alter the sialic acid conformation preference of hemagglutinin proteins in practice.
Conclusions: Ultimately, our results provide support to this in silico approach to hemagglutinin receptor binding preference prediction. This work emphasizes the need for ongoing research efforts to produce tools that may aid future pandemic risk assessment.
See Also:
Latest articles in those days:
- Intranasal influenza virus-vectored vaccine offers protection against clade 2.3.4.4b H5N1 infection in small animal models 5 hours ago
- Mapping of stakeholders in avian influenza surveillance in Canada 17 hours ago
- [preprint]Population Immunity to Hemagglutinin Head, Stalk and Neuraminidase of Highly Pathogenic Avian Influenza 2.3.4.4b A(H5N1) viruses in the United States and the Impact of Seasonal Influenza on 1 days ago
- Airborne Influenza Virus Surveillance Platform Using Paper-Based Immunosensors and a Growth-Based Virus Aerosol Concentrator 1 days ago
- [preprint]A Human H5N1 Influenza Virus Expressing Bioluminescence for Evaluating Viral Infection and Identifying Therapeutic Interventions 2 days ago
[Go Top] [Close Window]