Abstract
Computational predictors of immunogenic peptides, or epitopes, are traditionally built based on data from a broad range of pathogens without consideration for taxonomic information. While this approach may be reasonable if one aims to develop one-size-fits-all models, it may be counterproductive if the proteins for which the model is expected to generalize are known to come from a specific subset of phylogenetically related pathogens. There is mounting evidence that, for these cases, taxon-specific models can outperform generalist ones, even when trained with substantially smaller amounts of data. In this comment, we provide some perspective on the current state of taxon-specific modelling for the prediction of linear B-cell epitopes, and the challenges faced when building and deploying these predictors.
Original language | English |
---|---|
Article number | bbae092 |
Journal | Briefings in Bioinformatics |
Volume | 25 |
Issue number | 2 |
DOIs | |
Publication status | Published - 1 Mar 2024 |
Bibliographical note
Publisher Copyright:© 2024 The Author(s). Published by Oxford University Press.
Keywords
- data mining
- epitope prediction
- machine learning
- phylogeny-aware modelling