TY - JOUR
T1 - Organism-Specific Training Improves Performance of Linear B-Cell Epitope Prediction
AU - Ashford, Jodie
AU - Reis-Cunha, João
AU - Lobo, Igor
AU - Pereira Lobo, Francisco
AU - Campelo, Felipe
PY - 2021/12/1
Y1 - 2021/12/1
N2 - Motivation: In silico identification of linear B-cell epitopes represents an important step in the development of diagnostic tests and vaccine candidates, by providing potential high-probability targets for experimental investigation. Current predictive tools were developed under a generalist approach, training models with heterogeneous datasets to develop predictors that can be deployed for a wide variety of pathogens. However, continuous advances in processing power and the increasing amount of epitope data for a broad range of pathogens indicate that training organism or taxonspecific models may become a feasible alternative, with unexplored potential gains in predictive performance. Results: This article shows how organism-specific training of epitope prediction models can yield substantial performance gains across several quality metrics when compared to models trained with heterogeneous and hybrid data, and with a variety of widely used predictors from the literature. These results suggest a promising alternative for the development of custom-tailored predictive models with high predictive power, which can be easily implemented and deployed for the investigation of specific pathogens.
AB - Motivation: In silico identification of linear B-cell epitopes represents an important step in the development of diagnostic tests and vaccine candidates, by providing potential high-probability targets for experimental investigation. Current predictive tools were developed under a generalist approach, training models with heterogeneous datasets to develop predictors that can be deployed for a wide variety of pathogens. However, continuous advances in processing power and the increasing amount of epitope data for a broad range of pathogens indicate that training organism or taxonspecific models may become a feasible alternative, with unexplored potential gains in predictive performance. Results: This article shows how organism-specific training of epitope prediction models can yield substantial performance gains across several quality metrics when compared to models trained with heterogeneous and hybrid data, and with a variety of widely used predictors from the literature. These results suggest a promising alternative for the development of custom-tailored predictive models with high predictive power, which can be easily implemented and deployed for the investigation of specific pathogens.
UR - https://research.aston.ac.uk/en/publications/07b81b1b-4bde-4550-851f-2a88b271274c
U2 - 10.1093/bioinformatics/btab536
DO - 10.1093/bioinformatics/btab536
M3 - Article (Academic Journal)
C2 - 34289025
SN - 1367-4811
VL - 37
SP - 4826–4834,
JO - Bioinformatics
JF - Bioinformatics
IS - 24
ER -