TY - UNPB
T1 - Conventional and Bayesian workflows for clinical prediction modelling of severe Covid-19 outcomes based on clinical biomarker test results
T2 - LabMarCS: Laboratory Markers of COVID-19 Severity - Bristol Cohort
AU - Sullivan, B
AU - Barker, E
AU - Williams, P
AU - MacGregor, L
AU - Bhamber, R
AU - Thomas, M
AU - Gurney, S
AU - Hyams, C
AU - Whiteway, A
AU - Cooper, JA
AU - McWilliams, C
AU - Turner, K
AU - Dowsey, AW
AU - Albur, M
PY - 2023/4/26
Y1 - 2023/4/26
N2 - We describe several regression models to predict severe outcomes in COVID-19 and challenges present in complex observational medical data. We demonstrate best practices for data curation, cross-validated statistical modelling, and variable selection emphasizing recent Bayesian methods. The study follows a retrospective observational cohort design using multicentre records across National Health Service (NHS) trusts in southwest England, UK. Participants included hospitalised adult patients positive for SARS-CoV 2 during March to October 2020, totalling 843 patients (mean age 71, 45% female, 32% died or needed ICU stay),split into training (n=590) and validation groups (n=253). Models were fit to predict severe outcomes (ICU admission or death within 28-days of admission to hospital for COVID-19, or a positive PCR result if already admitted) using demographic data and initial results from 30 biomarker tests collected within 3 days of admission or testing positive if already admitted. Cross-validation results showed standard logistic regression had an internal validation median AUC of 0.74 (95% Interval [0.62,0.83]), and external validation AUC of 0.68 [0.61, 0.71]; a Bayesian logistic regression (with horseshoe prior) internal AUC of 0.79 [0.71, 0.87], and external AUC of 0.70 [0.68, 0.71]. Variable selection performed using Bayesian predictive projection determined a four variable model using Age, Urea, Prothrombin time and Neutrophil-Lymphocyte ratio, with a median internal AUC of 0.79 [0.78, 0.80], and external AUC of 0.67 [0.65, 0.69]. We illustrate best-practices protocol for conventional and Bayesian prediction modelling on complex clinical data and reiterate the predictive value of previously identified biomarkers for COVID-19 severity assessment.
AB - We describe several regression models to predict severe outcomes in COVID-19 and challenges present in complex observational medical data. We demonstrate best practices for data curation, cross-validated statistical modelling, and variable selection emphasizing recent Bayesian methods. The study follows a retrospective observational cohort design using multicentre records across National Health Service (NHS) trusts in southwest England, UK. Participants included hospitalised adult patients positive for SARS-CoV 2 during March to October 2020, totalling 843 patients (mean age 71, 45% female, 32% died or needed ICU stay),split into training (n=590) and validation groups (n=253). Models were fit to predict severe outcomes (ICU admission or death within 28-days of admission to hospital for COVID-19, or a positive PCR result if already admitted) using demographic data and initial results from 30 biomarker tests collected within 3 days of admission or testing positive if already admitted. Cross-validation results showed standard logistic regression had an internal validation median AUC of 0.74 (95% Interval [0.62,0.83]), and external validation AUC of 0.68 [0.61, 0.71]; a Bayesian logistic regression (with horseshoe prior) internal AUC of 0.79 [0.71, 0.87], and external AUC of 0.70 [0.68, 0.71]. Variable selection performed using Bayesian predictive projection determined a four variable model using Age, Urea, Prothrombin time and Neutrophil-Lymphocyte ratio, with a median internal AUC of 0.79 [0.78, 0.80], and external AUC of 0.67 [0.65, 0.69]. We illustrate best-practices protocol for conventional and Bayesian prediction modelling on complex clinical data and reiterate the predictive value of previously identified biomarkers for COVID-19 severity assessment.
U2 - 10.1101/2022.09.16.22279985
DO - 10.1101/2022.09.16.22279985
M3 - Preprint
BT - Conventional and Bayesian workflows for clinical prediction modelling of severe Covid-19 outcomes based on clinical biomarker test results
ER -