Abstract
BACKGROUND: Heart failure (HF) with preserved or mildly reduced ejection fraction includes a heterogenous group of patients. Reclassification into distinct phenogroups to enable targeted interventions is a priority. This study aimed to identify distinct phenogroups, and compare phenogroup characteristics and outcomes, from electronic health record data.
METHODS: 2,187 patients admitted to five UK hospitals with a diagnosis of HF and a left ventricular ejection fraction ≥ 40% were identified from the NIHR Health Informatics Collaborative database. Partition-based, model-based, and density-based machine learning clustering techniques were applied. Cox Proportional Hazards and Fine-Gray competing risks models were used to compare outcomes (all-cause mortality and hospitalisation for HF) across phenogroups.
RESULTS: Three phenogroups were identified: (1) Younger, predominantly female patients with high prevalence of cardiometabolic and coronary disease; (2) More frail patients, with higher rates of lung disease and atrial fibrillation; (3) Patients characterised by systemic inflammation and high rates of diabetes and renal dysfunction. Survival profiles were distinct, with an increasing risk of all-cause mortality from phenogroups 1 to 3 (p < 0.001). Phenogroup membership significantly improved survival prediction compared to conventional factors. Phenogroups were not predictive of hospitalisation for HF.
CONCLUSIONS: Applying unsupervised machine learning to routinely collected electronic health record data identified phenogroups with distinct clinical characteristics and unique survival profiles.
Original language | English |
---|---|
Article number | 343 |
Journal | BMC Cardiovascular Disorders |
Volume | 24 |
DOIs | |
Publication status | Published - 5 Jul 2024 |
Bibliographical note
Publisher Copyright:© The Author(s) 2024.
Keywords
- Humans
- Heart Failure/physiopathology
- Electronic Health Records
- Female
- Male
- Aged
- Stroke Volume
- Middle Aged
- Risk Assessment
- Ventricular Function, Left
- United Kingdom/epidemiology
- Risk Factors
- Prognosis
- Aged, 80 and over
- Databases, Factual
- Unsupervised Machine Learning
- Hospitalization
- Time Factors
- Comorbidity
- Cause of Death
- Phenotype
- Data Mining