Explainable hierarchical clustering for patient subtyping and risk prediction

Enrico Werner*, Jeffrey N. Clark, Alexander Hepburn, Ranjeet S. Bhamber, Michael Ambler, Christopher P. Bourdeaux, Christopher J. McWilliams, Raul Santos-Rodriguez

*Corresponding author for this work

Research output: Contribution to journalArticle (Academic Journal)peer-review

5 Citations (Scopus)

Abstract

We present a pipeline in which machine learning techniques are used to automatically identify and evaluate subtypes of hospital patients admitted between 2017 and 2021 in a large UK teaching hospital. Patient clusters are determined using routinely collected hospital data, such as those used in the UK’s National Early Warning Score 2 (NEWS2). An iterative, hierarchical clustering process was used to identify the minimum set of relevant features for cluster separation. With the use of state-of-the-art explainability techniques, the identified subtypes are interpreted and assigned clinical meaning, illustrating their robustness. In parallel, clinicians assessed intracluster similarities and intercluster differences of the identified patient subtypes within the context of their clinical knowledge. For each cluster, outcome prediction models were trained and their forecasting ability was illustrated against the NEWS2 of the unclustered patient cohort. These preliminary results suggest that subtype models can outperform the established NEWS2 method, providing improved prediction of patient deterioration. By considering both the computational outputs and clinician-based explanations in patient subtyping, we aim to highlight the mutual benefit of combining machine learning techniques with clinical expertise.

Original languageEnglish
JournalExperimental Biology and Medicine
DOIs
Publication statusPublished - 15 Dec 2023

Bibliographical note

Funding Information:
The author(s) disclosed receipt of the following financial support for the research, authorship, and/or publication of this article: This work was funded by Health Data Research UK via the Better Care Partnership Southwest (HDR CF0129) within the P-NEWS project (personalized early warning scores for preventing unplanned critical admissions); EW, JNC, AH, and RSR are funded by the UKRI Turing AI Fellowship (grant no. EP/V024817/1).

Publisher Copyright:
© 2023 by the Society for Experimental Biology and Medicine.

Keywords

  • clinical evaluation
  • early warning score
  • explainability
  • Hierarchical clustering
  • mortality prediction
  • patient subtypes

Fingerprint

Dive into the research topics of 'Explainable hierarchical clustering for patient subtyping and risk prediction'. Together they form a unique fingerprint.

Cite this