Abstract
We are facing a major challenge in bridging the gap between identifying subtypes of asthma, to understanding causal mechanisms, and translating this knowledge into personalized prevention and management strategies. In recent years, "big data" has been sold as a panacea for generating hypotheses and driving new frontiers of healthcare; the idea that the data must and will speak for themselves is fast becoming a new dogma. One of the dangers of ready accessibility of healthcare data and computational tools for data analysis is that the process of data mining may become uncoupled from the scientific process of clinical interpretation, understanding the provenance of the data and external validation. Although advances in computational methods can be valuable for using unexpected structure in data to generate hypotheses, there remains a need for testing hypotheses and interpreting results with scientific rigor. We argue for combining data-driven and hypothesis-driven methods in a careful synergy, and the importance of carefully characterized birth and patient cohorts with genetic, phenotypic, biological and molecular data in this process cannot be overemphasized. The main challenge on the road ahead is to harness 'bigger' healthcare data in ways that produce meaningful clinical interpretation and to translate this into better diagnoses and properly personalized prevention and treatment plans. There is a pressing need for cross-disciplinary research with an integrative approach to data science, whereby basic scientists, clinicians, data analysts and epidemiologists work together to understand the heterogeneity of asthma.
Original language | English |
---|---|
Pages (from-to) | 400-407 |
Number of pages | 8 |
Journal | Journal of Allergy and Clinical Immunology |
Volume | 139 |
Issue number | 2 |
Early online date | 18 Nov 2016 |
DOIs | |
Publication status | Published - Feb 2017 |
Keywords
- Asthma
- endotypes
- machine learning
- big data
- birth cohorts