Abstract
The Horizon2020 LifeCycle Project is a cross-cohort collaboration which brings together data from multiple birth cohorts from across Europe and Australia to facilitate studies on the influence of earlylife exposures on later health outcomes. A major product of this collaboration has been the establishment of a FAIR (findable, accessible, interoperable and reusable) data resource known as the EU Child Cohort Network.
Here we focus on the EU Child Cohort Network’s core variables. These are a set of basic variables, derivable by the majority of participating cohorts and frequently used as covariates or exposures in lifecourse research. First, we describe the process by which the list of core variables was established. Second, we explain the protocol according to which these variables were harmonised in order to make them interoperable. Third, we describe the catalogue developed to ensure that the network’s data are findable and reusable. Finally, we describe the core data, including the proportion of variables harmonised by each cohort and the number of children for whom harmonised core data are available.
EU Child Cohort Network data will be analysed using a federated analysis platform, removing the need to physically transfer data and thus making the data more accessible to researchers. The network will add value to participating cohorts by increasing statistical power and exposure heterogeneity, as well as facilitating cross-cohort comparisons, cross-validation and replication. Our aim is to motivate other cohorts to join the network and encourage the use of the EU Child Cohort Network by the wider research community.
Here we focus on the EU Child Cohort Network’s core variables. These are a set of basic variables, derivable by the majority of participating cohorts and frequently used as covariates or exposures in lifecourse research. First, we describe the process by which the list of core variables was established. Second, we explain the protocol according to which these variables were harmonised in order to make them interoperable. Third, we describe the catalogue developed to ensure that the network’s data are findable and reusable. Finally, we describe the core data, including the proportion of variables harmonised by each cohort and the number of children for whom harmonised core data are available.
EU Child Cohort Network data will be analysed using a federated analysis platform, removing the need to physically transfer data and thus making the data more accessible to researchers. The network will add value to participating cohorts by increasing statistical power and exposure heterogeneity, as well as facilitating cross-cohort comparisons, cross-validation and replication. Our aim is to motivate other cohorts to join the network and encourage the use of the EU Child Cohort Network by the wider research community.
Original language | English |
---|---|
Pages (from-to) | 565-580 |
Number of pages | 16 |
Journal | European Journal of Epidemiology |
Volume | 36 |
Issue number | 5 |
Early online date | 21 Apr 2021 |
DOIs | |
Publication status | Published - May 2021 |
Bibliographical note
Funding Information:The LifeCycle project received funding from the European Union’s Horizon 2020 research and innovation programme (Grant Agreement No. 733206 LifeCycle). All study specific acknowledgements and funding are presented in the supplementary material. This manuscript reflects only the author’s view and the Commission is not responsible for any use that may be made of the information it contains.
Publisher Copyright:
© 2021, The Author(s).
Keywords
- birth cohort
- cross-cohort collaboration
- lifecourse epidemiology
- data harmonisation
- FAIR (findable, accessible, interoperable and reusable) principles