The EU Child Cohort Network’s core data: establishing a set of findable, accessible, interoperable and re-usable (FAIR) variables

Angela Pinot de Moira*, Sido Haakma, Katrine Strandberg-Larsen, Esther van Enckevort, Marjolein Kooijman, Tim Cadman, Marloes Cardol, Eva Corpeleijn, Sarah R Crozier, Liesbeth Duijts, Ahmed Elhakeem, Johan Eriksson, Janine F Felix, Sílvia Fernández-Barrés, Rachel Foong, Anne Forhan, Veit Grote, Kathrin Guerlich, Barbara Heude, Rae-Chi HuangMarjo-Riitta Järvelin, Anne Cathrine Jørgensen, Tuija Mikkola, Johanna LT Nader, Marie Pedersen, Maja Popovic, N Rautio, Lorenzo Richiardi, Justiina Ronkainen, T Roumeliotaki, Theodosia Salika, Sylvain P Sebert, Johan Vinther, Ellis Voerman, Martine Vrijheid, John Wright, Tiffany Yang, Faryal Zariouh, Marie-Aline Charles, Hazel M Inskip, Vincent W V Jaddoe, Morris A. Swertz, Anne-Marie Nybo Andersen

*Corresponding author for this work

Research output: Contribution to journalArticle (Academic Journal)peer-review

Abstract

The Horizon2020 LifeCycle Project is a cross-cohort collaboration which brings together data from multiple birth cohorts from across Europe and Australia to facilitate studies on the influence of earlylife exposures on later health outcomes. A major product of this collaboration has been the establishment of a FAIR (findable, accessible, interoperable and reusable) data resource known as the EU Child Cohort Network.

Here we focus on the EU Child Cohort Network’s core variables. These are a set of basic variables, derivable by the majority of participating cohorts and frequently used as covariates or exposures in lifecourse research. First, we describe the process by which the list of core variables was established. Second, we explain the protocol according to which these variables were harmonised in order to make them interoperable. Third, we describe the catalogue developed to ensure that the network’s data are findable and reusable. Finally, we describe the core data, including the proportion of variables harmonised by each cohort and the number of children for whom harmonised core data are available.

EU Child Cohort Network data will be analysed using a federated analysis platform, removing the need to physically transfer data and thus making the data more accessible to researchers. The network will add value to participating cohorts by increasing statistical power and exposure heterogeneity, as well as facilitating cross-cohort comparisons, cross-validation and replication. Our aim is to motivate other cohorts to join the network and encourage the use of the EU Child Cohort Network by the wider research community.
Original languageEnglish
JournalEuropean Journal of Epidemiology
Publication statusAccepted/In press - 12 Feb 2021

Cite this