TY - JOUR
T1 - Multivariate canonical correlation analysis identifies additional genetic variants for chronic kidney disease
AU - Osborne, Amy J
AU - Bierzynska, Agnieszka
AU - Colby, Elizabeth
AU - Andag, Uwe
AU - Kalra, Philip A
AU - Radresa, Olivier
AU - Skroblin, Philipp
AU - Taal, Maarten W
AU - Welsh, Gavin I
AU - Saleem, Moin A
AU - Campbell, Colin
N1 - Publisher Copyright:
© The Author(s) 2024.
PY - 2024/3/9
Y1 - 2024/3/9
N2 - Chronic kidney diseases (CKD) have genetic associations with kidney function. Univariate genome-wide association studies (GWAS) have identified single nucleotide polymorphisms (SNPs) associated with estimated glomerular filtration rate (eGFR) and blood urea nitrogen (BUN), two complementary kidney function markers. However, it is unknown whether additional SNPs for kidney function can be identified by multivariate statistical analysis. To address this, we applied canonical correlation analysis (CCA), a multivariate method, to two individual-level CKD genotype datasets, and metaCCA to two published GWAS summary statistics datasets. We identified SNPs previously associated with kidney function by published univariate GWASs with high replication rates, validating the metaCCA method. We then extended discovery and identified previously unreported lead SNPs for both kidney function markers, jointly. These showed expression quantitative trait loci (eQTL) colocalisation with genes having significant differential expression between CKD and healthy individuals. Several of these identified lead missense SNPs were predicted to have a functional impact, including in SLC14A2. We also identified previously unreported lead SNPs that showed significant correlation with both kidney function markers, jointly, in the European ancestry CKDGen, National Unified Renal Translational Research Enterprise (NURTuRE)-CKD and Salford Kidney Study (SKS) datasets. Of these, rs3094060 colocalised with FLOT1 gene expression and was significantly more common in CKD cases in both NURTURE-CKD and SKS, than in the general population. Overall, by using multivariate analysis by CCA, we identified additional SNPs and genes for both kidney function and CKD, that can be prioritised for further CKD analyses.
AB - Chronic kidney diseases (CKD) have genetic associations with kidney function. Univariate genome-wide association studies (GWAS) have identified single nucleotide polymorphisms (SNPs) associated with estimated glomerular filtration rate (eGFR) and blood urea nitrogen (BUN), two complementary kidney function markers. However, it is unknown whether additional SNPs for kidney function can be identified by multivariate statistical analysis. To address this, we applied canonical correlation analysis (CCA), a multivariate method, to two individual-level CKD genotype datasets, and metaCCA to two published GWAS summary statistics datasets. We identified SNPs previously associated with kidney function by published univariate GWASs with high replication rates, validating the metaCCA method. We then extended discovery and identified previously unreported lead SNPs for both kidney function markers, jointly. These showed expression quantitative trait loci (eQTL) colocalisation with genes having significant differential expression between CKD and healthy individuals. Several of these identified lead missense SNPs were predicted to have a functional impact, including in SLC14A2. We also identified previously unreported lead SNPs that showed significant correlation with both kidney function markers, jointly, in the European ancestry CKDGen, National Unified Renal Translational Research Enterprise (NURTuRE)-CKD and Salford Kidney Study (SKS) datasets. Of these, rs3094060 colocalised with FLOT1 gene expression and was significantly more common in CKD cases in both NURTURE-CKD and SKS, than in the general population. Overall, by using multivariate analysis by CCA, we identified additional SNPs and genes for both kidney function and CKD, that can be prioritised for further CKD analyses.
KW - Humans
KW - Genome-Wide Association Study/methods
KW - Canonical Correlation Analysis
KW - Renal Insufficiency, Chronic/genetics
KW - Kidney
KW - Quantitative Trait Loci/genetics
U2 - 10.1038/s41540-024-00350-8
DO - 10.1038/s41540-024-00350-8
M3 - Article (Academic Journal)
C2 - 38459044
SN - 2056-7189
VL - 10
SP - 15
JO - npj Systems Biology and Applications
JF - npj Systems Biology and Applications
IS - 1
M1 - 28
ER -