Projects per year
Abstract
Motivation: Fine mapping is a widely used approach for identifying the causal variant(s) at disease-associated loci. Standard methods (e.g. multiple regression) require individual level genotypes. Recent fine mapping methods using summary-level data require the pairwise correlation coefficients (r2) of the variants. However, haplotypes rather than pairwise r2, are the true biological representation of linkage disequilibrium (LD) among multiple loci. In this paper, we present an empirical iterative method, HAPlotype Regional Association analysis Program (HAPRAP), that enables fine mapping using summary statistics and haplotype information from an individual-level reference panel.
Results: Simulations with individual-level genotypes show that the results of HAPRAP and multiple regression are highly consistent. In simulation with summary-level data, we demonstrate that HAPRAP is less sensitive to poor LD estimates. In a parametric simulation using Genetic Investigation of ANthropometric Traits (GIANT) height data, HAPRAP performs well with a small training sample size (N<2000) while other methods become suboptimal. Moreover, HAPRAP’s performance is not affected substantially by SNPs with low minor allele frequencies. We applied the method to existing quantitative trait and binary outcome meta-analyses (human height, QTc interval and gallbladder disease); all previous reported association signals were replicated and two additional variants were independently associated with human height. Due to the growing availability of summary level data, the value of HAPRAP is likely to increase markedly for future analyses (e.g. functional prediction and identification of instruments for Mendelian randomization).
Availability: The HAPRAP package and documentation are available online: http://apps.biocompute.org.uk/haprap/
Results: Simulations with individual-level genotypes show that the results of HAPRAP and multiple regression are highly consistent. In simulation with summary-level data, we demonstrate that HAPRAP is less sensitive to poor LD estimates. In a parametric simulation using Genetic Investigation of ANthropometric Traits (GIANT) height data, HAPRAP performs well with a small training sample size (N<2000) while other methods become suboptimal. Moreover, HAPRAP’s performance is not affected substantially by SNPs with low minor allele frequencies. We applied the method to existing quantitative trait and binary outcome meta-analyses (human height, QTc interval and gallbladder disease); all previous reported association signals were replicated and two additional variants were independently associated with human height. Due to the growing availability of summary level data, the value of HAPRAP is likely to increase markedly for future analyses (e.g. functional prediction and identification of instruments for Mendelian randomization).
Availability: The HAPRAP package and documentation are available online: http://apps.biocompute.org.uk/haprap/
Original language | English |
---|---|
Pages (from-to) | 79-86 |
Number of pages | 8 |
Journal | Bioinformatics |
Volume | 33 |
Issue number | 1 |
Early online date | 1 Sept 2016 |
DOIs | |
Publication status | Published - Jan 2017 |
Fingerprint
Dive into the research topics of 'HAPRAP: a haplotype-based iterative method for statistical fine mapping using GWAS summary statistics'. Together they form a unique fingerprint.Projects
- 1 Finished
-
IEU Theme 2
Flach, P. A. (Principal Investigator), Gaunt, T. R. (Principal Investigator) & Gaunt, T. R. (Principal Investigator)
1/06/13 → 31/03/18
Project: Research
Profiles
-
Professor Tom R Gaunt
- Bristol Medical School (PHS) - Professor of Health and Biomedical Informatics and MRC Investigator
- Bristol Population Health Science Institute
- MRC Integrative Epidemiology Unit - Programme lead
Person: Academic , Member