TY - JOUR
T1 - A roadmap for high-throughput sequencing studies of wild animal populations using noninvasive samples and hybridization capture
AU - White, Lauren C.
AU - Fontsere, Claudia
AU - Lizano, Esther
AU - Hughes, David A.
AU - Angedakin, Samuel
AU - Arandjelovic, Mimi
AU - Granjon, Anne Céline
AU - Hans, Jörg B.
AU - Lester, Jack D.
AU - Rabanus-Wallace, M. Timothy
AU - Rowney, Carolyn
AU - Städele, Veronika
AU - Marques-Bonet, Tomas
AU - Langergraber, Kevin E.
AU - Vigilant, Linda
N1 - © 2019 John Wiley & Sons Ltd.
PY - 2019/5/1
Y1 - 2019/5/1
N2 - Large-scale genomic studies of wild animal populations are often limited by access to high-quality DNA. Although non-invasive samples, such as feces, can be readily collected, DNA from the sample producers is usually present in low quantities, fragmented, and contaminated by microorganism and dietary DNAs. Hybridization capture can help overcome these impediments by increasing the proportion of subject DNA prior to high-throughput sequencing. Here we evaluate a key design variable for hybridization capture, the number of rounds of capture, by testing whether one or two rounds are most appropriate, given varying sample quality (as measured by the ratios of subject to total DNA). We used a set of 1,780 quality-assessed wild chimpanzee (Pan troglodytes schweinfurthii) fecal samples and chose 110 samples of varying quality for exome capture and sequencing. We used multiple regression to assess the effects of the ratio of subject to total DNA (sample quality), rounds of capture, and sequencing effort on the number of unique exome reads sequenced. We not only show that one round of capture is preferable when the proportion of subject DNA in a sample is above ~2-3%, but also explore various types of bias introduced by capture, and develop a model that predicts the sequencing effort necessary for a desired data yield from samples of a given quality. Thus our results provide a useful guide and pave a methodological way forward for researchers wishing to plan similar hybridization capture studies.
AB - Large-scale genomic studies of wild animal populations are often limited by access to high-quality DNA. Although non-invasive samples, such as feces, can be readily collected, DNA from the sample producers is usually present in low quantities, fragmented, and contaminated by microorganism and dietary DNAs. Hybridization capture can help overcome these impediments by increasing the proportion of subject DNA prior to high-throughput sequencing. Here we evaluate a key design variable for hybridization capture, the number of rounds of capture, by testing whether one or two rounds are most appropriate, given varying sample quality (as measured by the ratios of subject to total DNA). We used a set of 1,780 quality-assessed wild chimpanzee (Pan troglodytes schweinfurthii) fecal samples and chose 110 samples of varying quality for exome capture and sequencing. We used multiple regression to assess the effects of the ratio of subject to total DNA (sample quality), rounds of capture, and sequencing effort on the number of unique exome reads sequenced. We not only show that one round of capture is preferable when the proportion of subject DNA in a sample is above ~2-3%, but also explore various types of bias introduced by capture, and develop a model that predicts the sequencing effort necessary for a desired data yield from samples of a given quality. Thus our results provide a useful guide and pave a methodological way forward for researchers wishing to plan similar hybridization capture studies.
KW - Animals
KW - Animals, Wild
KW - DNA/chemistry
KW - Feces/chemistry
KW - Genetics, Population/methods
KW - High-Throughput Nucleotide Sequencing/methods
KW - Nucleic Acid Hybridization/methods
KW - Pan troglodytes
UR - http://www.scopus.com/inward/record.url?scp=85064595943&partnerID=8YFLogxK
U2 - 10.1111/1755-0998.12993
DO - 10.1111/1755-0998.12993
M3 - Article (Academic Journal)
C2 - 30637963
AN - SCOPUS:85064595943
VL - 19
SP - 609
EP - 622
JO - Molecular Ecology Resources
JF - Molecular Ecology Resources
SN - 1755-098X
IS - 3
ER -