TY - JOUR
T1 - Maximizing the acquisition of unique reads in non-invasive capture sequencing experiments
AU - Fontsere, Claudia
AU - Alvarez-Estape, Marina
AU - Lester, Jack
AU - Arandjelovic, Mimi
AU - Kuhlwilm, Martin
AU - Dieguez, Paula
AU - Agbor, Anthony
AU - Angedakin, Samuel
AU - Ayimisin, Emmanuel Ayuk
AU - Bessone, Mattia
AU - Brazzola, Gregory
AU - Deschner, Tobias
AU - Eno-Nku, Manasseh
AU - Granjon, Anne-Céline
AU - Head, Josephine
AU - Kadam, Parag
AU - Kalan, Ammie K
AU - Kambi, Mohamed
AU - Langergraber, Kevin
AU - Lapuente, Juan
AU - Maretti, Giovanna
AU - Piel, Alex
AU - Robbins, Martha M
AU - Stewart, Fiona
AU - Vergnes, Virginie
AU - Wittig, Roman M
AU - Kühl, Hjalmar S
AU - Marques-Bonet, Tomas
AU - Hughes, David A
AU - Lizano, Esther
PY - 2021/3/11
Y1 - 2021/3/11
N2 - Non-invasive samples as a source of DNA are gaining interest in genomic studies of endangered species. However, their complex nature and low endogenous DNA content hamper the recovery of good quality data. Target capture has become a productive method to enrich the endogenous fraction of non-invasive samples, such as feces, but its sensitivity has not yet been extensively studied. Coping with fecal samples with an endogenous DNA content below 1% is a common problem when prior selection of samples from a large collection is not possible. However, samples classified as unfavorable for target capture sequencing might be the only representatives of unique specific geographical locations or to answer the question of interest. To explore how library complexity may be increased without repeating DNA extractions and generating new libraries, here we have captured the exome of 60 chimpanzees (Pan troglodytes) using fecal samples with very low proportions of endogenous content (< 1%). Our results indicate that by performing additional hybridizations of the same libraries, the molecular complexity can be maintained to achieve higher coverage. Also, whenever possible, the starting DNA material for capture should be increased. Lastly, we have specifically calculated the sequencing effort needed to avoid exhausting the library complexity of enriched fecal samples with low endogenous DNA content. This study provides guidelines, schemes and tools for laboratories facing the challenges of working with non-invasive samples containing extremely low amounts of endogenous DNA.
AB - Non-invasive samples as a source of DNA are gaining interest in genomic studies of endangered species. However, their complex nature and low endogenous DNA content hamper the recovery of good quality data. Target capture has become a productive method to enrich the endogenous fraction of non-invasive samples, such as feces, but its sensitivity has not yet been extensively studied. Coping with fecal samples with an endogenous DNA content below 1% is a common problem when prior selection of samples from a large collection is not possible. However, samples classified as unfavorable for target capture sequencing might be the only representatives of unique specific geographical locations or to answer the question of interest. To explore how library complexity may be increased without repeating DNA extractions and generating new libraries, here we have captured the exome of 60 chimpanzees (Pan troglodytes) using fecal samples with very low proportions of endogenous content (< 1%). Our results indicate that by performing additional hybridizations of the same libraries, the molecular complexity can be maintained to achieve higher coverage. Also, whenever possible, the starting DNA material for capture should be increased. Lastly, we have specifically calculated the sequencing effort needed to avoid exhausting the library complexity of enriched fecal samples with low endogenous DNA content. This study provides guidelines, schemes and tools for laboratories facing the challenges of working with non-invasive samples containing extremely low amounts of endogenous DNA.
KW - non-invasive samples
KW - fecal samples
KW - target capture
KW - molecular complexity
KW - conservation genomics
KW - chimpanzees
U2 - 10.1111/1755-0998.13300
DO - 10.1111/1755-0998.13300
M3 - Article (Academic Journal)
C2 - 33217149
SN - 1755-098X
JO - Molecular Ecology Resources
JF - Molecular Ecology Resources
ER -