TY - JOUR
T1 - A two-dimensional pooling strategy for rare variant detection on next-generation sequencing platforms
AU - Zuzarte, Philip C.
AU - Denroche, Robert E.
AU - Fehringer, Gordon
AU - Katzov-Eckert, Hagit
AU - Hung, Rayjean J.
AU - Mcpherson, John Douglas
PY - 2014/4/11
Y1 - 2014/4/11
N2 - We describe a method for pooling and sequencing DNA from a large number of individual samples while preserving information regarding sample identity. DNA from 576 individuals was arranged into four 12 row by 12 column matrices and then pooled by row and by column resulting in 96 total pools with 12 individuals in each pool. Pooling of DNA was carried out in a two-dimensional fashion, such that DNA from each individual is present in exactly one row pool and exactly one column pool. By considering the variants observed in the rows and columns of a matrix we are able to trace rare variants back to the specific individuals that carry them. The pooled DNA samples were enriched over a 250 kb region previously identified by GWAS to significantly predispose individuals to lung cancer. All 96 pools (12 row and 12 column pools from 4 matrices) were barcoded and sequenced on an Illumina HiSeq 2000 instrument with an average depth of coverage greater than 4,0006. Verification based on Ion PGM sequencing confirmed the presence of 91.4% of confidently classified SNVs assayed. In this way, each individual sample is sequenced in multiple pools providing more accurate variant calling than a single pool or a multiplexed approach. This provides a powerful method for rare variant detection in regions of interest at a reduced cost to the researcher.
AB - We describe a method for pooling and sequencing DNA from a large number of individual samples while preserving information regarding sample identity. DNA from 576 individuals was arranged into four 12 row by 12 column matrices and then pooled by row and by column resulting in 96 total pools with 12 individuals in each pool. Pooling of DNA was carried out in a two-dimensional fashion, such that DNA from each individual is present in exactly one row pool and exactly one column pool. By considering the variants observed in the rows and columns of a matrix we are able to trace rare variants back to the specific individuals that carry them. The pooled DNA samples were enriched over a 250 kb region previously identified by GWAS to significantly predispose individuals to lung cancer. All 96 pools (12 row and 12 column pools from 4 matrices) were barcoded and sequenced on an Illumina HiSeq 2000 instrument with an average depth of coverage greater than 4,0006. Verification based on Ion PGM sequencing confirmed the presence of 91.4% of confidently classified SNVs assayed. In this way, each individual sample is sequenced in multiple pools providing more accurate variant calling than a single pool or a multiplexed approach. This provides a powerful method for rare variant detection in regions of interest at a reduced cost to the researcher.
UR - http://www.scopus.com/inward/record.url?scp=84899655494&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84899655494&partnerID=8YFLogxK
U2 - 10.1371/journal.pone.0093455
DO - 10.1371/journal.pone.0093455
M3 - Article
C2 - 24728235
AN - SCOPUS:84899655494
VL - 9
JO - PLoS One
JF - PLoS One
SN - 1932-6203
IS - 4
M1 - e93455
ER -