TY - JOUR
T1 - SeqControl
T2 - Process control for DNA sequencing
AU - Chong, Lauren C.
AU - Albuquerque, Marco A.
AU - Harding, Nicholas J.
AU - Caloian, Cristian
AU - Chan-Seng-Yue, Michelle
AU - De Borja, Richard
AU - Fraser, Michael
AU - Denroche, Robert E.
AU - Beck, Timothy A.
AU - Van Der Kwast, Theodorus
AU - Bristow, Robert G.
AU - Mcpherson, John Douglas
AU - Boutros, Paul C.
PY - 2014/1/1
Y1 - 2014/1/1
N2 - As high-throughput sequencing continues to increase in speed and throughput, routine clinical and industrial application draws closer. These 'production' settings will require enhanced quality monitoring and quality control to optimize output and reduce costs. We developed SeqControl, a framework for predicting sequencing quality and coverage using a set of 15 metrics describing overall coverage, coverage distribution, basewise coverage and basewise quality. Using whole-genome sequences of 27 prostate cancers and 26 normal references, we derived multivariate models that predict sequencing quality and depth. SeqControl robustly predicted how much sequencing was required to reach a given coverage depth (area under the curve (AUC) = 0.993), accurately classified clinically relevant formalin-fixed, paraffin-embedded samples, and made predictions from as little as one-eighth of a sequencing lane (AUC = 0.967). These techniques can be immediately incorporated into existing sequencing pipelines to monitor data quality in real time. SeqControl is available at http://labs.oicr.on.ca/Boutros-lab/software/SeqControl/.
AB - As high-throughput sequencing continues to increase in speed and throughput, routine clinical and industrial application draws closer. These 'production' settings will require enhanced quality monitoring and quality control to optimize output and reduce costs. We developed SeqControl, a framework for predicting sequencing quality and coverage using a set of 15 metrics describing overall coverage, coverage distribution, basewise coverage and basewise quality. Using whole-genome sequences of 27 prostate cancers and 26 normal references, we derived multivariate models that predict sequencing quality and depth. SeqControl robustly predicted how much sequencing was required to reach a given coverage depth (area under the curve (AUC) = 0.993), accurately classified clinically relevant formalin-fixed, paraffin-embedded samples, and made predictions from as little as one-eighth of a sequencing lane (AUC = 0.967). These techniques can be immediately incorporated into existing sequencing pipelines to monitor data quality in real time. SeqControl is available at http://labs.oicr.on.ca/Boutros-lab/software/SeqControl/.
UR - http://www.scopus.com/inward/record.url?scp=84921758024&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=84921758024&partnerID=8YFLogxK
U2 - 10.1038/nmeth.3094
DO - 10.1038/nmeth.3094
M3 - Article
C2 - 25173705
AN - SCOPUS:84921758024
VL - 11
SP - 1071
EP - 1075
JO - PLoS Medicine
JF - PLoS Medicine
SN - 1549-1277
IS - 10
ER -