Toolkit for automated and rapid discovery of structural variants

Arda Soylev, Can Kockan, Fereydoun Hormozdiari, Can Alkan

Research output: Contribution to journalArticle

8 Scopus citations

Abstract

Structural variations (SV) are broadly defined as genomic alterations that affect >50 bp of DNA, which are shown to have significant effect on evolution and disease. The advent of high throughput sequencing (HTS) technologies and the ability to perform whole genome sequencing (WGS), makes it feasible to study these variants in depth. However, discovery of all forms of SV using WGS has proven to be challenging as the short reads produced by the predominant HTS platforms (<200 bp for current technologies) and the fact that most genomes include large amounts of repeats make it very difficult to unambiguously map and accurately characterize such variants. Furthermore, existing tools for SV discovery are primarily developed for only a few of the SV types, which may have conflicting sequence signatures (i.e. read pairs, read depth, split reads) with other, untargeted SV classes. Here we are introduce a new framework, TARDIS, which combines multiple read signatures into a single package to characterize most SV types simultaneously, while preventing such conflicts. TARDIS also has a modular structure that makes it easy to extend for the discovery of additional forms of SV.

Original languageEnglish (US)
Pages (from-to)3-7
Number of pages5
JournalMethods
Volume129
DOIs
StatePublished - Oct 1 2017

Keywords

  • Combinatorial algorithms
  • High throughput sequencing
  • Structural variation

ASJC Scopus subject areas

  • Molecular Biology
  • Biochemistry, Genetics and Molecular Biology(all)

Fingerprint Dive into the research topics of 'Toolkit for automated and rapid discovery of structural variants'. Together they form a unique fingerprint.

  • Cite this