Modeling and automation of sequencing-based characterization of RNA structure

Sharon Aviran, Cole Trapnell, Julius B. Lucks, Stefanie A. Mortimer, Shujun Luo, Gary P. Schroth, Jennifer A. Doudna, Adam P. Arkin, Lior Pachter

Research output: Contribution to journalArticle

81 Scopus citations

Abstract

Sequence census methods reduce molecular measurements such as transcript abundance and protein-nucleic acid interactions to counting problems via DNA sequencing. We focus on a novel assay utilizing this approach, called selective 2′-hydroxyl acylation analyzed by primer extension sequencing (SHAPE-Seq), that can be used to characterize RNA secondary and tertiary structure. We describe a fully automated data analysis pipeline for SHAPE-Seq analysis that includes read processing, mapping, and structural inference based on a model of the experiment. Our methods rely on the solution of a series of convex optimization problems for which we develop efficient and effective numerical algorithms. Our results can be easily extended to other chemical probes of RNA structure, and also generalized to modeling polymerase drop-off in other sequence census-based experiments.

Original languageEnglish (US)
Pages (from-to)11069-11074
Number of pages6
JournalProceedings of the National Academy of Sciences of the United States of America
Volume108
Issue number27
DOIs
StatePublished - Jul 5 2011

Keywords

  • Chemical mapping
  • Next generation sequencing
  • RNA folding
  • RNA sequencing
  • Signal processing

ASJC Scopus subject areas

  • General

Fingerprint Dive into the research topics of 'Modeling and automation of sequencing-based characterization of RNA structure'. Together they form a unique fingerprint.

  • Cite this

    Aviran, S., Trapnell, C., Lucks, J. B., Mortimer, S. A., Luo, S., Schroth, G. P., Doudna, J. A., Arkin, A. P., & Pachter, L. (2011). Modeling and automation of sequencing-based characterization of RNA structure. Proceedings of the National Academy of Sciences of the United States of America, 108(27), 11069-11074. https://doi.org/10.1073/pnas.1106541108