Automated recognition of RNA structure motifs by their SHAPE data signatures

Pierce Radecki, Mirko Ledda, Sharon Aviran

Research output: Contribution to journalArticle

5 Citations (Scopus)

Abstract

High-throughput structure profiling (SP) experiments that provide information at nucleotide resolution are revolutionizing our ability to study RNA structures. Of particular interest are RNA elements whose underlying structures are necessary for their biological functions. We previously introduced patteRNA, an algorithm for rapidly mining SP data for patterns characteristic of such motifs. This work provided a proof-of-concept for the detection of motifs and the capability of distinguishing structures displaying pronounced conformational changes. Here, we describe several improvements and automation routines to patteRNA. We then consider more elaborate biological situations starting with the comparison or integration of results from searches for distinct motifs and across datasets. To facilitate such analyses, we characterize patteRNA’s outputs and describe a normalization framework that regularizes results. We then demonstrate that our algorithm successfully discerns between highly similar structural variants of the human immunodeficiency virus type 1 (HIV-1) Rev response element (RRE) and readily identifies its exact location in whole-genome structure profiles of HIV-1. This work highlights the breadth of information that can be gleaned from SP data and broadens the utility of data-driven methods as tools for the detection of novel RNA elements.

Original languageEnglish (US)
Article number300
JournalGenes
Volume9
Issue number6
DOIs
StatePublished - Jun 14 2018

Fingerprint

RNA
HIV-1
Automation
Response Elements
Nucleotides
Genome
RNA Recognition Motif
Datasets

Keywords

  • HIV
  • HMM-GMM
  • Machine learning
  • RNA structure
  • RRE
  • SHAPE
  • Structure probing

ASJC Scopus subject areas

  • Genetics
  • Genetics(clinical)

Cite this

Automated recognition of RNA structure motifs by their SHAPE data signatures. / Radecki, Pierce; Ledda, Mirko; Aviran, Sharon.

In: Genes, Vol. 9, No. 6, 300, 14.06.2018.

Research output: Contribution to journalArticle

@article{b63f7ad5f18545bdb28235e0c34d31a9,
title = "Automated recognition of RNA structure motifs by their SHAPE data signatures",
abstract = "High-throughput structure profiling (SP) experiments that provide information at nucleotide resolution are revolutionizing our ability to study RNA structures. Of particular interest are RNA elements whose underlying structures are necessary for their biological functions. We previously introduced patteRNA, an algorithm for rapidly mining SP data for patterns characteristic of such motifs. This work provided a proof-of-concept for the detection of motifs and the capability of distinguishing structures displaying pronounced conformational changes. Here, we describe several improvements and automation routines to patteRNA. We then consider more elaborate biological situations starting with the comparison or integration of results from searches for distinct motifs and across datasets. To facilitate such analyses, we characterize patteRNA’s outputs and describe a normalization framework that regularizes results. We then demonstrate that our algorithm successfully discerns between highly similar structural variants of the human immunodeficiency virus type 1 (HIV-1) Rev response element (RRE) and readily identifies its exact location in whole-genome structure profiles of HIV-1. This work highlights the breadth of information that can be gleaned from SP data and broadens the utility of data-driven methods as tools for the detection of novel RNA elements.",
keywords = "HIV, HMM-GMM, Machine learning, RNA structure, RRE, SHAPE, Structure probing",
author = "Pierce Radecki and Mirko Ledda and Sharon Aviran",
year = "2018",
month = "6",
day = "14",
doi = "10.3390/genes9060300",
language = "English (US)",
volume = "9",
journal = "Genes",
issn = "2073-4425",
publisher = "Multidisciplinary Digital Publishing Institute (MDPI)",
number = "6",

}

TY - JOUR

T1 - Automated recognition of RNA structure motifs by their SHAPE data signatures

AU - Radecki, Pierce

AU - Ledda, Mirko

AU - Aviran, Sharon

PY - 2018/6/14

Y1 - 2018/6/14

N2 - High-throughput structure profiling (SP) experiments that provide information at nucleotide resolution are revolutionizing our ability to study RNA structures. Of particular interest are RNA elements whose underlying structures are necessary for their biological functions. We previously introduced patteRNA, an algorithm for rapidly mining SP data for patterns characteristic of such motifs. This work provided a proof-of-concept for the detection of motifs and the capability of distinguishing structures displaying pronounced conformational changes. Here, we describe several improvements and automation routines to patteRNA. We then consider more elaborate biological situations starting with the comparison or integration of results from searches for distinct motifs and across datasets. To facilitate such analyses, we characterize patteRNA’s outputs and describe a normalization framework that regularizes results. We then demonstrate that our algorithm successfully discerns between highly similar structural variants of the human immunodeficiency virus type 1 (HIV-1) Rev response element (RRE) and readily identifies its exact location in whole-genome structure profiles of HIV-1. This work highlights the breadth of information that can be gleaned from SP data and broadens the utility of data-driven methods as tools for the detection of novel RNA elements.

AB - High-throughput structure profiling (SP) experiments that provide information at nucleotide resolution are revolutionizing our ability to study RNA structures. Of particular interest are RNA elements whose underlying structures are necessary for their biological functions. We previously introduced patteRNA, an algorithm for rapidly mining SP data for patterns characteristic of such motifs. This work provided a proof-of-concept for the detection of motifs and the capability of distinguishing structures displaying pronounced conformational changes. Here, we describe several improvements and automation routines to patteRNA. We then consider more elaborate biological situations starting with the comparison or integration of results from searches for distinct motifs and across datasets. To facilitate such analyses, we characterize patteRNA’s outputs and describe a normalization framework that regularizes results. We then demonstrate that our algorithm successfully discerns between highly similar structural variants of the human immunodeficiency virus type 1 (HIV-1) Rev response element (RRE) and readily identifies its exact location in whole-genome structure profiles of HIV-1. This work highlights the breadth of information that can be gleaned from SP data and broadens the utility of data-driven methods as tools for the detection of novel RNA elements.

KW - HIV

KW - HMM-GMM

KW - Machine learning

KW - RNA structure

KW - RRE

KW - SHAPE

KW - Structure probing

UR - http://www.scopus.com/inward/record.url?scp=85048707156&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85048707156&partnerID=8YFLogxK

U2 - 10.3390/genes9060300

DO - 10.3390/genes9060300

M3 - Article

AN - SCOPUS:85048707156

VL - 9

JO - Genes

JF - Genes

SN - 2073-4425

IS - 6

M1 - 300

ER -