Extracting information from RNA SHAPE data: Kalman filtering approach

Sana Vaziri, Patrice Koehl, Sharon Aviran

Research output: Contribution to journalArticle

2 Citations (Scopus)

Abstract

RNA SHAPE experiments have become important and successful sources of information for RNA structure prediction. In such experiments, chemical reagents are used to probe RNA backbone flexibility at the nucleotide level, which in turn provides information on base pairing and therefore secondary structure. Little is known, however, about the statistics of such SHAPE data. In this work, we explore different representations of noise in SHAPE data and propose a statistically sound framework for extracting reliable reactivity information from multiple SHAPE replicates. Our analyses of RNA SHAPE experiments underscore that a normal noise model is not adequate to represent their data. We propose instead a log-normal representation of noise and discuss its relevance. Under this assumption, we observe that processing simulated SHAPE data by directly averaging different replicates leads to bias. Such bias can be reduced by analyzing the data following a log transformation, either by log-averaging or Kalman filtering. Application of Kalman filtering has the additional advantage that a prior on the nucleotide reactivities can be introduced. We show that the performance of Kalman filtering is then directly dependent on the quality of that prior. We conclude the paper with guidelines on signal processing of RNA SHAPE data.

Original languageEnglish (US)
Pages (from-to)e0207029
JournalPLoS One
Volume13
Issue number11
DOIs
StatePublished - Jan 1 2018

Fingerprint

RNA
Noise
Nucleotides
nucleotides
RNA probes
RNA Probes
Experiments
information sources
Acoustic noise
Base Pairing
Signal processing
statistics
Statistics
Acoustic waves
Guidelines
prediction
Processing

ASJC Scopus subject areas

  • Biochemistry, Genetics and Molecular Biology(all)
  • Agricultural and Biological Sciences(all)

Cite this

Extracting information from RNA SHAPE data : Kalman filtering approach. / Vaziri, Sana; Koehl, Patrice; Aviran, Sharon.

In: PLoS One, Vol. 13, No. 11, 01.01.2018, p. e0207029.

Research output: Contribution to journalArticle

Vaziri, Sana ; Koehl, Patrice ; Aviran, Sharon. / Extracting information from RNA SHAPE data : Kalman filtering approach. In: PLoS One. 2018 ; Vol. 13, No. 11. pp. e0207029.
@article{7888685a67a040a9944f3cf79d6a8b14,
title = "Extracting information from RNA SHAPE data: Kalman filtering approach",
abstract = "RNA SHAPE experiments have become important and successful sources of information for RNA structure prediction. In such experiments, chemical reagents are used to probe RNA backbone flexibility at the nucleotide level, which in turn provides information on base pairing and therefore secondary structure. Little is known, however, about the statistics of such SHAPE data. In this work, we explore different representations of noise in SHAPE data and propose a statistically sound framework for extracting reliable reactivity information from multiple SHAPE replicates. Our analyses of RNA SHAPE experiments underscore that a normal noise model is not adequate to represent their data. We propose instead a log-normal representation of noise and discuss its relevance. Under this assumption, we observe that processing simulated SHAPE data by directly averaging different replicates leads to bias. Such bias can be reduced by analyzing the data following a log transformation, either by log-averaging or Kalman filtering. Application of Kalman filtering has the additional advantage that a prior on the nucleotide reactivities can be introduced. We show that the performance of Kalman filtering is then directly dependent on the quality of that prior. We conclude the paper with guidelines on signal processing of RNA SHAPE data.",
author = "Sana Vaziri and Patrice Koehl and Sharon Aviran",
year = "2018",
month = "1",
day = "1",
doi = "10.1371/journal.pone.0207029",
language = "English (US)",
volume = "13",
pages = "e0207029",
journal = "PLoS One",
issn = "1932-6203",
publisher = "Public Library of Science",
number = "11",

}

TY - JOUR

T1 - Extracting information from RNA SHAPE data

T2 - Kalman filtering approach

AU - Vaziri, Sana

AU - Koehl, Patrice

AU - Aviran, Sharon

PY - 2018/1/1

Y1 - 2018/1/1

N2 - RNA SHAPE experiments have become important and successful sources of information for RNA structure prediction. In such experiments, chemical reagents are used to probe RNA backbone flexibility at the nucleotide level, which in turn provides information on base pairing and therefore secondary structure. Little is known, however, about the statistics of such SHAPE data. In this work, we explore different representations of noise in SHAPE data and propose a statistically sound framework for extracting reliable reactivity information from multiple SHAPE replicates. Our analyses of RNA SHAPE experiments underscore that a normal noise model is not adequate to represent their data. We propose instead a log-normal representation of noise and discuss its relevance. Under this assumption, we observe that processing simulated SHAPE data by directly averaging different replicates leads to bias. Such bias can be reduced by analyzing the data following a log transformation, either by log-averaging or Kalman filtering. Application of Kalman filtering has the additional advantage that a prior on the nucleotide reactivities can be introduced. We show that the performance of Kalman filtering is then directly dependent on the quality of that prior. We conclude the paper with guidelines on signal processing of RNA SHAPE data.

AB - RNA SHAPE experiments have become important and successful sources of information for RNA structure prediction. In such experiments, chemical reagents are used to probe RNA backbone flexibility at the nucleotide level, which in turn provides information on base pairing and therefore secondary structure. Little is known, however, about the statistics of such SHAPE data. In this work, we explore different representations of noise in SHAPE data and propose a statistically sound framework for extracting reliable reactivity information from multiple SHAPE replicates. Our analyses of RNA SHAPE experiments underscore that a normal noise model is not adequate to represent their data. We propose instead a log-normal representation of noise and discuss its relevance. Under this assumption, we observe that processing simulated SHAPE data by directly averaging different replicates leads to bias. Such bias can be reduced by analyzing the data following a log transformation, either by log-averaging or Kalman filtering. Application of Kalman filtering has the additional advantage that a prior on the nucleotide reactivities can be introduced. We show that the performance of Kalman filtering is then directly dependent on the quality of that prior. We conclude the paper with guidelines on signal processing of RNA SHAPE data.

UR - http://www.scopus.com/inward/record.url?scp=85056932433&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85056932433&partnerID=8YFLogxK

U2 - 10.1371/journal.pone.0207029

DO - 10.1371/journal.pone.0207029

M3 - Article

C2 - 30462682

AN - SCOPUS:85056932433

VL - 13

SP - e0207029

JO - PLoS One

JF - PLoS One

SN - 1932-6203

IS - 11

ER -