Empirical Bayes accomodation of batch-effects in microarray data using identical replicate reference samples: Application to RNA expression profiling of blood from Duchenne muscular dystrophy patients

Wynn L. Walker, Isaac H. Liao, Donald L. Gilbert, Brenda Wong, Katherine S. Pollard, Charles E. McCulloch, Lisa Lit, Frank R Sharp

Research output: Contribution to journalArticle

34 Citations (Scopus)

Abstract

Background: Non-biological experimental error routinely occurs in microarray data collected in different batches. It is often impossible to compare groups of samples from independent experiments because batch effects confound true gene expression differences. Existing methods can correct for batch effects only when samples from all biological groups are represented in every batch. Results: In this report we describe a generalized empirical Bayes approach to correct for cross-experimental batch effects, allowing direct comparisons of gene expression between biological groups from independent experiments. The proposed experimental design uses identical reference samples in each batch in every experiment. These reference samples are from the same tissue as the experimental samples. This design with tissue matched reference samples allows a gene-by-gene correction to be performed using fewer arrays than currently available methods. We examine the effects of non-biological variation within a single experiment and between experiments. Conclusion: Batch correction has a significant impact on which genes are identified as differentially regulated. Using this method, gene expression in the blood of patients with Duchenne Muscular Dystrophy is shown to differ for hundreds of genes when compared to controls. The numbers of specific genes differ depending upon whether between experiment and/or between batch corrections are performed.

Original languageEnglish (US)
Article number494
JournalBMC Genomics
Volume9
DOIs
StatePublished - Oct 20 2008

Fingerprint

Duchenne Muscular Dystrophy
RNA
Genes
Gene Expression
Research Design

ASJC Scopus subject areas

  • Biotechnology
  • Genetics

Cite this

Empirical Bayes accomodation of batch-effects in microarray data using identical replicate reference samples : Application to RNA expression profiling of blood from Duchenne muscular dystrophy patients. / Walker, Wynn L.; Liao, Isaac H.; Gilbert, Donald L.; Wong, Brenda; Pollard, Katherine S.; McCulloch, Charles E.; Lit, Lisa; Sharp, Frank R.

In: BMC Genomics, Vol. 9, 494, 20.10.2008.

Research output: Contribution to journalArticle

Walker, Wynn L. ; Liao, Isaac H. ; Gilbert, Donald L. ; Wong, Brenda ; Pollard, Katherine S. ; McCulloch, Charles E. ; Lit, Lisa ; Sharp, Frank R. / Empirical Bayes accomodation of batch-effects in microarray data using identical replicate reference samples : Application to RNA expression profiling of blood from Duchenne muscular dystrophy patients. In: BMC Genomics. 2008 ; Vol. 9.
@article{d3ef6c7ba2c24df4b852b1ff7ba8bb18,
title = "Empirical Bayes accomodation of batch-effects in microarray data using identical replicate reference samples: Application to RNA expression profiling of blood from Duchenne muscular dystrophy patients",
abstract = "Background: Non-biological experimental error routinely occurs in microarray data collected in different batches. It is often impossible to compare groups of samples from independent experiments because batch effects confound true gene expression differences. Existing methods can correct for batch effects only when samples from all biological groups are represented in every batch. Results: In this report we describe a generalized empirical Bayes approach to correct for cross-experimental batch effects, allowing direct comparisons of gene expression between biological groups from independent experiments. The proposed experimental design uses identical reference samples in each batch in every experiment. These reference samples are from the same tissue as the experimental samples. This design with tissue matched reference samples allows a gene-by-gene correction to be performed using fewer arrays than currently available methods. We examine the effects of non-biological variation within a single experiment and between experiments. Conclusion: Batch correction has a significant impact on which genes are identified as differentially regulated. Using this method, gene expression in the blood of patients with Duchenne Muscular Dystrophy is shown to differ for hundreds of genes when compared to controls. The numbers of specific genes differ depending upon whether between experiment and/or between batch corrections are performed.",
author = "Walker, {Wynn L.} and Liao, {Isaac H.} and Gilbert, {Donald L.} and Brenda Wong and Pollard, {Katherine S.} and McCulloch, {Charles E.} and Lisa Lit and Sharp, {Frank R}",
year = "2008",
month = "10",
day = "20",
doi = "10.1186/1471-2164-9-494",
language = "English (US)",
volume = "9",
journal = "BMC Genomics",
issn = "1471-2164",
publisher = "BioMed Central",

}

TY - JOUR

T1 - Empirical Bayes accomodation of batch-effects in microarray data using identical replicate reference samples

T2 - Application to RNA expression profiling of blood from Duchenne muscular dystrophy patients

AU - Walker, Wynn L.

AU - Liao, Isaac H.

AU - Gilbert, Donald L.

AU - Wong, Brenda

AU - Pollard, Katherine S.

AU - McCulloch, Charles E.

AU - Lit, Lisa

AU - Sharp, Frank R

PY - 2008/10/20

Y1 - 2008/10/20

N2 - Background: Non-biological experimental error routinely occurs in microarray data collected in different batches. It is often impossible to compare groups of samples from independent experiments because batch effects confound true gene expression differences. Existing methods can correct for batch effects only when samples from all biological groups are represented in every batch. Results: In this report we describe a generalized empirical Bayes approach to correct for cross-experimental batch effects, allowing direct comparisons of gene expression between biological groups from independent experiments. The proposed experimental design uses identical reference samples in each batch in every experiment. These reference samples are from the same tissue as the experimental samples. This design with tissue matched reference samples allows a gene-by-gene correction to be performed using fewer arrays than currently available methods. We examine the effects of non-biological variation within a single experiment and between experiments. Conclusion: Batch correction has a significant impact on which genes are identified as differentially regulated. Using this method, gene expression in the blood of patients with Duchenne Muscular Dystrophy is shown to differ for hundreds of genes when compared to controls. The numbers of specific genes differ depending upon whether between experiment and/or between batch corrections are performed.

AB - Background: Non-biological experimental error routinely occurs in microarray data collected in different batches. It is often impossible to compare groups of samples from independent experiments because batch effects confound true gene expression differences. Existing methods can correct for batch effects only when samples from all biological groups are represented in every batch. Results: In this report we describe a generalized empirical Bayes approach to correct for cross-experimental batch effects, allowing direct comparisons of gene expression between biological groups from independent experiments. The proposed experimental design uses identical reference samples in each batch in every experiment. These reference samples are from the same tissue as the experimental samples. This design with tissue matched reference samples allows a gene-by-gene correction to be performed using fewer arrays than currently available methods. We examine the effects of non-biological variation within a single experiment and between experiments. Conclusion: Batch correction has a significant impact on which genes are identified as differentially regulated. Using this method, gene expression in the blood of patients with Duchenne Muscular Dystrophy is shown to differ for hundreds of genes when compared to controls. The numbers of specific genes differ depending upon whether between experiment and/or between batch corrections are performed.

UR - http://www.scopus.com/inward/record.url?scp=55249088991&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=55249088991&partnerID=8YFLogxK

U2 - 10.1186/1471-2164-9-494

DO - 10.1186/1471-2164-9-494

M3 - Article

C2 - 18937867

AN - SCOPUS:55249088991

VL - 9

JO - BMC Genomics

JF - BMC Genomics

SN - 1471-2164

M1 - 494

ER -