ISOLATE: A computational strategy for identifying the primary origin of cancers using high-throughput sequencing

Gerald Quon, Quaid Morris

Research output: Contribution to journalArticle

33 Scopus citations

Abstract

Motivation: One of the most deadly cancer diagnoses is the carcinoma of unknown primary origin. Without the knowledge of the site of origin, treatment regimens are limited in their specificity and result in high mortality rates. Though supervised classification methods have been developed to predict the site of origin based on gene expression data, they require large numbers of previously classified tumors for training, in part because they do not account for sample heterogeneity, which limits their application to well-studied cancers. Results: We present ISOLATE, a new statistical method that simultaneously predicts the primary site of origin of cancers and addresses sample heterogeneity, while taking advantage of new high-throughput sequencing technology that promises to bring higher accuracy and reproducibility to gene expression profiling experiments. ISOLATE makes predictions de novo, without having seen any training expression profiles of cancers with identified origin. Compared with previous methods, ISOLATE is able to predict the primary site of origin, de-convolve and remove the effect of sample heterogeneity and identify differentially expressed genes with higher accuracy, across both synthetic and clinical datasets. Methods such as ISOLATE are invaluable tools for clinicians faced with carcinomas of unknown primary origin.

Original languageEnglish (US)
Pages (from-to)2882-2889
Number of pages8
JournalBioinformatics
Volume25
Issue number21
DOIs
StatePublished - Nov 9 2009
Externally publishedYes

ASJC Scopus subject areas

  • Statistics and Probability
  • Medicine(all)
  • Biochemistry
  • Molecular Biology
  • Computer Science Applications
  • Computational Theory and Mathematics
  • Computational Mathematics

Fingerprint Dive into the research topics of 'ISOLATE: A computational strategy for identifying the primary origin of cancers using high-throughput sequencing'. Together they form a unique fingerprint.

  • Cite this