Reproducible bioinformatics research for biologists

Likit Preeyanon, Alexis Black Pyrkosz, Charles Brown

Research output: Chapter in Book/Report/Conference proceeding > Chapter

3 Scopus citations

Abstract

At the dawn of computational biology in the 1960s, datasets were small. Protein sequences were first distributed in the printed Dayhoff atlases [29] and later on CD-ROM, with bioinformaticians eyeballing entire datasets and shuffling data by hand. By the 1990s, bioinformaticians were using spreadsheet programs and scientific software packages to analyze increasingly large datasets that included several phage and bacterial genomes. In 2003, the pregenomic era ended with the online publication of the human genome [7,14,26], and the National Institutes of Health invested heavily in sequencing related organisms to aid in annotation. By the mid-2000s, Sanger sequencing was replaced by faster and cheaper next-generation sequencing technologies, resulting in an explosion of data, with bioinformaticians racing to develop automated and scalable computational tools to analyze and mine it [3].

Original language: English (US)
Title of host publication: Implementing Reproducible Research
Publisher: CRC Press
Pages: 185-218
Number of pages: 34
ISBN (Electronic): 9781466561601
ISBN (Print): 9781466561595
State: Published - Jan 1 2014
Externally published: Yes

ASJC Scopus subject areas

  • Mathematics (all)

Cite this

Preeyanon, L., Pyrkosz, A. B., & Brown, C. (2014). Reproducible bioinformatics research for biologists. In Implementing Reproducible Research (pp. 185-218). CRC Press. https://doi.org/10.1201/b16868