Reproducible bioinformatics research for biologists

Likit Preeyanon, Alexis Black Pyrkosz, Charles Brown

Research output: Chapter in Book/Report/Conference proceeding > Chapter

3 Citations (Scopus)

Abstract

At the dawn of computational biology in the 1960s, datasets were small. Protein sequences were first distributed in the printed Dayhoff atlases [29] and later on CD-ROM, with bioinformaticians eyeballing entire datasets and shuffling data by hand. By the 1990s, bioinformaticians were using spreadsheet programs and scientific software packages to analyze increasingly large datasets that included several phage and bacterial genomes. In 2003, the pregenomic era ended with the online publication of the human genome [7,14,26] and the National Institutes of Health invested heavily in sequencing related organisms to aid in annotation. By the mid-2000s, Sanger sequencing was replaced by faster and cheaper next-generation sequencing technologies, resulting in an explosion of data, with bioinformaticians racing to develop automated and scalable computational tools to analyze and mine it [3].

Original language: English (US)
Title of host publication: Implementing Reproducible Research
Publisher: CRC Press
Pages: 185-218
Number of pages: 34
ISBN (Electronic): 9781466561601
ISBN (Print): 9781466561595
DOI: https://doi.org/10.1201/b16868
State: Published - Jan 1 2014
Externally published: Yes

ASJC Scopus subject areas

  • Mathematics(all)

Cite this

Preeyanon, L., Pyrkosz, A. B., & Brown, C. (2014). Reproducible bioinformatics research for biologists. In Implementing Reproducible Research (pp. 185-218). CRC Press. https://doi.org/10.1201/b16868

@inbook{3ca2342ab674403c9e18027c20e521a4,
title = "Reproducible bioinformatics research for biologists",
abstract = "At the dawn of computational biology in the 1960s, datasets were small. Protein sequences were first distributed in the printed Dayhoff atlases [29] and later on CD-ROM, with bioinformaticians eyeballing entire datasets and shuffling data by hand. By the 1990s, bioinformaticians were using spreadsheet programs and scientific software packages to analyze increasingly large datasets that included several phage and bacterial genomes. In 2003, the pregenomic era ended with the online publication of the human genome [7,14,26] and the National Institutes of Health invested heavily in sequencing related organisms to aid in annotation. By the mid-2000s, Sanger sequencing was replaced by faster and cheaper next-generation sequencing technologies, resulting in an explosion of data, with bioinformaticians racing to develop automated and scalable computational tools to analyze and mine it [3].",
author = "Likit Preeyanon and Pyrkosz, {Alexis Black} and Charles Brown",
year = "2014",
month = "1",
day = "1",
doi = "10.1201/b16868",
language = "English (US)",
isbn = "9781466561595",
pages = "185--218",
booktitle = "Implementing Reproducible Research",
publisher = "CRC Press",
}

TY - CHAP

T1 - Reproducible bioinformatics research for biologists

AU - Preeyanon, Likit

AU - Pyrkosz, Alexis Black

AU - Brown, Charles

PY - 2014/1/1

Y1 - 2014/1/1

AB - At the dawn of computational biology in the 1960s, datasets were small. Protein sequences were first distributed in the printed Dayhoff atlases [29] and later on CD-ROM, with bioinformaticians eyeballing entire datasets and shuffling data by hand. By the 1990s, bioinformaticians were using spreadsheet programs and scientific software packages to analyze increasingly large datasets that included several phage and bacterial genomes. In 2003, the pregenomic era ended with the online publication of the human genome [7,14,26] and the National Institutes of Health invested heavily in sequencing related organisms to aid in annotation. By the mid-2000s, Sanger sequencing was replaced by faster and cheaper next-generation sequencing technologies, resulting in an explosion of data, with bioinformaticians racing to develop automated and scalable computational tools to analyze and mine it [3].

UR - http://www.scopus.com/inward/record.url?scp=85050151696&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85050151696&partnerID=8YFLogxK

U2 - 10.1201/b16868

DO - 10.1201/b16868

M3 - Chapter

SN - 9781466561595

SP - 185

EP - 218

BT - Implementing Reproducible Research

PB - CRC Press

ER -