From short reads to chromosome-scale genome assemblies

Kyle Fletcher, Richard W Michelmore

Research output: Chapter in Book/Report/Conference proceedingChapter

Abstract

A high-quality, annotated genome assembly is the foundation for many downstream studies. However, obtaining such an assembly is a complex, reiterative process that requires the assimilation of high-quality data and combines different approaches and data types. While some software packages incorporating multiple steps of genome assembly are commercially available, they may not be flexible enough to be routinely applied to all organisms, particularly to nonmodel species such as pathogenic oomycetes and fungi. If researchers understand and apply the most appropriate, currently available tools for each step, it is possible to customize parameters and optimize results for their organism of study. Based on our experience of de novo assembly and annotation of several oomycete species, this chapter provides a modular workflow from processing of raw reads, to initial assembly generation, through optimization, chromosome-scale scaffolding and annotation, outlining input and output data as well as examples and alternative software used for each step. The accompanying Notes provide background information for each step as well as alternative options. The final result of this workflow could be an annotated, high-quality, validated, chromosome-scale assembly or a draft assembly of sufficient quality to meet specific needs of a project.

Original languageEnglish (US)
Title of host publicationMethods in Molecular Biology
PublisherHumana Press Inc.
Pages151-197
Number of pages47
DOIs
StatePublished - Jan 1 2018

Publication series

NameMethods in Molecular Biology
Volume1848
ISSN (Print)1064-3745

Fingerprint

Oomycetes
Workflow
Software
Chromosomes
Genome
Fungi
Research Personnel
Data Accuracy

Keywords

  • Assembly quality
  • Contig scaffolding
  • Genome annotation
  • Genome assembly
  • Hi-C
  • Next-generation sequencing
  • Optical maps

ASJC Scopus subject areas

  • Molecular Biology
  • Genetics

Cite this

Fletcher, K., & Michelmore, R. W. (2018). From short reads to chromosome-scale genome assemblies. In Methods in Molecular Biology (pp. 151-197). (Methods in Molecular Biology; Vol. 1848). Humana Press Inc.. https://doi.org/10.1007/978-1-4939-8724-5_13

From short reads to chromosome-scale genome assemblies. / Fletcher, Kyle; Michelmore, Richard W.

Methods in Molecular Biology. Humana Press Inc., 2018. p. 151-197 (Methods in Molecular Biology; Vol. 1848).

Research output: Chapter in Book/Report/Conference proceedingChapter

Fletcher, K & Michelmore, RW 2018, From short reads to chromosome-scale genome assemblies. in Methods in Molecular Biology. Methods in Molecular Biology, vol. 1848, Humana Press Inc., pp. 151-197. https://doi.org/10.1007/978-1-4939-8724-5_13
Fletcher K, Michelmore RW. From short reads to chromosome-scale genome assemblies. In Methods in Molecular Biology. Humana Press Inc. 2018. p. 151-197. (Methods in Molecular Biology). https://doi.org/10.1007/978-1-4939-8724-5_13
Fletcher, Kyle ; Michelmore, Richard W. / From short reads to chromosome-scale genome assemblies. Methods in Molecular Biology. Humana Press Inc., 2018. pp. 151-197 (Methods in Molecular Biology).
@inbook{c2bd733ed0cc4c5ab7eb1c47d95fdfb7,
title = "From short reads to chromosome-scale genome assemblies",
abstract = "A high-quality, annotated genome assembly is the foundation for many downstream studies. However, obtaining such an assembly is a complex, reiterative process that requires the assimilation of high-quality data and combines different approaches and data types. While some software packages incorporating multiple steps of genome assembly are commercially available, they may not be flexible enough to be routinely applied to all organisms, particularly to nonmodel species such as pathogenic oomycetes and fungi. If researchers understand and apply the most appropriate, currently available tools for each step, it is possible to customize parameters and optimize results for their organism of study. Based on our experience of de novo assembly and annotation of several oomycete species, this chapter provides a modular workflow from processing of raw reads, to initial assembly generation, through optimization, chromosome-scale scaffolding and annotation, outlining input and output data as well as examples and alternative software used for each step. The accompanying Notes provide background information for each step as well as alternative options. The final result of this workflow could be an annotated, high-quality, validated, chromosome-scale assembly or a draft assembly of sufficient quality to meet specific needs of a project.",
keywords = "Assembly quality, Contig scaffolding, Genome annotation, Genome assembly, Hi-C, Next-generation sequencing, Optical maps",
author = "Kyle Fletcher and Michelmore, {Richard W}",
year = "2018",
month = "1",
day = "1",
doi = "10.1007/978-1-4939-8724-5_13",
language = "English (US)",
series = "Methods in Molecular Biology",
publisher = "Humana Press Inc.",
pages = "151--197",
booktitle = "Methods in Molecular Biology",

}

TY - CHAP

T1 - From short reads to chromosome-scale genome assemblies

AU - Fletcher, Kyle

AU - Michelmore, Richard W

PY - 2018/1/1

Y1 - 2018/1/1

N2 - A high-quality, annotated genome assembly is the foundation for many downstream studies. However, obtaining such an assembly is a complex, reiterative process that requires the assimilation of high-quality data and combines different approaches and data types. While some software packages incorporating multiple steps of genome assembly are commercially available, they may not be flexible enough to be routinely applied to all organisms, particularly to nonmodel species such as pathogenic oomycetes and fungi. If researchers understand and apply the most appropriate, currently available tools for each step, it is possible to customize parameters and optimize results for their organism of study. Based on our experience of de novo assembly and annotation of several oomycete species, this chapter provides a modular workflow from processing of raw reads, to initial assembly generation, through optimization, chromosome-scale scaffolding and annotation, outlining input and output data as well as examples and alternative software used for each step. The accompanying Notes provide background information for each step as well as alternative options. The final result of this workflow could be an annotated, high-quality, validated, chromosome-scale assembly or a draft assembly of sufficient quality to meet specific needs of a project.

AB - A high-quality, annotated genome assembly is the foundation for many downstream studies. However, obtaining such an assembly is a complex, reiterative process that requires the assimilation of high-quality data and combines different approaches and data types. While some software packages incorporating multiple steps of genome assembly are commercially available, they may not be flexible enough to be routinely applied to all organisms, particularly to nonmodel species such as pathogenic oomycetes and fungi. If researchers understand and apply the most appropriate, currently available tools for each step, it is possible to customize parameters and optimize results for their organism of study. Based on our experience of de novo assembly and annotation of several oomycete species, this chapter provides a modular workflow from processing of raw reads, to initial assembly generation, through optimization, chromosome-scale scaffolding and annotation, outlining input and output data as well as examples and alternative software used for each step. The accompanying Notes provide background information for each step as well as alternative options. The final result of this workflow could be an annotated, high-quality, validated, chromosome-scale assembly or a draft assembly of sufficient quality to meet specific needs of a project.

KW - Assembly quality

KW - Contig scaffolding

KW - Genome annotation

KW - Genome assembly

KW - Hi-C

KW - Next-generation sequencing

KW - Optical maps

UR - http://www.scopus.com/inward/record.url?scp=85053003650&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=85053003650&partnerID=8YFLogxK

U2 - 10.1007/978-1-4939-8724-5_13

DO - 10.1007/978-1-4939-8724-5_13

M3 - Chapter

T3 - Methods in Molecular Biology

SP - 151

EP - 197

BT - Methods in Molecular Biology

PB - Humana Press Inc.

ER -