Design and coverage of high throughput genotyping arrays optimized for individuals of East Asian, African American, and Latino race/ethnicity using imputation and a novel hybrid SNP selection algorithm

Thomas J. Hoffmann, Yiping Zhan, Mark N. Kvale, Stephanie E. Hesselson, Jeremy Gollub, Carlos Iribarren, Yontao Lu, Gangwu Mei, Matthew M. Purdy, Charles Quesenberry, Sarah Rowell, Michael H. Shapero, David Smethurst, Carol P. Somkin, Stephen K. Van den Eeden, Larry Walter, Teresa Webster, Rachel Whitmer, Andrea Finn, Catherine Schaefer, Pui Yan Kwok, Neil Risch

Research output: Contribution to journal › Article

85 Scopus citations

Abstract

Four custom Axiom genotyping arrays were designed for a genome-wide association (GWA) study of 100,000 participants from the Kaiser Permanente Research Program on Genes, Environment and Health. The array optimized for individuals of European race/ethnicity was previously described. Here we detail the development of three additional microarrays optimized for individuals of East Asian, African American, and Latino race/ethnicity. For these arrays, we decreased redundancy of high-performing SNPs to increase SNP capacity. The East Asian array was designed using greedy pairwise SNP selection. However, removing SNPs from the target set based on imputation coverage is more efficient than pairwise tagging. Therefore, we developed a novel hybrid SNP selection method for the African American and Latino arrays utilizing rounds of greedy pairwise SNP selection, followed by removal from the target set of SNPs covered by imputation. The arrays provide excellent genome-wide coverage and are valuable additions for large-scale GWA studies.
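
The abstract describes the hybrid method only at a high level. As a rough illustration, and not the authors' implementation, the Python sketch below alternates rounds of greedy pairwise tagging with a step that removes targets already covered by imputation. Everything specific here is an assumption made for the example: the r² threshold of 0.8, the `picks_per_round` and `capacity` parameters, the toy linkage-disequilibrium values, and the hypothetical `imputed_covered` callable, which stands in for a real imputation run against a reference panel.

```python
from typing import Callable, Dict, FrozenSet, List, Set, Tuple

R2 = Dict[FrozenSet[str], float]  # pairwise r^2 keyed by an unordered SNP pair


def greedy_pairwise_round(targets: Set[str], candidates: Set[str], r2: R2,
                          threshold: float, picks: int) -> Tuple[List[str], Set[str]]:
    """Pick up to `picks` SNPs, each time taking the one that tags the most remaining targets."""
    chosen: List[str] = []
    remaining = set(targets)
    for _ in range(picks):
        best, best_cov = None, set()
        for c in sorted(candidates - set(chosen)):  # sorted for deterministic ties
            cov = {t for t in remaining
                   if t == c or r2.get(frozenset((c, t)), 0.0) >= threshold}
            if len(cov) > len(best_cov):
                best, best_cov = c, cov
        if best is None:  # no candidate tags anything that is left
            break
        chosen.append(best)
        remaining -= best_cov
    return chosen, remaining


def hybrid_select(targets: Set[str], candidates: Set[str], r2: R2,
                  imputed_covered: Callable[[Set[str], Set[str]], Set[str]],
                  threshold: float = 0.8, picks_per_round: int = 2,
                  capacity: int = 10) -> Tuple[List[str], Set[str]]:
    """Alternate greedy pairwise rounds with removal of imputation-covered targets."""
    selected: List[str] = []
    remaining = set(targets)
    while remaining and len(selected) < capacity:
        budget = min(picks_per_round, capacity - len(selected))
        chosen, remaining = greedy_pairwise_round(
            remaining, candidates - set(selected), r2, threshold, budget)
        if not chosen:
            break
        selected.extend(chosen)
        # Hypothetical imputation step: drop targets the current selection already covers.
        remaining -= imputed_covered(set(selected), remaining)
    return selected, remaining


# Toy LD structure (invented values): A tags B, C tags D, E is only moderately
# correlated with A and with C.
r2_toy: R2 = {
    frozenset(("A", "B")): 0.90,
    frozenset(("C", "D")): 0.85,
    frozenset(("A", "E")): 0.60,
    frozenset(("C", "E")): 0.60,
}
snps = {"A", "B", "C", "D", "E"}


def toy_imputed(selected: Set[str], remaining: Set[str]) -> Set[str]:
    """Stand-in for imputation: covered if two selected SNPs each have r^2 >= 0.5."""
    return {t for t in remaining
            if sum(r2_toy.get(frozenset((s, t)), 0.0) >= 0.5 for s in selected) >= 2}


selected, uncovered = hybrid_select(snps, snps, r2_toy, toy_imputed)
pairwise_only, _ = hybrid_select(snps, snps, r2_toy, lambda s, r: set())
print(selected, uncovered, pairwise_only)
```

In this toy case the hybrid run stops after two SNPs because E is treated as imputable from A and C together, whereas the pairwise-only run has to spend a third slot on E. That is the kind of capacity saving the abstract attributes to removing imputation-covered SNPs from the target set.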

Original language: English (US)
Pages (from-to): 422-430
Number of pages: 9
Journal: Genomics
Volume: 98
Issue number: 6
DOI: https://doi.org/10.1016/j.ygeno.2011.08.007
State: Published - Dec 1 2011
Externally published: Yes

Keywords

  • Coverage
  • Genome-wide association study
  • Imputation
  • Microarray
  • Single nucleotide polymorphism
  • Throughput

ASJC Scopus subject areas

  • Genetics

Cite this

Hoffmann, T. J., Zhan, Y., Kvale, M. N., Hesselson, S. E., Gollub, J., Iribarren, C., Lu, Y., Mei, G., Purdy, M. M., Quesenberry, C., Rowell, S., Shapero, M. H., Smethurst, D., Somkin, C. P., Van den Eeden, S. K., Walter, L., Webster, T., Whitmer, R., Finn, A., ... Risch, N. (2011). Design and coverage of high throughput genotyping arrays optimized for individuals of East Asian, African American, and Latino race/ethnicity using imputation and a novel hybrid SNP selection algorithm. Genomics, 98(6), 422-430. https://doi.org/10.1016/j.ygeno.2011.08.007