Comparison of methods for classifying Hispanic ethnicity in a population-based cancer registry

Susan L. Stewart, Karen C. Swallen, Sally L. Glaser, Pamela L. Horn-Ross, Dee W. West

Research output: Contribution to journal (Article)

101 Citations (Scopus)

Abstract

The accuracy of ethnic classification can substantially affect ethnic-specific cancer statistics. In the Greater Bay Area Cancer Registry, which is part of the Surveillance, Epidemiology, and End Results (SEER) Program and of the statewide California Cancer Registry, Hispanic ethnicity is determined by medical record review and by matching to surname lists. This study compared these classification methods with self-report. Ethnic self-identification was obtained by surveying 1,154 area residents aged 20-89 years who were diagnosed with cancer in 1990 and were reported to the registry as being Hispanic or White non-Hispanic. Predictive value positive, sensitivity, and relative bias were used to assess the accuracy of Hispanic classification by medical record and surname. Among those persons classified as Hispanic by either or both of these sources, only two-thirds agreed (predictive value positive = 66%), and many self-identified Hispanics were classified incorrectly (sensitivity = 68%). Classification based on either medical record or surname alone had a lower sensitivity (59% and 61%, respectively) but a higher predictive value positive (77% and 70%, respectively). Ethnic classification by medical record alone resulted in an underestimate of Hispanic cancer cases and incidence rates. Bias was reduced when medical records and surnames were used together to classify cancer cases as Hispanic.
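
The three accuracy measures named in the abstract can be illustrated with a minimal sketch. It treats self-report as the reference standard, as the study does, but everything else here is an assumption: the `TwoByTwo` container, the function names, and the counts are hypothetical and are not taken from the paper. In particular, the relative bias formula used below (classified count minus self-reported count, divided by self-reported count) is one common definition and may differ from the one the authors used.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TwoByTwo:
    """Registry classification cross-tabulated against self-report (reference)."""
    true_pos: int   # registry: Hispanic,     self-report: Hispanic
    false_pos: int  # registry: Hispanic,     self-report: non-Hispanic
    false_neg: int  # registry: non-Hispanic, self-report: Hispanic
    true_neg: int   # registry: non-Hispanic, self-report: non-Hispanic


def predictive_value_positive(t: TwoByTwo) -> float:
    """Share of registry-classified Hispanics who self-identify as Hispanic."""
    return t.true_pos / (t.true_pos + t.false_pos)


def sensitivity(t: TwoByTwo) -> float:
    """Share of self-identified Hispanics whom the registry classifies as Hispanic."""
    return t.true_pos / (t.true_pos + t.false_neg)


def relative_bias(t: TwoByTwo) -> float:
    """Assumed definition: (classified - self-reported) / self-reported.

    Negative values mean the registry undercounts Hispanic cases.
    """
    classified = t.true_pos + t.false_pos
    self_reported = t.true_pos + t.false_neg
    return (classified - self_reported) / self_reported


# Hypothetical counts chosen for easy arithmetic; NOT data from the study.
table = TwoByTwo(true_pos=60, false_pos=20, false_neg=40, true_neg=880)

print(f"PPV           = {predictive_value_positive(table):.2f}")
print(f"Sensitivity   = {sensitivity(table):.2f}")
print(f"Relative bias = {relative_bias(table):+.2f}")
```

Under this assumed definition, relative bias equals sensitivity divided by predictive value positive, minus 1, so a method whose sensitivity falls below its predictive value positive undercounts cases. This matches the direction of the abstract's finding for classification by medical record alone.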

Original language: English (US)
Pages (from-to): 1063-1071
Number of pages: 9
Journal: American Journal of Epidemiology
Volume: 149
Issue number: 11
State: Published - Jun 1 1999
Externally published: Yes

Fingerprint

Hispanic Americans
Registries
Medical Records
Population
Neoplasms
SEER Program
Self Report
Incidence

Keywords

  • Bias (epidemiology)
  • Classification
  • Ethnic groups
  • Hispanic Americans
  • Incidence
  • Neoplasms
  • Population studies
  • SEER program

ASJC Scopus subject areas

  • Epidemiology

Cite this

Stewart, S. L., Swallen, K. C., Glaser, S. L., Horn-Ross, P. L., & West, D. W. (1999). Comparison of methods for classifying Hispanic ethnicity in a population-based cancer registry. American Journal of Epidemiology, 149(11), 1063-1071.

@article{a09af50d17df4505b9c7aa86aa6492f3,
title = "Comparison of methods for classifying Hispanic ethnicity in a population-based cancer registry",
keywords = "Bias (epidemiology), Classification, Ethnic groups, Hispanic Americans, Incidence, Neoplasms, Population studies, SEER program",
author = "Stewart, {Susan L} and Swallen, {Karen C.} and Glaser, {Sally L.} and Horn-Ross, {Pamela L.} and West, {Dee W.}",
year = "1999",
month = "6",
day = "1",
language = "English (US)",
volume = "149",
pages = "1063--1071",
journal = "American Journal of Epidemiology",
issn = "0002-9262",
publisher = "Oxford University Press",
number = "11",

}

TY - JOUR

T1 - Comparison of methods for classifying Hispanic ethnicity in a population-based cancer registry

AU - Stewart, Susan L

AU - Swallen, Karen C.

AU - Glaser, Sally L.

AU - Horn-Ross, Pamela L.

AU - West, Dee W.

PY - 1999/6/1

Y1 - 1999/6/1

KW - Bias (epidemiology)

KW - Classification

KW - Ethnic groups

KW - Hispanic Americans

KW - Incidence

KW - Neoplasms

KW - Population studies

KW - SEER program

UR - http://www.scopus.com/inward/record.url?scp=0033151913&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=0033151913&partnerID=8YFLogxK

M3 - Article

C2 - 10355383

AN - SCOPUS:0033151913

VL - 149

SP - 1063

EP - 1071

JO - American Journal of Epidemiology

JF - American Journal of Epidemiology

SN - 0002-9262

IS - 11

ER -