Power to Detect Spatial Disturbances under Different Levels of Geographic Aggregation

Caroline Jeffery, Al Ozonoff, Laura F. White, Miriam A Nuno, Marcello Pagano

Research output: Contribution to journal › Article

16 Citations (Scopus)

Abstract

Objective: Spatial and/or temporal surveillance systems are designed to monitor the ongoing appearance of disease cases in space and time, and to detect potential disturbances in either dimension. Patient addresses are sometimes reported at some level of geographic aggregation, for example by ZIP code or census tract. While this aggregation has the advantage of protecting patient privacy, it also risks compromising statistical efficiency. This paper investigated the variation in power to detect a change in the spatial distribution in the presence of spatial aggregation.

Methods: The authors generated 400,000 spatial datasets with varying location and spread of simulated spatial disturbances, both on a purely synthetic uniform population and on a heterogeneous population representing hospital admissions to three community hospitals in Cape Cod, Massachusetts. The authors evaluated the power of the M-statistic to detect spatial disturbances, comparing the use of exact spatial locations versus twelve different levels of aggregation. The M-statistic is a comparison of two distributions of interpoint distances between locations.

Results: When the spread of simulated spatial disturbances was confined to a small portion of the study region or affected a large proportion of the population at risk, power was highest when exact locations were reported. If the spatial disturbance was a more modest signal, the best power was attained at an aggregated level.

Conclusions: The precision at which patients' locations are reported has the potential to affect the power of detection significantly.
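The idea of comparing two distributions of interpoint distances, with and without geographic aggregation, can be illustrated with a minimal Python sketch. It is not the authors' M-statistic or their simulation design: the discrepancy below is a plain sum of squared differences between binned distance proportions, and the square grid in aggregate_to_grid is a hypothetical stand-in for ZIP code or census tract centroids. All function names, the bin edges, and the simulated cluster are assumptions made for illustration.

```python
import math
import random
from itertools import combinations


def interpoint_distances(points):
    """Return all pairwise Euclidean distances between 2-D points."""
    return [math.dist(p, q) for p, q in combinations(points, 2)]


def binned_proportions(distances, edges):
    """Proportion of distances falling in each bin [edges[i], edges[i+1]).

    Distances outside the edges are left uncounted. Needs at least one distance.
    """
    counts = [0] * (len(edges) - 1)
    for d in distances:
        for i in range(len(counts)):
            if edges[i] <= d < edges[i + 1]:
                counts[i] += 1
                break
    return [c / len(distances) for c in counts]


def aggregate_to_grid(points, cell):
    """Replace each point by the centre of its square grid cell.

    Assumption: a regular grid stands in for ZIP code or census tract areas.
    """
    return [((math.floor(x / cell) + 0.5) * cell,
             (math.floor(y / cell) + 0.5) * cell) for x, y in points]


def distance_discrepancy(baseline, observed, edges):
    """Sum of squared differences between binned distance distributions.

    A simplified discrepancy for illustration, not the M-statistic itself.
    """
    p = binned_proportions(interpoint_distances(baseline), edges)
    q = binned_proportions(interpoint_distances(observed), edges)
    return sum((a - b) ** 2 for a, b in zip(p, q))


if __name__ == "__main__":
    rng = random.Random(0)
    edges = [i * 0.1 for i in range(16)]  # bins covering distances in the unit square
    # Baseline: uniform population on the unit square.
    baseline = [(rng.random(), rng.random()) for _ in range(200)]
    # Observed: mostly uniform, plus a tight simulated cluster (the "disturbance").
    observed = [(rng.random(), rng.random()) for _ in range(150)]
    observed += [(rng.gauss(0.3, 0.03), rng.gauss(0.3, 0.03)) for _ in range(50)]

    print("exact locations:", distance_discrepancy(baseline, observed, edges))
    for cell in (0.05, 0.25):
        d = distance_discrepancy(aggregate_to_grid(baseline, cell),
                                 aggregate_to_grid(observed, cell), edges)
        print(f"grid cell {cell}:", d)
```

Turning such a discrepancy into a power estimate would additionally require a null distribution, for example from repeated simulation, which this sketch leaves out.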

Original language: English (US)
Pages (from-to): 847-854
Number of pages: 8
Journal: Journal of the American Medical Informatics Association
Volume: 16
Issue number: 6
DOI: 10.1197/jamia.M2788
State: Published - Nov 1 2009
Externally published: Yes

Fingerprint

Privacy
Community Hospital
Censuses
Population
Datasets

ASJC Scopus subject areas

  • Health Informatics

Cite this

Power to Detect Spatial Disturbances under Different Levels of Geographic Aggregation. / Jeffery, Caroline; Ozonoff, Al; White, Laura F.; Nuno, Miriam A; Pagano, Marcello.

In: Journal of the American Medical Informatics Association, Vol. 16, No. 6, 01.11.2009, p. 847-854.

@article{3ce59a41325444598ae568e8b1dcbcbc,
title = "Power to Detect Spatial Disturbances under Different Levels of Geographic Aggregation",
author = "Jeffery, Caroline and Ozonoff, Al and White, Laura F. and Nuno, Miriam A and Pagano, Marcello",
year = "2009",
month = "11",
day = "1",
doi = "10.1197/jamia.M2788",
language = "English (US)",
volume = "16",
pages = "847--854",
journal = "Journal of the American Medical Informatics Association : JAMIA",
issn = "1067-5027",
publisher = "Oxford University Press",
number = "6",

}

TY  - JOUR
T1  - Power to Detect Spatial Disturbances under Different Levels of Geographic Aggregation
AU  - Jeffery, Caroline
AU  - Ozonoff, Al
AU  - White, Laura F.
AU  - Nuno, Miriam A
AU  - Pagano, Marcello
PY  - 2009/11/1
Y1  - 2009/11/1
UR  - http://www.scopus.com/inward/record.url?scp=70350465156&partnerID=8YFLogxK
UR  - http://www.scopus.com/inward/citedby.url?scp=70350465156&partnerID=8YFLogxK
U2  - 10.1197/jamia.M2788
DO  - 10.1197/jamia.M2788
M3  - Article
C2  - 19717807
AN  - SCOPUS:70350465156
VL  - 16
SP  - 847
EP  - 854
JO  - Journal of the American Medical Informatics Association : JAMIA
JF  - Journal of the American Medical Informatics Association : JAMIA
SN  - 1067-5027
IS  - 6
ER  -