Privacy protection versus cluster detection in spatial epidemiology

Karen L. Olson, Shaun J. Grannis, Kenneth D. Mandl

Research output: Contribution to journalArticle

76 Citations (Scopus)

Abstract

Objectives. Patient data that includes precise locations can reveal patients' identities, whereas data aggregated into administrative regions may preserve privacy and confidentiality. We investigated the effect of varying degrees of address precision (exact latitude and longitude vs the center points of zip code or census tracts) on detection of spatial clusters of cases. Methods. We simulated disease outbreaks by adding supplementary spatially clustered emergency department visits to authentic hospital emergency department syndromic surveillance data. We identified clusters with a spatial scan statistic and evaluated detection rate and accuracy. Results. More clusters were identified, and clusters were more accurately detected, when exact locations were used. That is, these clusters contained at least half of the simulated points and involved few additional emergency department visits. These results were especially apparent when the synthetic clustered points crossed administrative boundaries and fell into multiple zip code or census tracts. Conclusions. The spatial cluster detection algorithm performed better when addresses were analyzed as exact locations than when they were analyzed as center points of zip code or census tracts, particularly when the clustered points crossed administrative boundaries. Use of precise addresses offers improved performance, but this practice must be weighed against privacy concerns in the establishment of public health data exchange policies.

Original languageEnglish (US)
Pages (from-to)2002-2008
Number of pages7
JournalAmerican journal of public health
Volume96
Issue number11
DOIs
StatePublished - Nov 1 2006

Fingerprint

Privacy
Censuses
Hospital Emergency Service
Epidemiology
Confidentiality
Hospital Departments
Disease Outbreaks
Public Health

ASJC Scopus subject areas

  • Public Health, Environmental and Occupational Health

Cite this

Privacy protection versus cluster detection in spatial epidemiology. / Olson, Karen L.; Grannis, Shaun J.; Mandl, Kenneth D.

In: American journal of public health, Vol. 96, No. 11, 01.11.2006, p. 2002-2008.

Research output: Contribution to journalArticle

Olson, Karen L. ; Grannis, Shaun J. ; Mandl, Kenneth D. / Privacy protection versus cluster detection in spatial epidemiology. In: American journal of public health. 2006 ; Vol. 96, No. 11. pp. 2002-2008.
@article{edbbbf33f5cb451e8ff02b7c05d1d0cb,
title = "Privacy protection versus cluster detection in spatial epidemiology",
abstract = "Objectives. Patient data that includes precise locations can reveal patients' identities, whereas data aggregated into administrative regions may preserve privacy and confidentiality. We investigated the effect of varying degrees of address precision (exact latitude and longitude vs the center points of zip code or census tracts) on detection of spatial clusters of cases. Methods. We simulated disease outbreaks by adding supplementary spatially clustered emergency department visits to authentic hospital emergency department syndromic surveillance data. We identified clusters with a spatial scan statistic and evaluated detection rate and accuracy. Results. More clusters were identified, and clusters were more accurately detected, when exact locations were used. That is, these clusters contained at least half of the simulated points and involved few additional emergency department visits. These results were especially apparent when the synthetic clustered points crossed administrative boundaries and fell into multiple zip code or census tracts. Conclusions. The spatial cluster detection algorithm performed better when addresses were analyzed as exact locations than when they were analyzed as center points of zip code or census tracts, particularly when the clustered points crossed administrative boundaries. Use of precise addresses offers improved performance, but this practice must be weighed against privacy concerns in the establishment of public health data exchange policies.",
author = "Olson, {Karen L.} and Grannis, {Shaun J.} and Mandl, {Kenneth D.}",
year = "2006",
month = "11",
day = "1",
doi = "10.2105/AJPH.2005.069526",
language = "English (US)",
volume = "96",
pages = "2002--2008",
journal = "American Journal of Public Health",
issn = "0090-0036",
publisher = "American Public Health Association Inc.",
number = "11",

}

TY - JOUR

T1 - Privacy protection versus cluster detection in spatial epidemiology

AU - Olson, Karen L.

AU - Grannis, Shaun J.

AU - Mandl, Kenneth D.

PY - 2006/11/1

Y1 - 2006/11/1

N2 - Objectives. Patient data that includes precise locations can reveal patients' identities, whereas data aggregated into administrative regions may preserve privacy and confidentiality. We investigated the effect of varying degrees of address precision (exact latitude and longitude vs the center points of zip code or census tracts) on detection of spatial clusters of cases. Methods. We simulated disease outbreaks by adding supplementary spatially clustered emergency department visits to authentic hospital emergency department syndromic surveillance data. We identified clusters with a spatial scan statistic and evaluated detection rate and accuracy. Results. More clusters were identified, and clusters were more accurately detected, when exact locations were used. That is, these clusters contained at least half of the simulated points and involved few additional emergency department visits. These results were especially apparent when the synthetic clustered points crossed administrative boundaries and fell into multiple zip code or census tracts. Conclusions. The spatial cluster detection algorithm performed better when addresses were analyzed as exact locations than when they were analyzed as center points of zip code or census tracts, particularly when the clustered points crossed administrative boundaries. Use of precise addresses offers improved performance, but this practice must be weighed against privacy concerns in the establishment of public health data exchange policies.

AB - Objectives. Patient data that includes precise locations can reveal patients' identities, whereas data aggregated into administrative regions may preserve privacy and confidentiality. We investigated the effect of varying degrees of address precision (exact latitude and longitude vs the center points of zip code or census tracts) on detection of spatial clusters of cases. Methods. We simulated disease outbreaks by adding supplementary spatially clustered emergency department visits to authentic hospital emergency department syndromic surveillance data. We identified clusters with a spatial scan statistic and evaluated detection rate and accuracy. Results. More clusters were identified, and clusters were more accurately detected, when exact locations were used. That is, these clusters contained at least half of the simulated points and involved few additional emergency department visits. These results were especially apparent when the synthetic clustered points crossed administrative boundaries and fell into multiple zip code or census tracts. Conclusions. The spatial cluster detection algorithm performed better when addresses were analyzed as exact locations than when they were analyzed as center points of zip code or census tracts, particularly when the clustered points crossed administrative boundaries. Use of precise addresses offers improved performance, but this practice must be weighed against privacy concerns in the establishment of public health data exchange policies.

UR - http://www.scopus.com/inward/record.url?scp=33750490794&partnerID=8YFLogxK

UR - http://www.scopus.com/inward/citedby.url?scp=33750490794&partnerID=8YFLogxK

U2 - 10.2105/AJPH.2005.069526

DO - 10.2105/AJPH.2005.069526

M3 - Article

C2 - 17018828

AN - SCOPUS:33750490794

VL - 96

SP - 2002

EP - 2008

JO - American Journal of Public Health

JF - American Journal of Public Health

SN - 0090-0036

IS - 11

ER -