Assembly of the LongSHOT cohort: Public record linkage on a grand scale

Yifan Zhang, Erin E. Holsinger, Lea Prince, Jonathan A. Rodden, Sonja A. Swanson, Matthew M. Miller, Garen J. Wintemute, David M. Studdert

Research output: Contribution to journal › Article

Abstract

Background: Virtually all existing evidence linking access to firearms to elevated risks of mortality and morbidity comes from ecological and case-control studies. To improve understanding of the health risks and benefits of firearm ownership, we launched a cohort study: the Longitudinal Study of Handgun Ownership and Transfer (LongSHOT).

Methods: Using probabilistic matching techniques, we linked three sources of individual-level, state-wide data in California: official voter registration records, an archive of lawful handgun transactions and all-cause mortality data. There were nearly 28.8 million unique voter registrants, 5.5 million handgun transfers and 3.1 million deaths during the study period (18 October 2004 to 31 December 2016). The linkage relied on several identifying variables (first, middle and last names; date of birth; sex; residential address) that were available in all three data sets, deploying them in a series of bespoke algorithms.

Results: Assembly of the LongSHOT cohort commenced in January 2016 and was completed in March 2019. Approximately three-quarters of matches identified were exact matches on all link variables. The cohort consists of 28.8 million adult residents of California followed for up to 12.2 years. A total of 1.2 million cohort members purchased at least one handgun during the study period, and 1.6 million died.

Conclusions: Three steps taken early may be particularly useful in enhancing the efficiency of large-scale data linkage: thorough data cleaning; assessment of the suitability of off-the-shelf data linkage packages relative to bespoke coding; and careful consideration of the minimum sample size and matching precision needed to support rigorous investigation of the study questions.
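The abstract does not describe the bespoke algorithms themselves, so the following is only a minimal illustrative sketch of the general idea of linking records on the listed identifying variables: an exact pass on all link variables, followed by a fuzzy pass on names within blocks that share date of birth and sex. The field names, the `link` function, the 0.85 threshold and the use of Python's `difflib` are all assumptions made for illustration, not the LongSHOT method.

```python
from difflib import SequenceMatcher

# Hypothetical field names standing in for the link variables named in the abstract.
LINK_FIELDS = ("first", "middle", "last", "dob", "sex", "address")


def normalize(record):
    """Lowercase and trim every link field so trivial formatting differences do not block a match."""
    return {f: str(record.get(f, "")).strip().lower() for f in LINK_FIELDS}


def name_similarity(a, b):
    """Average string similarity (0 to 1) over first and last name."""
    return sum(SequenceMatcher(None, a[f], b[f]).ratio() for f in ("first", "last")) / 2


def link(voters, others, threshold=0.85):
    """Map each index in `others` to (index in `voters`, match type) when a link is found."""
    voters_n = [normalize(v) for v in voters]
    exact_index = {}  # all link fields -> first voter index with those values
    blocks = {}       # (dob, sex) -> voter indices, used to limit fuzzy comparisons
    for i, v in enumerate(voters_n):
        exact_index.setdefault(tuple(v[f] for f in LINK_FIELDS), i)
        blocks.setdefault((v["dob"], v["sex"]), []).append(i)

    links = {}
    for j, rec in enumerate(others):
        r = normalize(rec)
        key = tuple(r[f] for f in LINK_FIELDS)
        if key in exact_index:  # pass 1: exact agreement on every link variable
            links[j] = (exact_index[key], "exact")
            continue
        # Pass 2: compare names only against voters with the same date of birth and sex.
        candidates = blocks.get((r["dob"], r["sex"]), [])
        best = max(candidates, key=lambda i: name_similarity(voters_n[i], r), default=None)
        if best is not None and name_similarity(voters_n[best], r) >= threshold:
            links[j] = (best, "fuzzy")
    return links
```

Blocking on date of birth and sex keeps the number of pairwise name comparisons manageable, which matters at the scale of tens of millions of records; a real probabilistic linkage would typically also weight agreement on each field rather than rely on a single similarity cut-off.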

Original language: English (US)
Journal: Injury Prevention
DOI: https://doi.org/10.1136/injuryprev-2019-043385
State: Accepted/In press - Jan 1 2019

Keywords

  • cohort study
  • firearm
  • mortality
  • violence

ASJC Scopus subject areas

  • Public Health, Environmental and Occupational Health

Cite this

Zhang, Y., Holsinger, E. E., Prince, L., Rodden, J. A., Swanson, S. A., Miller, M. M., ... Studdert, D. M. (Accepted/In press). Assembly of the LongSHOT cohort: Public record linkage on a grand scale. Injury Prevention. https://doi.org/10.1136/injuryprev-2019-043385

@article{61980c8ca21c4608a890ca1af0168b62,
title = "Assembly of the LongSHOT cohort: Public record linkage on a grand scale",
keywords = "cohort study, firearm, mortality, violence",
author = "Yifan Zhang and Holsinger, {Erin E.} and Lea Prince and Rodden, {Jonathan A.} and Swanson, {Sonja A.} and Miller, {Matthew M.} and Wintemute, {Garen J.} and Studdert, {David M.}",
year = "2019",
month = "1",
day = "1",
doi = "10.1136/injuryprev-2019-043385",
language = "English (US)",
journal = "Injury Prevention",
issn = "1353-8047",
publisher = "BMJ Publishing Group",

}

TY - JOUR
T1 - Assembly of the LongSHOT cohort
T2 - Public record linkage on a grand scale
AU - Zhang, Yifan
AU - Holsinger, Erin E.
AU - Prince, Lea
AU - Rodden, Jonathan A.
AU - Swanson, Sonja A.
AU - Miller, Matthew M.
AU - Wintemute, Garen J.
AU - Studdert, David M.
PY - 2019/1/1
Y1 - 2019/1/1
KW - cohort study
KW - firearm
KW - mortality
KW - violence
UR - http://www.scopus.com/inward/record.url?scp=85074356702&partnerID=8YFLogxK
UR - http://www.scopus.com/inward/citedby.url?scp=85074356702&partnerID=8YFLogxK
U2 - 10.1136/injuryprev-2019-043385
DO - 10.1136/injuryprev-2019-043385
M3 - Article
AN - SCOPUS:85074356702
JO - Injury Prevention
JF - Injury Prevention
SN - 1353-8047
ER -