Privacy-protecting, reliable response data discovery using COVID-19 patient observations

R2D2 Consortium

Research output: Contribution to journalArticlepeer-review

1 Scopus citations


Objective: To utilize, in an individual and institutional privacy-preserving manner, electronic health record (EHR) data from 202 hospitals by analyzing answers to COVID-19-related questions and posting these answers online. Materials and Methods: We developed a distributed, federated network of 12 health systems that harmonized their EHRs and submitted aggregate answers to consortia questions posted at Our consortium developed processes and implemented distributed algorithms to produce answers to a variety of questions. We were able to generate counts, descriptive statistics, and build a multivariate, iterative regression model without centralizing individual-level data. Results: Our public website contains answers to various clinical questions, a web form for users to ask questions in natural language, and a list of items that are currently pending responses. The results show, for example, that patients who were taking angiotensin-converting enzyme inhibitors and angiotensin II receptor blockers, within the year before admission, had lower unadjusted in-hospital mortality rates. We also showed that, when adjusted for, age, sex, and ethnicity were not significantly associated with mortality. We demonstrated that it is possible to answer questions about COVID-19 using EHR data from systems that have different policies and must follow various regulations, without moving data out of their health systems. Discussion and Conclusions: We present an alternative or a complement to centralized COVID-19 registries of EHR data. We can use multivariate distributed logistic regression on observations recorded in the process of care to generate results without transferring individual-level data outside the health systems.

Original languageEnglish (US)
Pages (from-to)1765-1776
Number of pages12
JournalJournal of the American Medical Informatics Association
Issue number8
StatePublished - Aug 2021


  • common data elements
  • COVID-19
  • electronic health record
  • observational study
  • regression analysis

ASJC Scopus subject areas

  • Health Informatics


Dive into the research topics of 'Privacy-protecting, reliable response data discovery using COVID-19 patient observations'. Together they form a unique fingerprint.

Cite this