National COVID Cohort Collaborative Data Enclave

Alternate Title(s): N3C Data Enclave
UID: 10384

The National Center for Advancing Translational Sciences (NCATS) has systematically compiled clinical, laboratory, and diagnostic data from electronic health records in the National COVID Cohort Collaborative (N3C) Data Enclave to support COVID-19 research. As of August 2, 2022, the repository contains information on over 15 million patients (including 5.8 million COVID-19-positive patients) across the United States.

The N3C Data Enclave is organized into 3 levels of data with varying access restrictions:

  • Synthetic dataset: Contains no protected health information (PHI). This is a statistically comparable artificial dataset derived from the original dataset.
    • Can be requested by: Researchers from US-based or foreign institutions, and citizen scientists
  • De-identified dataset: Contains no PHI. This dataset consists of real patient data with shifted dates of service and truncated ZIP codes of patients residing in areas with populations above 20,000.
    • Can be requested by: Researchers from US-based or foreign institutions
  • Limited Data Set (LDS): Contains 2 PHI elements (dates of service and patient ZIP code). This dataset consists of real patient data.
    • Can be requested by: Researchers from US-based institutions only

2020 - Present
Geographic Coverage
United States
Population Age
Newborn (under 1 month)
Infant (1 month - 23 months)
Child (2 years - 12 years)
Adolescent (13 years - 18 years)
Adult (19 years - 64 years)
Senior (65 years - 79 years)
Aged (80 years and over)


Free to All
Application Required

N3C data is available at no cost through institutions that have executed a Data Use Agreement (DUA) with the N3C. A list of participating institutions can be found here.

NYU Grossman School of Medicine has executed a DUA with the N3C, so researchers at the institution may register for an N3C account to begin the onboarding process. After creating an account, researchers must log in and complete all mandatory training. Human subjects research training is required to access the de-identified dataset and the Limited Data Set.

Next, researchers must submit the N3C Data Use Request Form (see detailed instructions) and attest to the Data Use Agreement and the N3C Data User Code of Conduct. Researchers requesting access to the Limited Data Set must also submit an IRB determination letter. The N3C Data Access Committee reviews each Data Use Request Form and sends its decision by email, along with further instructions for accessing the data.

Access via N3C

Access links and instructions

PubMed Search
View articles which use this dataset
Other Resources
N3C External Dataset Registry

Public datasets that can be linked with the N3C Data Enclave