National COVID Cohort Collaborative Data Enclave
Alternate Titles(s): N3C Data Enclave
- Description
The National Center for Advancing Translational Sciences (NCATS) has systematically compiled clinical, laboratory and diagnostic data from electronic health records to support COVID-19 research efforts via the National COVID Cohort Collaborative (N3C) Data Enclave. As of August 2, 2022, the repository contains information from over 15 million patients (including 5.8 million COVID-19 positive patients) across the United States.
The N3C Data Enclave is organized into 3 levels of data with varying access restrictions:
- Synthetic dataset: Contains no protected health information (PHI). This is a statistically-comparable artificial dataset derived from the original dataset.
- Can be requested by: Researchers from US-based or foreign institutions, and citizen scientists
- De-identified dataset: Contains no PHI. This dataset consists of real patient data with shifted dates of service and truncated ZIP codes of patients residing in areas with populations above 20,000.
- Can be requested by: Researchers from US-based or foreign institutions
- Limited Data Set (LDS): Contains 2 PHI elements (dates of service and patient ZIP code). This dataset consists of real patient data.
- Can be requested by: Researchers from US-based institutions only
- Synthetic dataset: Contains no protected health information (PHI). This is a statistically-comparable artificial dataset derived from the original dataset.
- Publisher
- Timeframe
- 2020 - Present
- Geographic Coverage
-
United States
- Local Expert
Access
- Restrictions
-
Free to AllApplication Required
- Instructions
N3C data is available at no cost through institutions that have executed a Data Use Agreement (DUA) with the N3C. A list of participating institutions can be found here.
NYU Grossman School of Medicine has executed a DUA with the N3C and researchers at the institution may register for an account with N3C to begin the onboarding process. After an account is created, researchers must log in and complete all mandatory training. Human subjects research training is required to access the Limited Data Set and de-identified dataset.
Next, researchers must submit the N3C Data Use Request Form (see detailed instructions) and attest to the Data Use Agreement and N3C Data User Code of Conduct. An IRB determination letter must also be submitted if requesting access to the Limited Data Set. The Data Use Request Form will be reviewed by the N3C Data Access Committee and their decision will be returned via email correspondence with further instructions for accessing the data.
- PubMed Search
- View articles which use this dataset
- Other Resources
-
N3C External Dataset Registry
Public datasets that can be linked with the N3C Data Enclave