CDC COVID-19 Case Surveillance Data

UID: 10427
Description

The Centers for Disease Control and Prevention (CDC) has released deidentified line-listed datasets based on COVID-19 cases reported to CDC. Data are suppressed for low-frequency records (fewer than 5), indirect identifiers, and uncommon combinations of demographic characteristics.
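
The low-frequency rule described above can be illustrated with a minimal sketch. This is not CDC's actual procedure: the function name, the field names, and the "Suppressed" marker are assumptions made for illustration. Only the threshold of 5 comes from the description.

```python
from collections import Counter

THRESHOLD = 5          # from the description: frequencies below 5 are suppressed
MARKER = "Suppressed"  # hypothetical placeholder; the real datasets may use another value


def suppress_rare_combinations(records, keys, threshold=THRESHOLD, marker=MARKER):
    """Return copies of `records` with rare combinations of `keys` masked.

    A combination is "rare" when fewer than `threshold` records share it.
    Fields not listed in `keys` are left untouched.
    """
    # Count how many records share each combination of the chosen fields.
    counts = Counter(tuple(r[k] for k in keys) for r in records)

    result = []
    for r in records:
        combo = tuple(r[k] for k in keys)
        if counts[combo] < threshold:
            # Build a new dict so the input records are not modified.
            r = {**r, **{k: marker for k in keys}}
        result.append(r)
    return result
```

Called with a list of dictionaries and the demographic fields to check, for example `suppress_rare_combinations(rows, ["sex", "age_group"])`, the function masks those fields in any record whose combination occurs fewer than five times.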

The public-use dataset includes 12 data elements assessing demographic characteristics, testing and reporting dates, health status, comorbidities, and disease outcomes. The restricted-access dataset contains an additional 20 elements (32 in total), including state and county of residence information, details on the delivery of care, and symptoms experienced. The datasets are updated monthly.

Publisher
Timeframe
2020 - Present
Geographic Coverage
Alabama
Alaska
Arizona
Arkansas
California
Colorado
Connecticut
Delaware
Florida
Georgia
Hawaii
Idaho
Illinois
Indiana
Iowa
Kansas
Kentucky
Louisiana
Maine
Maryland
Massachusetts
Michigan
Minnesota
Mississippi
Missouri
Montana
Nebraska
Nevada
New Hampshire
New Jersey
New Mexico
New York (State)
North Carolina
North Dakota
Ohio
Oklahoma
Oregon
Pennsylvania
Rhode Island
South Carolina
South Dakota
Tennessee
Texas
United States
Utah
Vermont
Virginia
Washington (State)
Washington, D.C.
West Virginia
Wisconsin
Wyoming
Subject of Study
Subject Domain
Keywords

Access

Restrictions
Free to All
Instructions

The public-use dataset can be downloaded from the CDC through the first access link.

Detailed instructions for requesting access to restricted data, which includes geographic and additional clinical information, may be found through the second access link.

Access via CDC

Public-use dataset

Access via CDC

Restricted-use dataset

Access via SODA API

Public-use dataset API
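
As a rough sketch of how the SODA API option might be used, the snippet below builds a paged query URL following the general Socrata convention of `https://<domain>/resource/<dataset id>.<format>` with `$limit`, `$offset`, and `$where` parameters. The domain and dataset identifier shown are placeholders, not the real values (those come from the API link above), and the `sex` field in the usage note is a hypothetical column name.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen

# Placeholders only: substitute the domain and dataset identifier
# given by the public-use dataset API link.
EXAMPLE_DOMAIN = "data.example.gov"
EXAMPLE_DATASET_ID = "xxxx-xxxx"


def build_soda_url(domain, dataset_id, fmt="json", limit=1000, offset=0, where=None):
    """Build a SODA query URL for one page of results."""
    params = {"$limit": limit, "$offset": offset}
    if where:
        # SoQL filter expression, e.g. "sex = 'Female'" (hypothetical field).
        params["$where"] = where
    return f"https://{domain}/resource/{dataset_id}.{fmt}?{urlencode(params)}"


def fetch_page(url):
    """Download one page of JSON rows (performs a network request)."""
    with urlopen(url) as response:
        return json.load(response)
```

A caller could request successive pages by increasing `offset` in steps of `limit` until an empty page comes back, for example `fetch_page(build_soda_url(EXAMPLE_DOMAIN, EXAMPLE_DATASET_ID, offset=1000))` once the placeholders are replaced. Line-listed case data can be very large, so paging or a `where` filter is likely preferable to a single request.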

Data Type
Study Type
Observational
Dataset Format(s)
CSV, XML, JSON, RSS
Other Resources
Data Record

Links to Documentation

FAQ

Frequently Asked Questions