NYU Dataset

Variant-specific introduction and dispersal dynamics of SARS-CoV-2 in New York City – from Alpha to Omicron

UID: 10619
* Corresponding Author
Description
Utilizing genomic sequencing data collected from COVID-19 patients in New York City metropolitan area, investigators produced a comparison of the introduction and dispersal of the main SARS-CoV-2 variants (Alpha, Iota, Delta, and Omicron-BA.1) from 2020 through 2022. The analysis included 5,577 sequences obtained from samples collected as part of genomic surveillance at NYU Langone Health from December 1, 2020 to February 27, 2022; all publicly-available sequences collected within the study area in the GISAID database; and a set of ‘background’ sequences featured in the last North American Nextstrain build that was available on the last collection date of the considered variant in our data set to provide a broader global context of SARS-CoV-2 phylogenetic diversity. In total, the dataset includes 11,758 (Iota), 16,395 (Alpha), 60,019 (Delta), and 32,322 (Omicron-BA.1) sequences.
Timeframe
2020 - 2022
Geographic Coverage
New York (State) - Long Island
New York (State) - New York City
Subject of Study
Subject Domain
Keywords

Access

Restrictions
Free to All
Instructions
Data and code supporting this study can be accessed through the associated Github repository.
Access via GitHub

Data and code

Associated Publications
Data Type
Equipment Used
IDT xGen COVID Capture Panel
Illumina NovaSeq 6000
Software Used
bcl2fastq2 v2.20
BEAST v1.10.5
BWA v0.7.17
GATK v3.8
IQ-TREE v2.2.0.3
minimap2 v2.24
Pangolin v3.1.20
Sambamba v0.6.8
TreeTime v.0.7.4
Trimmomatic v0.36
Study Type
Observational
Dataset Format(s)
R, CSV, XML, TREE, SHP
Grant Support
G0E1420N/Research Foundation - Flanders
G098321N/Research Foundation - Flanders
C14/18/094/Internal Funds KU Leuven
F.4515.22/Fonds National de la Recherche Scientifique
874850/European Union Horizon 2020
725422/European Union Horizon 2020
206298/Z/17/Z/Wellcome Trust
2.5020.11/Fonds de la Recherche Scientifique de Belgique
Related Datasets