Search here to find large public and licensed datasets

About the Data Catalog

The NYU Data Catalog facilitates researchers’ discovery of data by providing a searchable and browsable online collection of datasets. Rather than functioning as a data repository, the catalog is a digital way-finder for researchers looking for datasets relevant to their work. It includes datasets generated by NYU researchers as well as publically available and licensed datasets that are generated at external organizations, e.g. the Bureau of Labor Statistics.

The NYU Data Catalog is designed to:

  • Increase the visibility of research data generated by NYU researchers
  • Facilitate collaboration across departments and institutes at NYU
  • Help NYU researchers locate and understand datasets generated at external organizations
  • Support the process of re-using research data

If you are interested in submitting a dataset to the NYU Data Catalog, have a suggestion for additional datasets to add, or are willing to serve as a local expert, please use the Contact Us form.

The code used to create the NYU Data Catalog is open source and available via GitHub. Documentation and further information is available via OSF. If you would like to create a similar catalog, please use the Contact Us form to learn more about the multi-institution Data Catalog Collaboration Project.

Meet the Team

Nicole

Nicole Contaxis

Project Lead

Nicole Contaxis, MLIS is the Project Lead for the Data Catalog project at the NYU Health Sciences Library. She works alongside researchers to make research data discoverable through the NYU Data Catalog. Her areas of interest include data sharing, data ethics, and community engagement. Nicole is a former National Digital Stewardship Resident at the National Library of Medicine. She received her MLIS from UCLA, and is currently working on her M.A. in Bioethics at NYU.

Michelle

Michelle Yee

Data Catalog Coordinator

As a Data Catalog Coordinator, Michelle Yee engages with NYU researchers to promote their research products and encourage collaborative opportunities. Michelle has prior experience in clinical research within NYU Langone and is completing a MPH in Epidemiology at NYU GPH.

Ian

Ian Lamb

Senior Solutions Developer

Ian Lamb is a full-stack web developer at the NYU Health Sciences Library and is the principal developer of our data catalog. He focuses on building friendly and usable systems to advance the institution’s clinical, educational, and research goals.

Debbie

Debbie Peters

Executive Assistant to the Chair

Debbie works closely with library administration on human resource administration, finance policy and procedures and managing the day-to-day daily operations. She serves as meeting coordinator and administrative support for the NYU Data Catalog and the Data Catalog Collaboration Project (DCCP).

Alisa

Alisa Surkis

Vice Chair for Research and Assistant Director, Research Data and Metrics

Alisa Surkis, PhD, MLS is the Vice Chair for Research and Assistant Director, Research Data and Metrics for the Health Sciences Library. She serves as the senior advisor for the NYU Data Catalog and as a member of the Data Catalog Collaboration Project.

The Data Discovery Collaboration

The Data Discovery Collaboration was created to facilitate the discovery of biomedical research data that are difficult to find. The DDC is a multi-institutional consortium that have implemented local projects, programs, or technologies to index and make available data. This collaboration brings a cross-institutional perspective to addressing usability, data sharing workflows, metadata, and outreach for improving data discovery efforts.

The Mission of the DDC:

  • To enhance discovery of data and other research products in order to maximize their value

To learn more about our accomplishments, our publications, and how to join, please visit the DDC website.