About the Data Catalog

The NYU Data Catalog facilitates researchers’ discovery of data by providing a searchable and browsable online collection of datasets. Rather than functioning as a data repository, the catalog is a digital way-finder for researchers looking for datasets relevant to their work. It includes datasets generated by NYU researchers as well as publically available and licensed datasets that are managed by external organizations, e.g. the Bureau of Labor Statistics.

The NYU Data Catalog is designed to:

  • Increase the visibility of research data generated by NYU researchers
  • Facilitate collaboration across departments and institutes at NYU
  • Help NYU researchers locate and understand datasets generated at external organizations
  • Support the re-use of research data in secondary analysis

If you are interested in submitting a dataset to the NYU Data Catalog, would like to suggest additional datasets for inclusion, or are willing to serve as a local expert, please use the Contact Us form.

The code used to create the NYU Data Catalog is open source and available via GitHub. Documentation and further information is available via OSF. If you would like to create a similar catalog, please use the Contact Us form to learn more about the multi-institution Data Discovery Collaboration.

Meet the Team


Nicole Contaxis

Data Services Librarian, Lead of Data Discovery

Nicole Contaxis, MLIS MA is the Lead for Data Discovery and the NYU Data Catalog at the NYU Health Sciences Library. She works alongside the research community to improve data sharing and discovery. Her areas of interest include data sharing and governance, data ethics, and community engagement. Nicole is a former National Digital Stewardship Resident at the National Library of Medicine. She received her MLIS from UCLA, and her MA in Bioethics from NYU.


Michelle Yee

Data Catalog Coordinator

Michelle Yee, MPH, engages with NYU researchers working in the areas of clinical and population health to promote their work and encourage collaborative opportunities. Michelle has prior experience in clinical research within NYU Langone through the Clinical and Translational Science Institute and a MPH in Epidemiology from NYU School of Global Public Health.


Ummea Urmi

Data Catalog Coordinator

Ummea Urmi, MS, works with NYU researchers in the basic sciences to make their data more discoverable through the NYU Data Catalog. Ummea has previously worked at a neuroscience lab at Columbia University through the Howard Hughes Medical Institute. She received her MS in Toxicology from St. John’s University for which she was the recipient of the Clare Boothe Luce Fellowship.


Ian Lamb

Senior Solutions Developer

Ian Lamb is a full-stack web developer at the NYU Health Sciences Library and is the principal developer of our data catalog. He focuses on building friendly and usable systems to advance the institution’s clinical, educational, and research goals.

The Data Discovery Collaboration

The Data Discovery Collaboration was created to facilitate the discovery of biomedical research data that are difficult to find. The DDC is a multi-institutional consortium that has implemented local projects, programs, or technologies to index and make available data. This collaboration brings a cross-institutional perspective to addressing usability, data sharing workflows, metadata, and outreach for improving data discovery efforts.

The Mission of the DDC:

  • To enhance discovery of data and other research products in order to maximize their value

To learn more about our accomplishments, our publications, and how to join, please visit the DDC website.

Icons on the homepage are made by Vectors Market, Gregor Cresnar, and monkik, from www.flaticon.com.