Search here to find large public and licensed datasets

The NYU Data Catalog facilitates researchers’ discovery of data by providing a searchable and browsable online collection of datasets. Rather than functioning as a data repository, the catalog is a digital way-finder for researchers looking for datasets relevant to their work. It includes datasets generated by NYU researchers as well as publically available and licensed datasets that are generated at external organizations, e.g. the Bureau of Labor Statistics.

The NYU Data Catalog is designed to:

  • Increase the visibility of research data generated by NYU researchers
  • Facilitate collaboration across departments and institutes at NYU
  • Help NYU researchers locate and understand datasets generated at external organizations
  • Support the process of re-using research data

If you are interested in submitting a dataset to the NYU Data Catalog, have a suggestion for additional datasets to add, or are willing to serve as a local expert, please use the Contact Us form.

The code used to create the NYU Data Catalog is open source and available via GitHub. Documentation and further information is available via OSF. If you would like to create a similar catalog, please use the Contact Us form to learn more about the multi-institution Data Catalog Collaboration Project.