Yale Data Green FAQs

Frequently Asked Questions for Yale Data Green

Yale Data Green is a research data platform that contains a collection of datasets owned and made available by various groups across the university for cloud-based querying and analysis. It is powered by Redivis, a cloud platform.

Data Green is a central place where Yale researchers can come to for research data, much like the New Haven Green serves a place for New Haven residents to come together. 

Redivis is the underlying cloud platform that hosts datasets and provides tools for: dataset storage and access controls, documentation/metadata and versioning, web-based exploration and analysis.

Data Green is geared toward active analysis and controlled access to data. Dataverse is a repository geared toward dissemination and archiving.

Yale Data Green: Optimized for accessing active research datasets that are owned and made available by various research organizations. Users can query, analyze, and collaborate around datasets with access controls and governed use. 

Yale Dataverse: A repository for publishing and preserving research outputs (datasets and related materials), often in support of citation, sharing, and long-term access/preservation. 

A mix of research-relevant datasets curated or licensed by Yale units and made available for analysis. This can include:

  • Public or widely shareable datasets
  • Yale-licensed/contracted datasets (restricted to eligible users)
  • Yale-generated datasets shared within a lab, project, department, or collaboration space

Availability and access vary by dataset.

Typically, Yale affiliates (faculty, staff, students) with an active Yale NetID can request to join a Yale organization. Some collaborations may support access for external partners, but that depends on the specific dataset’s terms and Yale’s access policies. 

During Phase 1 of implementation, only Yale Library and Data Intensive Social Science Cetner (DISSC) will be active organizations in Data Green. In Phase 2, set to begin in Fall 2026, other Yale units will be able to join as new organizations to manage their own data. Please contact researchdata@yale.edu if you are interested.

Reach out to researchdata@yale.edu for support using Yale Data Green. 

Redivis, the underlying cloud platform behind Yale Data Green, maintains a user guide, available here: https://docs.redivis.com/

Once you log into yale.redivis.com with your Yale netID, you can click “Datasets” on the bottom of the banner and see the titles for all datasets Yale manages in the Data Green.

Datasets in Yale Data Green have different levels of permissions. If you log in with your Yale netID, you will be able to see the titles of all the datasets. For some datasets, you may need to request access via a form to see the metadata and content.

Datasets in Yale Data Green have different requirements, which are listed individually for each dataset. For some, users can download the entire dataset. For others, only a portion may be downloaded.

Please contact marx.reference@yale.edu to talk to Yale Library staff about your data needs. 

Almost any tabular data can be added to Yale Data Green, including in CSV, Parquet, or JSON file formats. Some file formats, such as Excel spreadsheets, may need additional work to add to the Data Green, but it is possible. We recommend that any proprietary file formats, like Excel files, be converted into more readily accessible formats whenever possible.

Basic analysis can be performed for free using an environment with base-level RAM and CPU compute resources. Upgrading to more resources may have a cost associated with it, and your needs may be better suited by other Yale environments, such as those at the Yale Center for Research Computing (YCRC). Reach out to Yale Library at researchdata@yale.edu for help determining what resources best fit your needs!

The collaborative coding and analysis workspaces in the Data Green allow sharing of notebooks, queries, and dataset transformations, as well as data assets. The “Studies” feature can be a helpful home base for collaboration and for securely controlling access to your work.

Sign in with you Yale netID to create an account and start exploring the publicly accessible datasets! 

Yes! All datasets on Redivis have built-in version control.