Research Data Lifecycle

Research Data Lifecycle

Research Data Lifecycle

Plan and design

Create data management plans (DMPs) that outline how data will be collected, described, stored, and shared. 

Yale Library offers consultations on best practices for creating data management plans and designing data-intensive research projects. The Library also offers access and training for DMPTool, a resource to help researchers draft their data management plans. 

The Cushing/Whitney Medical Library supports data management and sharing in compliance with NIH requirements.

Two staff members looking at a computer screen.

Collect and acquire

Gather or generate data by fieldwork, experiments, simulations, or collect data from existing resources, databases, and archives.  

Yale Library procures datasets and databases for university-wide use.. 

Subject specialist librarians offer consultations and instruction on finding and accessing data, for example by FOIA request or through public data APIs. 

The Data-Intensive Social Science Center (DISSC) procures datasets for department or project-specific needs.  

The Cushing/Whitney Medical Library supports access to biomedical, clinical, and bioinformatics data resources.

Two people looking at data on a computer screen.

Analyze and collaborate

Process and analyze collected data to extract meaningful insights and test hypotheses.  

StatLab and the DHLab provide consultation and training for statistical and digital humanities analysis. 

YCRC supports Yale’s advanced computing infrastructure.  

Yale Library offers workshops and consultations with subject and methods specialists, on a wide range of analysis techniques, from text mining to regression analysis to raster analysis to network analysis and more.  

A group of people looking at someone presenting data on a large screen.

Curate and preserve

Organize, document, and describe your data to ensure it remains accessible and understandable over the long term.  

Yale Library offers consultations on best practices for file organization and naming conventions, metadata, long-term preservation strategies, and expert code review. 

DISSC and the Institute for Social and Policy Studies support YARD, a data curation workflow tool for social scientists.  

A person working on a computer in the Digital Humanities Lab.

Publish and share

Make data accessible to the broader research community to promote transparency and reproducibility. 

Yale Library supports the Yale Dataverse, a generalist repository for archiving, sharing, and accessing any kind of publication-ready research data. 

Yale Library offers consultations on data websites and visualizations and advises on open access publishing, licenses, and copyright.  

Three people around a computer screen talking.

Discover and reuse

Find and use existing data collections for new research projects, analyses, and studies.  

Yale Library supports the Yale Data Green platform for exploration and computation. 

Yale Library and DISSC are building services around Open Research and Reproducibility. 

Two people working in the Digital Humanities Lab.