Find data

Do you need to find data for your research? We can assist you.

On this page you can find out about data resources subscribed to by UC Davis Library or about obtaining raw research data shared directly by scientists.

There are two ways to approach finding research data. There may be national or subject-level repositories that will contain the best match for the data you need. However, most of the times finding data cannot be accomplished by one-stop shopping. Data that do not have clear subject repositories are often hosted in general repositories or institutional repositories. To identify these, use registries–catalogs of repositories, searchable by subject keywords.

 

 

 

Registry

Searched across

Apertio Open data catalogs (beta version)
UNdata United Nations data on crime, education, health, global indicators, population, trade and others
Data.gov U.S. Government open data, including state data
DataCite Data with registered DOIs across repositories
DataOne Ecological, environmental and archeological databases
DataMed Biomedical data repositories. This is a beta version search engine developed for NIH BD2K Data Discovery Index
Google Dataset Search Datasets that have schema.org metadata. This is a beta version search engine developed by Google.

 

Repository Subject
National Center for Biotechnology Information Genetics, gene expression, genomics, proteomics, assays
Interuniversity Consortium for Political and Social Research (ICPSR) Political and social surveys, including public health studies; allows variable-level searches
University of California Research eXchange (UCReX) Cross-campus queries of clinical aggregate data from 12 million de-identified patient records.
The Knowledge Network for Biocomplexity Environmental and ecological datasets
Dash/Merritt General subject repository hosted by California Digital Library
Harvard Dataverse Network General subject repository hosted at Harvard
Dryad General subject repository hosting data associated with publications
figshare General subject repository

Resource

Description

Amazon Web Services Cloud computing, database hosting
Microsoft Azure Infrastructure-as-a-service (IaaS) and Platform-as-a-Service (PaaS)
Research Cores and Facilities Experimental data collection and analysis
XSEDE High performance supercomputing provided by NSF. Startup allocations available for free. Proposal required for extensive computing needs. Contact Bill Broadley for more information.
San Diego Supercomputer Center SDSC is a UC-wide HPS resource. To set up a project, contact services@sdsc.edu.

Not finding what you need or need support getting data through an API? Contact us.

We can also assist you with identifying the right government agency that may have the data you need.

Need to scrape data for your research? Consider contacting Digital Scholarship or the Data Science Initiative.