Do you need to find data for your research? We can assist you.

On this page you can find out about data resources subscribed to by UC Davis Library or about obtaining raw research data shared directly by scientists.

There are two ways to approach finding research data. National or subject-level repositories may contain the best match for the data you need. Most of the time, however, finding data cannot be accomplished by one-stop shopping. Data that do not have clear subject repositories are often hosted in general or institutional repositories. To identify these, use registries: catalogs of repositories, searchable by subject keywords.

Click here to search the UC Davis Library full database listing or see a selected list below.

There are also a number of datasets and software shared by UC Davis researchers. These are only a small fraction of the data UC Davis researchers share through their websites.

USDA National Agricultural Statistics Service “Quick Stats” statistics database. Select U.S., state, or county data for crops, livestock and other agricultural commodities. Query database by geographic area, search by commodity, or view data in charts or maps. Data sets include Census of Agriculture.

American FactFinder is the Census Bureau’s data access point. It provides tools to look up, view and deliver data, reports, tables, maps and fact sheets on United States population, economy and industry statistics. Large- and small-scale geographies, e.g., states, counties, places, census tracts and blocks, can be searched. Data sets are from the Decennial Census, American Community Survey, Annual Population Estimates, Economic Census, and annual economic surveys. All data sets and statistics may be freely downloaded or viewed online.

Coverage: Data sets are current and include 2000 Census and some 1990 Census data and American Community Survey data.
Tags: Census, Economics, Social Science, United States Of America
Related links: American FactFinder: How do I find all the Census Tracts in a Place?

The Association of Religion Data Archives (ARDA) strives to democratize access to the best data on religion. Founded as the American Religion Data Archive in 1997 and going online in 1998, the initial archive was targeted at researchers interested in American religion. The targeted audience and the data collection have both greatly expanded since 1998, now including American and international collections and developing features for educators, journalists, religious congregations, and researchers. Data included in the ARDA are submitted by the foremost religion scholars and research centers in the world. Currently housed in the Social Science Research Institute at the Pennsylvania State University, the ARDA is funded by the Lilly Endowment, the John Templeton Foundation, and the Pennsylvania State University.

Tags: Religion

Provides access to statistics for a variety of U.S. Department of Justice reports and documents as well as some international statistics on crime and criminal justice. Link to data from the FBI’s Uniform Crime Reports and download data series. Includes surveys of DNA Crime Laboratories.

U.S. Bureau of Labor Statistics Data access point. Labor statistics include prices, inflation, employment, consumer price indices data and tables.

Tags: Economics

The State of California’s Employment Development Department offers a link to InfoUSA’s employer directory for the state as part of its Data Library. This database can be searched by keyword, NAICS code, county, or industry.

Tags: Business, Labor

Centers for Disease Control and Prevention (CDC) WONDER provides a single point of access to a wide variety of reports and numeric public health data. Drill down by topic, browse the A-Z list, or search by keyword to find statistics and data.

The Census of Marine Life (CoML) is a research program that seeks to assess the diversity, distribution and abundance of ocean life and to explain how it changes over time. The portal provides access to a global network of researchers in more than 45 nations engaged in this ten-year marine life initiative. Includes three major projects: Ocean Biogeographic Information System (OBIS), the History of Marine Animal Populations project (HMAP), and the Future of Marine Animal Populations project (FMAP). Seven studies have been initiated to collect data for these projects: Biogeography of Deep-Water Chemosynthetic Ecosystems (ChEss); Census of Diversity of Abyssal Marine Life (CeDAMaR); Gulf of Maine Program (GoM); Mid-Atlantic Ridge Ecosystems Project (MAR-ECO); Natural Geography in Shore Areas (NaGISA); Pacific Ocean Shelf Tracking Program (POST); and Tagging of Pacific Pelagics (TOPP).

China Data Online is a primary data source for China studies. It includes (1) China Statistical Databases; (2) China Census Databases; and (3) China Spatial Data Service (China Geo-Explorer). It provides easy access to the various statistical yearbooks published by the National Bureau of Statistics of China, comprehensive statistics, and Census data on the economy and population at national, provincial, city, county, and even township levels.

“EH.Net provides an on-line location for researchers in economic history to make their data series available to other professionals and interested scholars. Several data series have been given to EH.Net and are available as downloadable files, while many other titles may be accessed through our Database Directory”.

Sanborn fire insurance maps, created to help fire insurance companies assess risk, are large-scale community plans drawn at a scale of 50 feet to one inch. The maps give street names, street and sidewalk widths, property boundaries, and house and block numbers. Sanborn maps also contain data about construction details, building use, and the utilities and transportation infrastructure that supported the community overall. Several editions of maps exist for some cities.

Coverage: 1867-1970
Tags: California, Geography

Federal Reserve Economic Data (FRED) from the Federal Reserve Bank of St. Louis. The database contains over 13,000 U.S. economic time series. With FRED® you can download data in Microsoft Excel and text formats and view charts of data series.

Tags: Economics

Provides single-chemical toxicity information for aquatic and terrestrial life, which is useful for examining impacts of chemicals on the environment. Peer-reviewed literature is the primary source for the database, including information on the species, chemical, test methods, and results. Another source of test results is independently compiled data files (such as the Pesticide Ecotoxicity database) provided by various United States and international government agencies. ECOTOX is a unified interface providing access to three U.S. Environmental Protection Agency (U.S. EPA) ecological effects databases: AQUIRE (all aquatic species, including freshwater and marine); TERRETOX (terrestrial animals, mainly wildlife); and PHYTOTOX (terrestrial plants).

Coverage: 1972-present
Tags: Ecology, Environmental Science, Pollution, Toxicology

Eurostat is the statistical agency for the European Union. The site provides access to monthly, quarterly, and annual data from 1960 onwards. Topics include: general statistics, regional statistics, economy and finance, agriculture and fisheries, environment and energy, population and social conditions, services and transport, and intra- and extra-EU trade statistics. Coverage is Europe.

The website of the Foreign Agricultural Service (FAS) of the U.S. Department of Agriculture (USDA) provides access to statistics on trade, various reports on agricultural commodities and data, agriculture in other countries, information on federal export programs and to the magazine AgExporter.

FedStats is a gateway to statistics from over 100 United States Federal government agencies. Link directly to statistical data from agencies, for example, the Census Bureau, and the National Agricultural Statistics Service. Access statistics by topics or subjects like crime, education, etc. Search across Federal government agencies using words or phrases to find agencies with statistics. Data sets vary by agency and format.

The GIS DataDepot is a commercial data site that has numerous data holdings available for free download. Additional data may be downloaded via a “Premium” option or written to CD-ROM for a fee. No-fee registration is required to download even the free data. Coverage is worldwide but generally not very detailed (VMAP0). US coverage includes TIGER/Line files; soils data (STATSGO and SSURGO); DEMs; and transportation, utilities, hypsography, and vegetation data at 1M.

Coverage: worldwide
Tags: Geographic Information Systems (GIS), Geography

The Google Public Data Explorer makes large datasets easy to explore, visualize and communicate. The Data Explorer features data sets from the World Bank, the OECD, Eurostat, the U.S. Census Bureau, the U.S. Bureau of Labor Statistics, the U.S. National Center for Health Statistics, and the California Department of Education.

Tags: Geography

The Global Terrorism Database (GTD) is an open-source database including information on terrorist events around the world from 1970 through 2013 (with additional annual updates planned for the future). Unlike many other event databases, the GTD includes systematic data on domestic as well as transnational and international terrorist incidents that have occurred during this time period and now includes more than 125,000 cases. For each GTD incident, information is available on the date and location of the incident, the weapons used and nature of the target, the number of casualties, and, when identifiable, the group or individual responsible.

Tags: Politics

Statistical data from the Yearbook of Immigration Statistics, profiles of legal permanent residents and naturalized citizens, and mapping information, available in PDF as well as downloadable data formats. Statistics include numbers of immigrants, status, country of origin, etc. Data available from 2000, with archived information from the Yearbook of Immigration Statistics back to 1996.

National Center for Health Statistics data center. Data by topic, e.g., births, deaths, disease. Links to CDC Data Warehouse for access to health statistics and data including vital statistics, detailed data sets, and survey data.

Related: NCHS Data Access

Tags: Medicine

OceanPortal is a directory of websites related to ocean data and information. Covers topics such as: organizations, projects and programs including data centers; environment and governance; scientists and ships; and commerce and trade. Frequently used resources include photo libraries of the Crustacean Gallery, Globe Project Maps & Graphs, and ASF ERS-1 SAR Image Gallery.

VPN

REAXYS is the web-based replacement for the CrossFire platform. It provides access to chemical property, structure, and reaction data from the Beilstein (organic chemistry) and Gmelin (inorganic and organometallic chemistry) databases. The Beilstein database contains information on over 8 million compounds and more than 5 million chemical reactions. It provides comprehensive coverage of reactions, structures, and properties in organic chemistry, as well as references to the literature. For each compound, up to 350 chemical and physical data fields of critically evaluated data are available. The Gmelin database contains information on over 1.4 million compounds, including coordination compounds, alloys, solid solutions, glasses and ceramics, polymers, and minerals. It provides comprehensive coverage of structures and properties in inorganic and organometallic chemistry; references to the literature and information on reactions (over 900,000) are also included.

Coverage: 1771 - present.
Tags: Chemistry, Organic Chemistry, Organometallic Chemistry

The Roper Center for Public Opinion Research, currently located at Cornell University, is one of the world’s leading archives of social science data, specializing in data from public opinion surveys. The Center’s mission is to collect, preserve, and disseminate public opinion data; to serve as a resource to help improve the practice of survey research; and to broaden the understanding of public opinion through the use of survey data in the United States and abroad. Founded in 1947, the Roper Center holds polling data ranging from the 1930s, when survey research was in its infancy, to the present. Over 650,000 questions of U.S. samples can be searched at the question-level in iPOLL, while RoperExpress offers downloads of over 20,000 datasets from over 100 countries.

VPN

Data from more than 100 different government and non-government sources. Collection of more than 6,000 current and historical data series on high-interest research topics dating back more than 20 years. Discover, view, and export key information measures for State Stats and Local Stats. Topics include: Agriculture, Crime and Law Enforcement, Defense, Economy, Education, Education and Culture, Employment and Labor, Geography, Government Finances: Federal, Government Finances: State and Local, Health, Households and Housing, Immigration, Population, Population and Politics, Religion, Social Welfare, and Transportation.

The primary source for those who need to evaluate the hazardous potential of substances used in commerce. Combines toxicological, flammability, reactivity, explosive potential, and regulatory information on approximately 26,000 substances, including 2,600 new entries. Also includes Immediately Dangerous to Life or Health (IDLH) levels for approximately 1,000 chemicals, and covers exposure-level classifications for a number of regulatory agencies, from OSHA to the U.S. Department of Transportation. Especially important in areas of industrial hygiene, safety, emergency response, law, and policy making.

Coverage: 2005
Tags: Chemistry, Toxicology

VPN

Statista is a statistics portal that provides access to data on many different topics from more than 18,000 different sources. Topics include agriculture, advertising, health, hospitality, consumer goods, and much more. Sources are government agencies and business sources. The focus is on current (not historical) statistics. Data is downloadable in Excel, PowerPoint, and JPG formats.

The Arabidopsis Information Resource (TAIR) provides a comprehensive resource for the scientific community working with Arabidopsis thaliana, a widely used model plant. TAIR consists of a searchable relational database, which includes many different data types. Data can be viewed using an interactive MapViewer and analyzed with our tools. Datasets can be downloaded. In addition, pages on news, information on the Arabidopsis Genome Initiative (AGI), Arabidopsis lab protocols, and useful links are provided.

A website for transportation researchers and analysts, aimed at providing a broad collection of intermodal transportation data collected by various agencies within the U.S. Department of Transportation or by other federal agencies.

U.S. Political Stats contains over 225,000 data series (data measures over time about people, places, and policies) across 10 distinct political science data categories, aligned with a typical political science curriculum. The data series span as far back as 1982 and are updated on a timely basis in accordance with the collection and publication schedules of original sources. Detailed source information, with links where applicable, is provided for every data series.

The United States Census Bureau is responsible for the Census of Population and Housing. It also gathers other national demographic and economic data. As part of the United States Department of Commerce, the Census Bureau serves as a leading source of data about America’s people and economy.

Access current and cumulative U.S. export and import data for over 9,000 export commodities and 17,000 import commodities. USA Trade Online provides trade statistics using the Harmonized System (HS) up to the 10-digit level and the North American Industry Classification System (NAICS) commodity classification codes up to the 6-digit level. Data for many series goes back to 1992.

The World Bank’s Open Data initiative is intended to provide all users with access to World Bank data. The data catalog is a listing of available World Bank data sources. This listing will continue to be updated as additional data resources are added. These resources include databases, pre-formatted tables and reports. Each of the listings includes a description of the data source and a direct link to that source. Where possible, the databases are linked directly to a selection screen to allow users to select the countries, indicators, and years they would like to search. Those search results can be exported in different formats. Users can also choose to download the entire database directly from the catalog.

Tags: Economics

The WTO provides quantitative information in relation to economic and trade policy issues. Its databases and publications provide access to data on trade flows, tariffs, non-tariff measures (NTMs) and trade in value added.

Registry: Searched across
UNdata: United Nations data on crime, education, health, global indicators, population, trade and others
Data.gov: U.S. Government open data, including state data
DataCite: Data with registered DOIs across repositories
DataOne: Ecological, environmental and archeological databases
DataMed: Biomedical data repositories. This is a beta version search engine developed for NIH BD2K Data Discovery Index

Repository: Subject
National Center for Biotechnology Information: Genetics, gene expression, genomics, proteomics, assays
Inter-university Consortium for Political and Social Research (ICPSR): Political and social surveys, including public health studies; allows variable-level searches
University of California Research eXchange (UCReX): Cross-campus queries of clinical aggregate data from 12 million de-identified patient records
The Knowledge Network for Biocomplexity: Environmental and ecological datasets
Dash/Merritt: General subject repository hosted by California Digital Library
Harvard Dataverse Network: General subject repository hosted at Harvard
Dryad: General subject repository hosting data associated with publications
figshare: General subject repository

Not finding what you need? Contact us. We will search deeper. We can also assist you with identifying the right government agency that may have the data you need.

Need to scrape data for your research? Consider contacting Digital Scholarship or the Data Science Initiative.

There are several computational resources available to UC Davis researchers.

Resource: Description
Amazon Web Services: Cloud computing, database hosting
Microsoft Azure: Infrastructure-as-a-Service (IaaS) and Platform-as-a-Service (PaaS)
Research Cores and Facilities: Experimental data collection and analysis
XSEDE: High-performance supercomputing provided by NSF. Startup allocations are available for free. A proposal is required for extensive computing needs. Contact Bill Broadley for more information.
San Diego Supercomputer Center: SDSC is a UC-wide HPC resource. To set up a project, contact services@sdsc.edu.