Planning will assist you in overcoming a number of obstacles in your research. For example, if you have a pre-defined naming schema for your variables and file names, it will be much easier to find the right data and the right version in the future. If you have estimated the amount of data you will collect, you will be able to request grant funds for curating and managing them.

Planning is particularly important in longitudinal studies, studies that involve surveys, projects that result in multiple data files, including images and video, and Big Data.

A number of agencies now request data management plans to accompany proposals.

The data management plan should address

  1. How the data will be collected
  2. The type or format of data collected
  3. The size of the data
  4. How the data will be described (i.e will you be using codebooks, logs, specific metadata standards, ontologies, etc.)
  5. Where the data will be stored, backed up and secured if necessary
  6. How the data will be analyzed
  7. How the data will be shared and preserved, or reasons not to do so

Several helpful resources

A software carpentry-produced guide to data management, in particular metadata and version control.  http://v4.software-carpentry.org/data/mgmt.html
UC Davis researchers have access to the DMPtool, a service of the California Digital Library, with their Kerberos login. The tool contains templates from multiple federal and private funders. The tool also permits the user to create an editable document for submission to a funding agency, and can accommodate different versions as funding requirements change.
Examples of plans https://dmptool.org/plans/15422.pdf
https://dmptool.org/plans/20143.pdf
https://dmptool.org/plans/14517.pdf
https://dmptool.org/plans/14019.pdf

 

Contact us for further assistance at dataserv@ucdavis.edu.

 

Working with and planning for data management with geospatial data has unique challenges.

Metadata

Most GIS programs will allow you to create basic metadata that will reside along side the spatial and attribute data you create.  Several government agencies and standards bodies have developed metadata standards for geospatial data.  You should select a standard to follow based on what information you need to convey to potential users and who those users will likely be.  Funding bodies may also set requirements that should be considered.

Description of Data Creation

Spatial analysis can generate a large number of intermediate files.  Document the analysis workflow you follow as you perform it, noting which files and processes were used to generate each subsequent file.  Some researchers write out a list of steps, while others use a flowchart, or a software system like ArcGIS’ Model Builder.

Sharing & Preservation

The files we work with for analysis may not necessarily be the ideal format for sharing or storing geospatial data.  For example, when sending a shapefile, it can be easy to forget to include one of the multiple files required to properly use the data.  Consider storing and sharing data in open formats (i.e. a format that doesn’t need a specific software to open it) to make your data accessible by the largest number of people.