Skip to main content

Data Management Toolkit @ UNH

This toolkit provides information to help researchers develop data management plans and effectively manage their research data.

What is data management

Research data management is an integral part of responsible research practice and involves implementing strategies relevant to all stages of the research data lifecycle. The Data Lifecycle - Vellucci, 2014

Data management is essential because it helps you:

  • Protect your data from loss
  • Find your data when you need it
  • Secure your data
  • Reuse your old data
  • Share your data with others
  • Recognize datasets as scholarly output
  • Improve research integrity, reproducibility, transparency




Image source: Adapted from Vellucci, S. Non-Linear Research Data Lifecycle, 2014.

Data management and research integrity

Data constitute the core of a research project. Maintaining data reliability is key to ensuring the integrity of data-based conclusions. Without proper data management, the validity of research results can be questioned, jeopardizing not only your own reputation, but also the work of others and the reputation of the University.

Responsible data management:

  • Protects data from falsification
  • Preserves confidential human subject information
  • Protects proprietary knowledge
  • Provides evidence of inventorship
  • Clarifies ownership of intellectual property rights
  • Assures that University policies and procedures are followed
  • Supports compliance with sponsor requirements
  • Protects the PI, investigators, and the University from consequences of poor data management

Also see the Responsible Conduct of Research & Scholarly Activity research guide. 

What do we mean by data?

When we talk about data in this Toolkit, we are referring to systematically recorded information that is produced as part of a research process and is the basis of research findings. Funders with data sharing and data management policies may have their own definitions.

Here are some examples of research data types and formats:

  • Sensor data
  • Telemetry
  • Field notes
  • Survey data
  • Samples
  • Text documents or spreadsheets
  • Gene sequences    
  • Images         
  • Audio or video recordings
  • Climate models
  • Economic models
  • Compiled databases    

Evaluating your data needs

Managing your data is a multi-step process. The planning for data collection is the first step, but it is also important to think about what you will do with your data when the project is completed and what long-term retention needs you might have.  In order to properly manage your data you need to understand the nature of the data, its audience and ownership, and its long-term viability. Reviewing the following questions will help you get started:

  • What type of data are you producing?
    • Gather a clear picture of what your data will look like. Is it, for example, numerical data, image data, text sequences, or modeling data? Knowing exactly will inform many decisions you need to make about storage, backups and more. Image data requires a lot of storage space, so you'll want to decide which of your images, if not all, you want to retain, and where such large data sets can be housed.
  • How much data, and at what growth rate?
    • Once you know what kind of data you're producing, you'll be able to assess the growth rate. For example, are you gathering data by hand or using sophisticated instrumentation that is able to capture a lot of data at once? Will there be more data as time goes on? If so, you will need to plan for the growth. What amounts to enough storage this year may not be sufficient for next year.
  • Will it change frequently?
    • The answer to this question impacts how you organize the data as well as the level of versioning you will need to undertake. Keeping track of rapidly changing datasets can be a challenge, so it's imperative you begin with a plan that will carry you through the data management process.
  • Who will use the data?
    • Who is your audience for the data? How will they use the data? The answer to this question will tell you how to structure the data and where to distribute it.
  • Who controls the data (you, UNH, a research center, the funder)?
    • Before you decide how you will manage the data, you need to know if you have the authority to control it or if you have to abide by external requirements.
  • How long should the data be retained? (e.g. 3-5 years, 10-20 years, permanently)
    • Not all data needs to be retained indefinitely. Figure out what's important to keep long-term and make sure your plan for those datasets is solid.

Data Sharing and Management Snafu in 3 Short Acts

More about data management best practices

The following guides cover general principles for managing your data, plus selected information related to particular formats or disciplines.

License and acknowledgement

Creative Commons License

The Data Management Toolkit is maintained by Patti Condon. It is licensed under a Creative Commons Attribution-ShareAlike 3.0 Unported License.

This guide was initially created by Sherry Vellucci and Eleta Exline, and adapted from the MIT Libraries Data Management and Publishing Guide with additional content from the CalTech Library Data Management Guide (with grateful acknowledgement to MIT Librarians and George Porter @ the California Institute of Technology)