Organizing your data ensures that you and others can easily access and understand your research files. Effective data organization is essential for finding, retrieving, sharing, and maintaining control over files to prevent loss or accidental disclosure.
This guide provides best practices for file naming, folder structure, and version control. You will also find resources for citation management and organizing data in spreadsheets and databases.
Effective file naming helps you understand and recall a file's contents at a glance. Following these practices will help you manage, access, and share your files. Here are some best practices:
A logical folder structure helps you and others easily find and work with your files. Aim for a consistent and logical folder structure that reflects how data is collected and accessed. Avoid saving all your research files in a single folder, which can lead to confusion. Instead, create a hierarchy of folders and subfolders to organize your files. Consider organizing by:
Here’s an example directory structure. Your project might differ (e.g., keeping code and results together or organizing documentation by experiment), but this provides a general idea of how to structure your folders logically.
Example file directory image created by Michael Lenard and licensed CC-BY
Some key tips for folder organization include:
Version control allows you to track the changes you or members of your team have made to code, data, or other research outputs. It allows you to retain drafts and details of changes and understand why changes were made, who made them, and what impact they had. Version control helps with collaborative editing, coordinating teams, and maintaining a history of progress.
Strategies include: