There are two general rules for file organization: Be consistent and be descriptive. You want to make sure you and your colleagues can find anything you are looking for quickly. You'll need to figure out which specifics make the most sense to you and document your convention in a place everyone in your research group can follow. Here are some guidelines to include in your convention:
Feel free to use our File Naming Formula Template to help you and your team create meaningful file names that follow best practices. Once you fill it out, print it out and put it in a prominent place in your workplace. If you work on collaborative files, appoint someone to hold your team accountable for their file naming practices.
One of the key aspects of data management is keeping an organized file directory. If you have a streamlined and consistent way of organizing your files into directories, it saves you from having to continuously search through directories to find the right file. There is no 'best' way to organize your directories, but we have found that there are wrong ways. Below are the methods we have found that make for smoother directory organization:
A good way to brainstorm the best directory structure for you is to identify the largest "buckets" of content you will have and what they attributes that they have in common. For example to organize files for college courses, the buckets of content might be academic year, course, homework, and readings. Using the largest bucket first, create nested directories to hold the content.
Download our Template Directory Structure to look at how you might structure a complex and data heavy research project. You can alter this template to suite your research needs and help map out the structure of your directory.
Example Directory Organization for College Courses (slash indicates directory):
Version control is the strategy of tracking of changes and edits to files and directories. This allows you to revert to previous versions if you make a mistake or even delete something! This can be a key practice for success on complicated projects and on collaborative teams.
Even if you're tracking changes with the software you're using, you should always keep a copy of the original unedited data available and save a new version when substantial changes are made. It's like saving your progress in a video game along the way so you don't have to go back to the beginning after coming across an unexpected challenge.
There are two main versions of conducting version control:
The process of personally saving versions of your files along the research process. This is good option for those who do not have files that cooperate with software version control (such as rich text files or media files). The file storage system (Box, One Drive) or software (MS Word) you use may have some built in version control, but that is not the main purpose of the tool. Be consistent about when you save another version and how you keep track of your system.
Lauren Phegley holds consultations on data management, DMPTool, writing Data Management Plans (DMPs), and data sharing.
Director of Research Data & Digital Scholarship
See schedule button for current dates and times. Appointments available in person and on zoom.