Skip to Main Content
Go to Penn Libraries homepage   Go to Guides homepage
Banner: RDDS; Research Data & Digital Scholarship displayed between 3D mesh surfaces

Data Management Resources

File Naming Guidelines

There are two general rules for file organization: Be consistent and be descriptive. You want to make sure you and your colleagues can find anything you are looking for quickly. You'll need to figure out which specifics make the most sense to you and document your convention in a place everyone in your research group can follow. Here are some guidelines to include in your convention:

  • Choose 2-3 descriptors to identify the project or collection the item belongs to and what the specific item is. Have a standard for your research group so things can easily be found and shared.
  • Use capitals (camel case) or underscores instead of periods or spaces. Example: surveyResponseData.csv or survey_response_data.csv
  • Use no more than 30 characters whenever possible
  • Use date format ISO 8601: YYYY-MM-DD
    • The year first format makes it easy to find newest/oldest files. Wikipedia's ISO 8601 page provides additional information on the date and time standard.
  • Avoid special characters in a file name. Common things to avoid are using spaces or ampersands (&).
  • Document your naming convention so you remember what it is and your project collaborators know what it is.

File Naming Formula template screenshot

Feel free to use our File Naming Formula Template to help you and your team create meaningful file names that follow best practices. Once you fill it out, print it out and put it in a prominent place in your workplace. If you work on collaborative files, appoint someone to hold your team accountable for their file naming practices.

Version Control

Version control is the strategy of tracking of changes and edits to files and directories. This allows you to revert to previous versions if you make a mistake or even delete something! This can be a key practice for success on complicated projects and on collaborative teams. 

Even if you're tracking changes with the software you're using, you should always keep a copy of the original unedited data available and save a new version when substantial changes are made. It's like saving your progress in a video game along the way so you don't have to go back to the beginning after coming across an unexpected challenge. 

There are two main versions of conducting version control: 

  • Manual Version Control - the process of personally saving versions of your files along the research process. This is good option for those who do not have files that cooperate with software version control (such as rich text files or media files). The file storage system (Box, One Drive) or software (MS Word) you use may have some built in version control, but that is not the main purpose of the tool. Be consistent about when you save another version and how you keep track of your system. 
  • Software Version Control Systems- systems specifically designed to version control code. These are more complex than manual version control, but are more powerful and integrate into your research process easier. These systems save just the changes to your files instead of a new copy for each version.  

Research Data Engineer

Profile Photo
Lauren Phegley
she/her

Lauren Phegley holds consultations on data management, DMPTool, writing Data Management Plans (DMPs), and data sharing.

Head of Research Data Services

Profile Photo
Lynda Kellam
she/her

Head of Research Data Services

See schedule button for current dates and times. Appointments available in person and on zoom.

Subjects: Data & GIS
Penn Libraries Home Franklin Home
(215) 898-7555