Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Tracking Excel files in Version Control

We are branching out beyond the development team and trying to get other groups within my company to use version control for important documents that need change tracking. One frequent need is for Excel spreadsheets. These are large spreadsheets, modified fairly frequently (weekly or monthly) but with only a small portion of the cells changed each time.

Just sticking the files in subversion (the particular tool we are using) gives a history of changes and keeps old versions. And the TortoiseSVN client makes it easy for non-technical users. Recent versions of TortoiseSVN even contain a script which can be used to perform nice visual diffs between Excel documents.

My remaining concern is disk space. These are large documents. The diffs between versions are small, but I worry that the version control will notice that the file is binary and fall back to storing each version separately. Does anyone know of a solution to this? For instance, a format we could save in in which the diffs would be small so only differences would be saved, or a version control system which is specifically aware of Excel files? I have not yet done performance testing, but our version control server is already badly taxed and if there is a better solution I'd love to know what it is.

like image 540
mcherm Avatar asked Jan 07 '10 15:01

mcherm


People also ask

How do you track version control in Excel?

View previous versions of a file Open the file you want to view. Click File > Info > Version history. Select a version to open it in a separate window. If you want to restore a previous version you've opened, select Restore.

Can Excel files be tracked?

Turn on Excel Track Changes feature On the Review tab, in the Changes group, click the Track Changes button, and then select Highlight Changes.... In the Highlight Changes dialog box, do the following: Check the Track changes while editing. This also shares your workbook.

How can I see who updated an Excel file?

In the Review tab, select Show Changes. Changes are shown in the pane with the most recent changes on top, in the order the changes were made. You can see who made edits, exactly where in the workbook, when, and what they changed.


2 Answers

Currently SVN cannot efficently store those types of files. There has been some discussion about it though

http://subversion.tigris.org/ds/viewMessage.do?dsForumId=462&dsMessageId=651443

This SO question shows a graph when storing an OpenXML office document. The results were pretty linear

Will Subversion efficiently store OpenXML Office documents?

Although your question wasn't specifically about that format it may still apply. You might just need to run a test in SVN and see what kind of storage it takes. SVN is pretty good at storing binary files, so it might not be too terrible. The SO question above also mentions saving the file as a plain text XML 2003 document, which you might investigate also.

like image 192
Aaron M Avatar answered Sep 18 '22 14:09

Aaron M


One consideration is using Team Foundation Server for source control (if that's an option), which will just store your delta changes, although it may be a bit heavy for what you're looking for.

like image 43
Bob Palmer Avatar answered Sep 20 '22 14:09

Bob Palmer