How does git track file changes internally?

Tags:

Could somebody explain how git knows internally that files X, Y and Z have changed? What is the process behind the scenes that recognizes when a file has not yet been added or has modifications? I am asking because, with Subversion it's simple to figure out that it keeps track of these things by having a .svn directory under each folder, but for git I can't seem to find a description of the inner workings of this. I doubt it scans through all the sub-directories for changes, as it's quite fast.

So, out if curiosity, what are it's inner workings?

717

asked Apr 02 '13 13:04

carlspring

2 Answers

The mechanisms by which one determines the status of a file is fairly straightforward. To know what files have been staged, one simply diffs the HEAD tree with the index. Any items that appear only in the index have been staged for addition, any items that appear only in HEAD have been removed and any items that are different have had changes staged.

Similarly, one would detect unstaged changes by diff'ing the index with the working directory.

Your question in particular asks how this can be so fast (after all, computing the SHA1 hash of a file is not exactly speedy.) This is where the index - also known as the cache - comes in to play again. The index also has fields for the file size and file modification time. Thus one can simply stat(2) a file on disk and compare against the index's file size and file modification time to know whether to hash the file or not.

200

answered Oct 04 '22 00:10

Edward Thomson

You can find your answer in the free book Pro-Git on chapter Git Internals

This chapter explains how git works behind the hood.

As Leo stated, git checks the SHA1 of the files to see if it has changed you can check it like this (Taken from Git Internals):

$ echo 'version 1' > test.txt $ git hash-object -w test.txt 83baae61804e65cc73a7201a7252750c76066a30

Then, write some new content to the file, and save it again:

$ echo 'version 2' > test.txt $ git hash-object -w test.txt 1f7a7a472abf3dd9643fd615f6da379c4acb3e3a

answered Oct 03 '22 23:10

stdcall

Related questions
                            
                                NodeJS + Express - Apply session middleware to some routes
                            
                                Changing ActionBar tabs underline color programmatically
                            
                                Assigning materials to an OBJLoader model in three.js
                            
                                How to use auto with const and & in C++?
                            
                                JavaFX Pass MouseEvents through Transparent Node to Children
                            
                                Is it somehow possible to style an iframes before/after pseudo-element?
                            
                                Is it possible to define a jax-rs service interface separated from its implementation (with eclipse and jersey)?
                            
                                //! [0] in Qt source code
                            
                                How to POST the data from a modal form of Bootstrap?
                            
                                Ubuntu, remove network TAP interface
                            
                                How to re-install lxml?
                            
                                No standard way to compare smart pointer with regular pointer?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With