What does a git SHA depend on?

Tags:

git

I was wondering what are all the parameters a git SHA depends on ? I am guessing there would be some other parameters like timestamp etc., besides content of the commit, on which construction of the SHA depends on.

I am interested in all such parameters on which this depends. I am also interested in the situation where all such parameters would be the same, or enforced to be same resulting in exactly the same git SHA of any two commits made by two people.

754

asked Sep 29 '15 22:09

pranavk

1 Answers

For a commit, the ID depends on checksums of at least...

The tree (all the files and directories) ID which is made up of...
- The content of all the files, not the diff, called a blob.
- The directory tree (names of files and directories and how they're organized).
- The permissions of all the files and directories.
The parent commit ID(s).
The log message.
The committer name and email and date.
The author name and email date.

If you change just about anything about the commit the commit ID changes.

Including the parent commit IDs is very important. It means two commits with exactly the same content, but built on different parents, will still have different IDs. Why would you do that? It means if the ID of two commits are the same you know their entire history is the same. This makes it very efficient to compare and update Git repositories. "I have branch foo at commit ABC123, you do too? Great, we're in sync!"

When comparing Git to other version control systems, remember that in many popular "reliable" systems, like Subversion or CVS, anyone with the file permissions can go in and undetectably change history in the central repository. With Git such tampering will be immediately detected because it will change all the downstream commit IDs, or if they brute force matched the IDs the content would be complete nonsense.

The possibility of a SHA1 collision possibility has already been considered. Long story short, in a conflict the existing object wins.

The probability of a SHA1 collision happening accidentally is so vanishingly small, I hope your asteroid, cosmic ray, and wolf attack insurances are paid up.

If all 6.5 billion humans on Earth were programming, and every second, each one was producing code that was the equivalent of the entire Linux kernel history (3.6 million Git objects) and pushing it into one enormous Git repository, it would take roughly 2 years until that repository contained enough objects to have a 50% probability of a single SHA-1 object collision. A higher probability exists that every member of your programming team will be attacked and killed by wolves in unrelated incidents on the same night.

Seriously, there are better things to worry about, like the 1 in 100 chance of a drive failure. How are your backups?

answered Sep 27 '22 21:09

Schwern

Related questions
                            
                                Why do I have to stash / commit my changes before switching branches?
                            
                                Can I have "git stash" to automatically include untracked files by default?
                            
                                git pull remote master in detached head
                            
                                Xcode 5 - How to use source control with a workspace
                            
                                Managing Git or bitbucket repositories through Sublime text [closed]
                            
                                How to check if a directory is a git repository in C#
                            
                                git: push deleted branch to remote
                            
                                AWS - Installing bower components via npm install
                            
                                Is it OK to use a long non-ascii name for the user.name Git configuration?
                            
                                Swift: Type 'ViewController' does not conform to protocol 'UIPageViewControllerDataSource'
                            
                                A complete backup of a git branch
                            
                                show git log timestamps in ISO format in user's timezone?
                            
                                Specify Git Rename after changes
                            
                                Why my git is not updating from 1.7.1 to 1.9.4
                            
                                Best practices for multiple git repositories
                            
                                How to avoid detached headstate in git?
                            
                                Force Git push on SourceForge
                            
                                Git revert deleted file and preserve file history
                            
                                Doing a `git checkout` while running `rspec`
                            
                                Intellij import issue - java.io.IOException: Cannot run program "git": error=2, No such file or directory

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With