How do we check out a git commit including submodules as they were at that time? One reason why we might want this is to look at a previous version of the main program for which we need to rebuild it with the submodules in the version that was used at the time of the commit. Given this, we could even use this in regular workflow: <ul> <li>First update all submodules with <code>git submodule update --remote --merge</code>, then try to build to see if the program can work with the newest version of all submodules. </li> <li>If it works we are done. If it does not work, then we could go to the previous version of the program incl. the submodule-versions it used and with which it works. </li> <li>Then update the submodules one-by-one and change the program to work with them.</li> </ul> We can kind-of do it by manually looking at each submodule: which commit had the appropriate timestamp (and hope that the program used the then-most-uptodate version). It would be much better if we could see commit X of the program used submodule commit Y. And check those out for each submodule.

In this case, you just need to run <code>git submodule update --checkout</code> (no <code>--merge</code>, no <code>--remote</code>) after checking out a previous commit. There is a lot of confusion around submodules. The basics are actually fairly simple though: <ul> <li>Each submodule is its own Git repository.</li> <li>From the submodule, one refers to the "containing" Git as a superproject.</li> <li>The superproject records the URL and path—these being things you would normally control or provide by running <code>git clone</code>—for each of its submodules in the <code>.gitmodules</code> file.</li> <li>Meanwhile, when you make a commit in the superproject, this commit contains, in its snapshot, all the normal trees and files as usual, but also, for each submodule, the commit ID to check out when checking out the submodule.1 </li> </ul> This has the effect of "freezing" the appropriate submodule commit into each superproject commit. It is—or was originally—intended to manage third party code, where the submodule itself changes rarely compared to the superproject. This model is not at all flexible, and is not suitable for the way many people want to use submodules, which is to keep them at the tip of some branch. So submodules grew the ability to update to branch names, or to be worked-in and have the work rebased and/or merged. These new abilities spawned the <code>submodule.name.update</code> configuration entries and <code>git submodule update --remote</code> options. If you have not configured any of these items, <code>git submodule update</code> alone will check out the desired (recorded) submodule commits for each submodule recorded in the current, i.e., <code>HEAD</code>, commit of the superproject. If you have configured some of these, you can use <code>git submodule update --checkout</code> to override the configuration and cause a <code>git checkout hash-id</code> in each submodule. Note that adding <code>--force</code> makes Git do this submodule checkout even if the <code>HEAD</code> is already at that entry. But since each submodule is its own Git repository, the submodule's checkout has its own interaction with its own (per-repository / per-work-tree) index and work-tree.2 Again, every submodule is its own Git repository, which means a submodule of the current superproject may have submodules of its own. If so, this makes the submodule a superproject as well, and this is where the <code>--recursive</code> flag comes in. If you are not nesting submodules, none of this complexity will affect you. <hr> 1In other words, the index for the superproject has an entry for each submodule. The type of this index entry is "gitlink", which stores the SHA-1 read from <code>HEAD</code> in the submodule. These gitlink entries are treated as sort of a weird cross between a symlink and a directory. 2In other words, if you have manually entered one of the submodules and modified the index and/or work-tree, the <code>git checkout</code> run inside that submodule, if any, may still carry your modifications into the new checked-out commit.

Check out a git commit including submodules as they were at that time

Tags:

git

git-submodules

How do we check out a git commit including submodules as they were at that time?

One reason why we might want this is to look at a previous version of the main program for which we need to rebuild it with the submodules in the version that was used at the time of the commit.

Given this, we could even use this in regular workflow:

First update all submodules with git submodule update --remote --merge, then try to build to see if the program can work with the newest version of all submodules.
If it works we are done. If it does not work, then we could go to the previous version of the program incl. the submodule-versions it used and with which it works.
Then update the submodules one-by-one and change the program to work with them.

We can kind-of do it by manually looking at each submodule: which commit had the appropriate timestamp (and hope that the program used the then-most-uptodate version). It would be much better if we could see commit X of the program used submodule commit Y. And check those out for each submodule.

658

asked Jan 29 '17 10:01

Bernd Elkemann

1 Answers

In this case, you just need to run git submodule update --checkout (no --merge, no --remote) after checking out a previous commit.

There is a lot of confusion around submodules. The basics are actually fairly simple though:

Each submodule is its own Git repository.
From the submodule, one refers to the "containing" Git as a superproject.
The superproject records the URL and path—these being things you would normally control or provide by running git clone—for each of its submodules in the .gitmodules file.
Meanwhile, when you make a commit in the superproject, this commit contains, in its snapshot, all the normal trees and files as usual, but also, for each submodule, the commit ID to check out when checking out the submodule.¹

This has the effect of "freezing" the appropriate submodule commit into each superproject commit. It is—or was originally—intended to manage third party code, where the submodule itself changes rarely compared to the superproject.

This model is not at all flexible, and is not suitable for the way many people want to use submodules, which is to keep them at the tip of some branch. So submodules grew the ability to update to branch names, or to be worked-in and have the work rebased and/or merged. These new abilities spawned the submodule.name.update configuration entries and git submodule update --remote options.

If you have not configured any of these items, git submodule update alone will check out the desired (recorded) submodule commits for each submodule recorded in the current, i.e., HEAD, commit of the superproject. If you have configured some of these, you can use git submodule update --checkout to override the configuration and cause a git checkout hash-id in each submodule. Note that adding --force makes Git do this submodule checkout even if the HEAD is already at that entry. But since each submodule is its own Git repository, the submodule's checkout has its own interaction with its own (per-repository / per-work-tree) index and work-tree.²

Again, every submodule is its own Git repository, which means a submodule of the current superproject may have submodules of its own. If so, this makes the submodule a superproject as well, and this is where the --recursive flag comes in. If you are not nesting submodules, none of this complexity will affect you.

¹In other words, the index for the superproject has an entry for each submodule. The type of this index entry is "gitlink", which stores the SHA-1 read from HEAD in the submodule. These gitlink entries are treated as sort of a weird cross between a symlink and a directory.

²In other words, if you have manually entered one of the submodules and modified the index and/or work-tree, the git checkout run inside that submodule, if any, may still carry your modifications into the new checked-out commit.

141

answered Oct 25 '22 13:10

torek

Related questions
                            
                                Using trickle with Git
                            
                                Git repository size is larger than it should be
                            
                                Pharo project on Git
                            
                                How Do I Properly Configure Feature Branch CI with TeamCity
                            
                                How to squash a lot of commits automatically?
                            
                                how to change gitconfig location?
                            
                                How can I define an alias for a Git subcommand (e.g. for `list` in `git stash list`)?
                            
                                resolve git conflict with plumbing commands
                            
                                Project layout with vagrant, docker and git
                            
                                Move git configuration from Windows to Ubuntu
                            
                                How do I view all Git pull requests across repositories in TFS?
                            
                                Sync directories containing git repository with unison
                            
                                git equivalent of 'hg share'?
                            
                                "Cherry-pick" in Github App for Mac
                            
                                Git Grep Multiple Words on Multiple Lines
                            
                                Create a new branch with tracking information
                            
                                Eclipse, Git and Bitbucket - Can't push - Error 401 Unauthorized
                            
                                Xcode Continuous Integration: Configured destination not found
                            
                                Can I link git submodules with some kind of fallback URL? If SSH clone fails, git should be able to clone using https
                            
                                Gerrit: is there a way to push just the top commit to the same branch?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With