What are the pros-cons of using git worktrees vs maintaining multiple clones with <code>--reference</code> flag? The main scenario I am considering is when a developer needs to maintain multiple git repositories on the disk for old releases (release/1.0, release/2.0, release/3.0) because switching branches on a single git repo and rebuilding would be costly. Using worktrees the developer could have a single clone of the repo, and any old releases could be created as worktrees of the repo using <code>cd /opt/main/</code>, <code>git worktree add /opt/old_release_1 release/1.0</code>. Using reference clones, the developer maintains a main clone somewhere, and uses <code>cd /opt/old_release_1</code>, <code>git clone --reference /opt/main/.git ssh://git@github.com/myrepo.git</code> to create clone repositories for the old releases. It seems like they can both accomplish the same goal. Are there benefits to one over the other in terms of speed, disk space... other things?

They all have a few issues that matter, but using <code>git worktree</code> is probably going to be your best bet. <ul> <li> A clone, let's call this AD for after-dependency clone, made with <code>--reference local-path</code> but without <code>--dissociate</code> uses objects from <code>local-path</code>. By "objects", I mean literal Git objects (stored loosely and/or in pack files). The other Git repository—the one in <code>local-path</code>—has no idea that AD is using these. Let's call the base clone BC. Now, suppose something happens in BC so that an object is no longer needed, such as deleting a branch name or a remote-tracking name. At this point, a <code>git gc</code> run in BC may garbage-collect and delete the object. If you now switch to the AD clone and run various Git operations, they may fail due to the removed object. The problem is that the older BC clone has no idea that the newer AD clone depends on it. Note that AD has, embedded in it, the path name of BC. If you move BC you must edit the <code>.git/objects/info/alternates</code> file in AD. </li> <li> A work-tree made with <code>git worktree add</code> also uses objects from the original clone. Let's still call the original clone BC, with the added work-trees just called Wb. There are two key differences from the BC/AD setup above: <ul> <li>Each new work-tree Wb literally uses the entire <code>.git</code> directory from BC.</li> <li>The BC repository records the path of each Wb, so it knows about each Wb. You won't have the problem of objects disappearing unexpectedly.</li> <li>However, since BC records each Wb and all the branch names actually live inside BC itself, there's a constraint imposed: whatever branch is checked out in BC cannot be checked out in any Wb. Moreover, Wb1 must be "on" (as in <code>git status</code> says <code>on branch ...</code>) a different branch than Wb2, and so on. (You can be in "detached HEAD" mode, i.e., not on any branch at all, in any or all of BC and each Wb.)</li> </ul> Since BC records each Wb path (and vice versa), if you want to move any of these repositories, you must adjust the paths. </li> </ul>

git worktrees vs "clone --reference"

Tags:

git

git-worktree

What are the pros-cons of using git worktrees vs maintaining multiple clones with --reference flag? The main scenario I am considering is when a developer needs to maintain multiple git repositories on the disk for old releases (release/1.0, release/2.0, release/3.0) because switching branches on a single git repo and rebuilding would be costly.

Using worktrees the developer could have a single clone of the repo, and any old releases could be created as worktrees of the repo using cd /opt/main/, git worktree add /opt/old_release_1 release/1.0. Using reference clones, the developer maintains a main clone somewhere, and uses cd /opt/old_release_1, git clone --reference /opt/main/.git ssh://[email protected]/myrepo.git to create clone repositories for the old releases.

It seems like they can both accomplish the same goal. Are there benefits to one over the other in terms of speed, disk space... other things?

711

asked Jan 17 '18 18:01

mvd

1 Answers

They all have a few issues that matter, but using git worktree is probably going to be your best bet.

A clone, let's call this AD for after-dependency clone, made with --reference local-path but without --dissociate uses objects from local-path. By "objects", I mean literal Git objects (stored loosely and/or in pack files). The other Git repository—the one in local-path—has no idea that AD is using these.

Let's call the base clone BC. Now, suppose something happens in BC so that an object is no longer needed, such as deleting a branch name or a remote-tracking name. At this point, a git gc run in BC may garbage-collect and delete the object.

If you now switch to the AD clone and run various Git operations, they may fail due to the removed object. The problem is that the older BC clone has no idea that the newer AD clone depends on it.

Note that AD has, embedded in it, the path name of BC. If you move BC you must edit the .git/objects/info/alternates file in AD.
A work-tree made with git worktree add also uses objects from the original clone. Let's still call the original clone BC, with the added work-trees just called W_b. There are two key differences from the BC/AD setup above:
- Each new work-tree W_b literally uses the entire .git directory from BC.
- The BC repository records the path of each W_b, so it knows about each W_b. You won't have the problem of objects disappearing unexpectedly.
- However, since BC records each W_b and all the branch names actually live inside BC itself, there's a constraint imposed: whatever branch is checked out in BC cannot be checked out in any W_b. Moreover, W_b1 must be "on" (as in git status says on branch ...) a different branch than W_b2, and so on. (You can be in "detached HEAD" mode, i.e., not on any branch at all, in any or all of BC and each W_b.)
Since BC records each W_b path (and vice versa), if you want to move any of these repositories, you must adjust the paths.

108

answered Oct 22 '22 23:10

torek

Related questions
                            
                                yarn fails during cloning repo: Permission denied (publickey)
                            
                                Ignore files committed to git and also remove them from history
                            
                                What's the format string for `git reflog` default output?
                            
                                Check if a git repo exists in a shell script
                            
                                Commit changes missing from git merge
                            
                                Git syntax for pull request on the command line
                            
                                Error: pathspec did not match any file(s) known to git
                            
                                Will Git ever clone, fetch, or push orphan commits?
                            
                                What happens to git history if multiple people work on a feature branch but it is squash and merged into master?
                            
                                List remote branches - git branch -a vs git ls-remote --heads origin
                            
                                Any example to use git merge patience strategy?
                            
                                What value to give to -m switch in git revert?
                            
                                GIT Submodules in VSTS online?
                            
                                How to take backup of multiple stash?
                            
                                How to merge git branches in RStudio
                            
                                Git ignoring .gitignore file in parent directory
                            
                                Git: Recursively switching branch (checkout) on all submodules
                            
                                How to answer the git prompt in npm init for a local repo
                            
                                Is there a way to "freeze" a file in Git?
                            
                                Using git to see all logs related to a specific file extension within subdirectories

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With