Please note: I first took a look at this question hoping it would answer my question, but I think mine is slightly different! <hr> I was told that running <code>git remote update origin --prune</code> locally would force my local repo to have the exact same branches as the origin repo. However I can find loads of info/documentation surrounding the use of <code>git remote prune origin</code>, but not <code>git remote update origin --prune</code>... Is my understanding of this command correct? If not, how am I misled? And if I am correct, then what happens if I have local feature branches that haven't been pushed remotely yet? What happens if I have changes to branches that do existing remotely/on the origin but have I have not pushed those changes yet? Thanks in advance!

<h3>TL;DR</h3> The <code>git remote update remote</code> command is the same as the <code>git fetch remote</code> command. (There were some bugs in some versions of Git that made them different, but they are supposed to do the same thing.) With <code>--prune</code>, the update process simply removes any remote-tracking names that exist in your repository, but no longer correspond to a branch name in the repository at <code>remote</code>. <h3>Long</h3> <blockquote> I was told that running <code>git remote update origin --prune</code> locally would force my local repo to have the exact same branches as the origin repo. </blockquote> That's not quite right. Well, that's not at all right, really. And this does tie in to the way you've phrased your question title: <blockquote> Effects ... on local changes </blockquote> When using Git, it's important to recognize several things that are odd / different-from most other version control systems. The first is that branches, or branch names more precisely, don't really matter that much. Branch names like <code>master</code> and <code>develop</code> are mostly just for use by humans, and you can do pretty much anything you want with them, including changing them around arbitrarily. (There are a few exceptions that we'll get to in a bit.) What you can't change—and what really matters to Git—are commit hash IDs. You will have seen these things in <code>git log</code> output: they're big ugly number-and-letter strings like <code>b5101f929789889c2e536d915698f58d5c5c6b7a</code>, which is a commit in the Git repository for Git. These numbers (these are actually hexadecimal representations) are unique and universal: every Git everywhere will use that number for that commit. If you're not working with a clone of this particular Git repository, you won't have that commit. If you are working with a clone of this particular Git repository and don't have this commit, your Git just needs to connect to another Git that does have this commit, and you'll get this commit. These numbers are how Git identifies and finds commits. They're what Git really cares about: the commits themselves, and these unique hash IDs to identify them. Each commit contains some number of other, earlier commits' hash IDs. Usually each commit has just one earlier commit hash ID. That one earlier commit ID is the parent of the commit. So given some commit hash ID, your Git can tell if you have the commit. If so, your Git can fish out the parent hash ID, and tell if you have that commit, and then fish out its parent ID, and so on. What this means is that given any one commit (by hash ID), your Git can find all the commits, one by one, working backwards. This makes for an easy way to draw them: <pre class="prettyprint"><code>... <-F <-G <-H </code></pre> where <code>H</code> stands in for some commit hash. Git will find the actual commit, read out its parent hash ID <code>G</code>, read the commit, find its parent <code>F</code>, and so on. The action stops when Git gets back to the very first commit ever made, which has no parent hash. Nothing in any commit can ever change. (In part, that's because the actual hash ID is a cryptographic checksum of all of the contents of the commit. If you were to change anything—even a single bit—you'd get a new, different hash for a new, different commit.) So since <code>H</code> store's <code>G</code>'s hash ID, <code>H</code> will always, forever, point back to <code>G</code>. The one-way-ness of the connecting arrows is therefore mostly irrelevant and we can stop drawing them; we just need to remember that Git itself has to work backwards. A branch name, like <code>master</code> or <code>develop</code>, is just a way that your Git offers to let you help remember one of these hash IDs. It only remembers one of them! The one it remembers is, by definition, the latest one. If you change the number associated with your <code>master</code>, you've told your Git: Use a different commit as the latest one. Hence we augment the picture a bit: <pre class="prettyprint"><code>...--F--G--H <-- master </code></pre> Thus, the hash ID in <code>H</code> is that of the last commit that Git should treat as being "on" branch <code>master</code>. When you make a new commit, starting with <code>git checkout master</code> and doing all the usual stuff up to the point where you run <code>git commit</code>, Git writes out a new commit. This new commit gets a new, unique hash ID, which we can call <code>I</code>: <pre class="prettyprint"><code>...--F--G--H <-- master \ I </code></pre> Having written out the commit and found its cryptographic-checksum hash ID, Git now writes that hash ID into the name <code>master</code>, so that <code>master</code> points to <code>I</code>: <pre class="prettyprint"><code>...--F--G--H \ I <-- master </code></pre> and now <code>I</code> is the tip commit of <code>master</code>. You / your Git can still find commit <code>H</code>, by starting at <code>I</code> and walking back one step. If something were to overwrite your <code>master</code> and make it point back to <code>H</code>, it would become very difficult to find commit <code>I</code>, because the internal arrows all point backwards. <code>I</code> points to <code>H</code>—the child knows its parent—but <code>H</code> has no idea that <code>I</code> exists at all. (It didn't when whoever made <code>H</code>, made it, and <code>H</code> can't change now!) For this reason (among other reasons), your branch names are yours. No one but you and your Git can write new numbers into them. The hash IDs of actual commits are unique and universal: if someone else makes a new commit, that new commit gets a new hash ID that no other commit will ever have, and your Git will know if you have that commit or not. If you need it, your Git will get it from their Git when you connect your Git to their Git. But your branch names are yours. So <code>git fetch</code> or <code>git remote update</code>—again, both do the same thing—will call up some other Git, and get any new commits from them that you don't have. Let's say you <code>git fetch origin</code> to call up a Git at the URL you're calling <code>origin</code>. If you added commit <code>I</code> to your <code>master</code>, and they added commit <code>J</code> to theirs, your Git now has: <pre class="prettyprint"><code>...--F--G--H--J <-- ??? \ I <-- master </code></pre> How will your Git find commit <code>J</code>? Writing that hash ID into your <code>master</code> is obviously a bad idea: that loses <code>I</code>! So where can your Git write that hash ID? Git's answer to this is remote-tracking names (which most people call remote-tracking branches, but I think Git already uses the word branch too much). Your Git's name, with which your Git remembers <code>origin</code>'s <code>master</code>, is <code>origin/master</code>, and the picture should read: <pre class="prettyprint"><code>...--F--G--H--J <-- origin/master \ I <-- master </code></pre> If you want to add commit <code>I</code>—or another different commit that's just as good—to their master, you now have to decide, do you really want <code>I</code> itself, or another commit that's just as good? If you want to keep <code>I</code> itself, you'd merge <code>I</code> and <code>J</code> to make a new commit that has two parents: <pre class="prettyprint"><code>...--F--G--H--J <-- origin/master \ \ I--M <-- master </code></pre> Since merge commit <code>M</code> has arrows back to both <code>J</code> and <code>I</code>, you can now send them commit <code>M</code> and ask their Git to set their <code>master</code> to point to commit <code>M</code>. <h3>Your Git has a remote-tracking name for each of their branches</h3> When you run <code>git fetch origin</code> or <code>git remote update origin</code>, your Git calls up the other Git at <code>origin</code> and asks for a list of all of its branches. Their branches are of course theirs, but your Git would like to get any new commits they've made and remember the latest one for you. So if they have <code>master</code>, <code>develop</code>, <code>feature/A</code>, and <code>feature/B</code>, your Git gets any commits they have that you don't and remembers their branch tips under your <code>origin/*</code> names, corresponding to their branch names: <pre class="prettyprint"><code> L--N <-- origin/feature/A / ...--F--G--H--J <-- origin/master \ I <-- master </code></pre> If they've deliberately removed some commit(s) from some of their branches by writing an older hash ID into their branch name(s), your Git will adjust your remote-tracking branches correspondingly. In some cases, this may result in your Git forgetting their commits (if you've never bothered to make your own names to save them): <pre class="prettyprint"><code> N [abandoned] / L <-- origin/feature/A / ...--F--G--H--J <-- origin/master \ I <-- master </code></pre> Commit <code>N</code> is no longer findable, and eventually <code>git gc</code> will remove it entirely from your repository. (This is where branch names, or other names, become important: they not only let you find the commits, they also protect the tip commit from the Grim <strike>Reaper</strike> Collector. Those tip commits protect their parents, who protect commits further back in the chain, all the way to the root commit.) Suppose, though, that at some point they decide that feature <code>A</code> is finished. They have merged it back into their master, or fast-forwarded their master to point to it directly: <pre class="prettyprint"><code>...--F--G--H--J--L <-- master, feature/A [no origin/ -- this is THEIR Git!] </code></pre> which in your Git is: <pre class="prettyprint"><code>...--F--G--H--J--L <-- origin/master, origin/feature/A \ I <-- master </code></pre> They may now delete their <code>feature/A</code> name entirely. When they do, your Git stops seeing new values for your <code>origin/feature/A</code>. Without <code>--prune</code>, however, your Git *doesn't remove your <code>origin/feature/A</code>. So you continue to remember <code>origin/feature/A</code> as specifying commit <code>L</code>. This doesn't do any real harm; you'll just think, based on looking at your <code>origin/*</code> names, that they still have <code>feature/A</code> and that it means commit <code>L</code>. Whether to use <code>--prune</code> is up to you. I turn it on automatically in my own Git configuration; this keeps my repositories less-cluttered.

Effects of git remote update origin --prune on local changes

Tags:

git

Please note: I first took a look at this question hoping it would answer my question, but I think mine is slightly different!

I was told that running git remote update origin --prune locally would force my local repo to have the exact same branches as the origin repo. However I can find loads of info/documentation surrounding the use of git remote prune origin, but not git remote update origin --prune...

Is my understanding of this command correct? If not, how am I misled? And if I am correct, then what happens if I have local feature branches that haven't been pushed remotely yet? What happens if I have changes to branches that do existing remotely/on the origin but have I have not pushed those changes yet?

Thanks in advance!

467

asked Apr 17 '19 16:04

hotmeatballsoup

1 Answers

TL;DR

The git remote update remote command is the same as the git fetch remote command. (There were some bugs in some versions of Git that made them different, but they are supposed to do the same thing.) With --prune, the update process simply removes any remote-tracking names that exist in your repository, but no longer correspond to a branch name in the repository at remote.

Long

I was told that running git remote update origin --prune locally would force my local repo to have the exact same branches as the origin repo.

That's not quite right. Well, that's not at all right, really. And this does tie in to the way you've phrased your question title:

Effects ... on local changes

When using Git, it's important to recognize several things that are odd / different-from most other version control systems. The first is that branches, or branch names more precisely, don't really matter that much. Branch names like master and develop are mostly just for use by humans, and you can do pretty much anything you want with them, including changing them around arbitrarily. (There are a few exceptions that we'll get to in a bit.)

What you can't change—and what really matters to Git—are commit hash IDs. You will have seen these things in git log output: they're big ugly number-and-letter strings like b5101f929789889c2e536d915698f58d5c5c6b7a, which is a commit in the Git repository for Git. These numbers (these are actually hexadecimal representations) are unique and universal: every Git everywhere will use that number for that commit. If you're not working with a clone of this particular Git repository, you won't have that commit. If you are working with a clone of this particular Git repository and don't have this commit, your Git just needs to connect to another Git that does have this commit, and you'll get this commit.

These numbers are how Git identifies and finds commits. They're what Git really cares about: the commits themselves, and these unique hash IDs to identify them.

Each commit contains some number of other, earlier commits' hash IDs. Usually each commit has just one earlier commit hash ID. That one earlier commit ID is the parent of the commit. So given some commit hash ID, your Git can tell if you have the commit. If so, your Git can fish out the parent hash ID, and tell if you have that commit, and then fish out its parent ID, and so on. What this means is that given any one commit (by hash ID), your Git can find all the commits, one by one, working backwards. This makes for an easy way to draw them:

Click to copy

... <-F <-G <-H

where H stands in for some commit hash. Git will find the actual commit, read out its parent hash ID G, read the commit, find its parent F, and so on. The action stops when Git gets back to the very first commit ever made, which has no parent hash.

Nothing in any commit can ever change. (In part, that's because the actual hash ID is a cryptographic checksum of all of the contents of the commit. If you were to change anything—even a single bit—you'd get a new, different hash for a new, different commit.) So since H store's G's hash ID, H will always, forever, point back to G. The one-way-ness of the connecting arrows is therefore mostly irrelevant and we can stop drawing them; we just need to remember that Git itself has to work backwards.

A branch name, like master or develop, is just a way that your Git offers to let you help remember one of these hash IDs. It only remembers one of them! The one it remembers is, by definition, the latest one. If you change the number associated with your master, you've told your Git: Use a different commit as the latest one. Hence we augment the picture a bit:

Click to copy

...--F--G--H   <-- master

Thus, the hash ID in H is that of the last commit that Git should treat as being "on" branch master.

When you make a new commit, starting with git checkout master and doing all the usual stuff up to the point where you run git commit, Git writes out a new commit. This new commit gets a new, unique hash ID, which we can call I:

Click to copy

...--F--G--H   <-- master
            \
             I

Having written out the commit and found its cryptographic-checksum hash ID, Git now writes that hash ID into the name master, so that master points to I:

Click to copy

...--F--G--H
            \
             I   <-- master

and now I is the tip commit of master. You / your Git can still find commit H, by starting at I and walking back one step.

If something were to overwrite your master and make it point back to H, it would become very difficult to find commit I, because the internal arrows all point backwards. I points to H—the child knows its parent—but H has no idea that I exists at all. (It didn't when whoever made H, made it, and H can't change now!)

For this reason (among other reasons), your branch names are yours. No one but you and your Git can write new numbers into them. The hash IDs of actual commits are unique and universal: if someone else makes a new commit, that new commit gets a new hash ID that no other commit will ever have, and your Git will know if you have that commit or not. If you need it, your Git will get it from their Git when you connect your Git to their Git. But your branch names are yours.

So git fetch or git remote update—again, both do the same thing—will call up some other Git, and get any new commits from them that you don't have. Let's say you git fetch origin to call up a Git at the URL you're calling origin. If you added commit I to your master, and they added commit J to theirs, your Git now has:

Click to copy

...--F--G--H--J   <-- ???
            \
             I   <-- master

How will your Git find commit J? Writing that hash ID into your master is obviously a bad idea: that loses I! So where can your Git write that hash ID?

Git's answer to this is remote-tracking names (which most people call remote-tracking branches, but I think Git already uses the word branch too much). Your Git's name, with which your Git remembers origin's master, is origin/master, and the picture should read:

Click to copy

...--F--G--H--J   <-- origin/master
            \
             I   <-- master

If you want to add commit I—or another different commit that's just as good—to their master, you now have to decide, do you really want I itself, or another commit that's just as good? If you want to keep I itself, you'd merge I and J to make a new commit that has two parents:

Click to copy

...--F--G--H--J   <-- origin/master
            \  \
             I--M   <-- master

Since merge commit M has arrows back to both J and I, you can now send them commit M and ask their Git to set their master to point to commit M.

Your Git has a remote-tracking name for each of their branches

When you run git fetch origin or git remote update origin, your Git calls up the other Git at origin and asks for a list of all of its branches. Their branches are of course theirs, but your Git would like to get any new commits they've made and remember the latest one for you. So if they have master, develop, feature/A, and feature/B, your Git gets any commits they have that you don't and remembers their branch tips under your origin/* names, corresponding to their branch names:

Click to copy

                L--N   <-- origin/feature/A
               /
...--F--G--H--J   <-- origin/master
            \
             I   <-- master

If they've deliberately removed some commit(s) from some of their branches by writing an older hash ID into their branch name(s), your Git will adjust your remote-tracking branches correspondingly. In some cases, this may result in your Git forgetting their commits (if you've never bothered to make your own names to save them):

Click to copy

                  N   [abandoned]
                 /
                L   <-- origin/feature/A
               /
...--F--G--H--J   <-- origin/master
            \
             I   <-- master

Commit N is no longer findable, and eventually git gc will remove it entirely from your repository. (This is where branch names, or other names, become important: they not only let you find the commits, they also protect the tip commit from the Grim ~~Reaper~~ Collector. Those tip commits protect their parents, who protect commits further back in the chain, all the way to the root commit.)

Suppose, though, that at some point they decide that feature A is finished. They have merged it back into their master, or fast-forwarded their master to point to it directly:

Click to copy

...--F--G--H--J--L   <-- master, feature/A   [no origin/ -- this is THEIR Git!]

which in your Git is:

Click to copy

...--F--G--H--J--L   <-- origin/master, origin/feature/A
            \
             I   <-- master

They may now delete their feature/A name entirely. When they do, your Git stops seeing new values for your origin/feature/A.

Without --prune, however, your Git *doesn't remove your origin/feature/A. So you continue to remember origin/feature/A as specifying commit L. This doesn't do any real harm; you'll just think, based on looking at your origin/* names, that they still have feature/A and that it means commit L.

Whether to use --prune is up to you. I turn it on automatically in my own Git configuration; this keeps my repositories less-cluttered.

109

answered Nov 15 '22 02:11

torek

Related questions
                            
                                Windows .bat file to schedule git add, commit and push to Github
                            
                                Trying to set file name length limit with git - Permission denied
                            
                                Uncommit all commits in current branch but leave all changes at the current state
                            
                                How do I merge changes in Git in files that I moved?
                            
                                How to create a git pull request on a public github repository
                            
                                How to use RUN clone git in dockerfile
                            
                                Delete stashed changes older than X days
                            
                                Why does git uses 2 different commands to show HEAD?
                            
                                golang git pulling a repo
                            
                                How to do submodule sparse-checkout with Git?
                            
                                PowerShell Capture Git Output
                            
                                Ignoring node_modules using .gitignore
                            
                                Can't push to GitHub? (error: git-lfs died of signal 11)
                            
                                How to delete a branch using Bitbucket REST API
                            
                                How can I determine whether a file in git is executable?
                            
                                How do I re-checkout all files in Git to convert from CRLF to LF?
                            
                                Git merge only the diff between two branches
                            
                                How to undo a git commit without losing my files? [duplicate]
                            
                                How to shorten output of 'git pull' command?
                            
                                Move a repository from Github to Gitlab

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Effects of git remote update origin --prune on local changes

Tags:

git

hotmeatballsoup

People also ask

1 Answers

TL;DR

Long

Your Git has a remote-tracking name for each of their branches

torek

Recent Activity

Donate For Us