Git: Is there a way to figure out where a commit was cherry-pick'ed from?

Tags:

git

version-control

If I cherry-pick from multiple branches, is there a simple way to figure out where the commit was coming from (e.g. the sha of the original commit)?

Example:
- at master branch
- cherry pick commit A from a dev branch
- A becomes D at the master branch

Before:

    * B (master) Feature Y
    | * C (dev) Feature Z
    | * A Feature X
    |/
    * 3
    * 2
    * 1

After:

    * D (master) Feature X
    * B Feature Y
    | * C (dev) Feature Z
    | * A Feature X
    |/
    * 3
    * 2
    * 1

Is it possible to figure out that D was cherry-picked from A (aside from searching for the commit message)?

Edit:
Although I will go with daggy-fixes (see VonC's answer), I accepted Chris Johnsen's answer because it is closer to the actual question. Thanks guys.

asked May 27 '10 15:05

EricSchaefer

1 Answer

By default, the information about the original, “cherry” commit is not recorded as part of the new commit.

Record the Source Commit in the Commit Message

If you can force the use of particular workflows/options, git cherry-pick has the -x option:

When recording the commit, append to the original commit message a note that indicates which commit this change was cherry-picked from.

This is obviously useless if you cannot rely on the cherry pickers using the option. Also, since the recorded information is just plain text, not an actual reference as far as Git is concerned, even if you use -x you still have to take steps to make sure that the original commit is kept alive (e.g. it is part of the DAG of a tag or a non-rewinding branch).
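To illustrate what -x leaves behind: it appends a line of the form "(cherry picked from commit <sha>)" to the message. The helper below is only a sketch for pulling that SHA back out of a commit message; the function name picked_from is made up for this example, and it assumes the line was not edited by hand afterwards.

```shell
# Hypothetical helper: read a commit message on stdin and print every
# source SHA recorded by `git cherry-pick -x`.
picked_from() {
    sed -n 's/^(cherry picked from commit \([0-9a-f]\{4,\}\))$/\1/p'
}

# Possible usage inside a repository (D is the picked commit):
#   git log -1 --format=%B D | picked_from
```

If nothing is printed, the commit either was not picked with -x or its message was rewritten.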

git cherry and git patch-id

If you can restrict your search to two particular branches of the history DAG, then git cherry can find both “unpicked” and “picked” cherries.

Note: This command (and the related git patch-id) can only identify conflict-free cherries that were individually picked without extra changes. If there was a conflict while picking the cherry (e.g. you had to slightly modify it to get it to apply), or you used -n/--no-commit to stage extra changes (e.g. multiple cherries in a single commit), or the content of the commit was rewritten after the picking, then you will have to rely on commit message comparison (or the -x information if it was recorded).

git cherry is not really designed to identify the origin of picked cherries, but we can abuse it a bit to identify single cherry pairs.

Given the following history DAG (modeled on the original poster’s example, except that here D is picked from C):

    1---2---3---B---D  master
             \
              A---C    dev
    # D is a cherry-picked version of C

you will see something like this:

    % git cherry master dev
    + A
    - C
    % git cherry dev master
    + B
    - D

(A, B, C, and D are full SHA-1 hashes in the real output)

Since we see one cherry (the - lines) in each list, they must form a cherry pair. D was cherry-picked from C (or vice versa; you cannot tell by the DAG alone, though the commit dates might help).
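As a rough sketch of the "commit dates might help" idea: a cherry-pick normally keeps the original author date but gets a fresh committer date, so the commit with the older committer date is probably the original. This assumes neither commit was rewritten later (a rebase or amend resets the committer date). The function name oldest_first is invented for this example.

```shell
# Hypothetical helper: read "<committer-timestamp> <sha>" lines on stdin
# and print the SHAs ordered from oldest to newest.
oldest_first() {
    sort -n | awk '{ print $2 }'
}

# Possible usage inside a repository (C and D are the two cherries):
#   git show -s --format='%ct %H' C D | oldest_first
```

The first SHA printed is the likelier original; treat it as a hint rather than proof.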

If you are dealing with more than one potential cherry, you will have to “roll your own” program to do the mapping. The code should be easy in any language with associative arrays, hashes, dictionaries, or equivalent. In awk, it might look like this:

    match_cherries() {
        a="$(git rev-parse --verify "$1")" &&
        b="$(git rev-parse --verify "$2")" &&
        git rev-list "$a...$b" | xargs git show | git patch-id |
        awk '
            { p[$1] = p[$1] " " $2 }
            END {
                for (i in p) {
                    l=length(p[i])
                    if (l>41) print substr(p[i],2,l-1)
                }
            }'
    }
    match_cherries master dev

With an extended example that has two picked cherries:

    1---2---3---B---D---E  master
             \
              A---C        dev
    # D is a cherry-picked version of C
    # E is a cherry-picked version of A

The output might look like this:

    match_cherries master dev
    D C
    E A

(A, C, D, and E are full SHA-1 hashes in the real output)

This tells us that C and D represent the same change and that E and A represent the same change. As before, there is no way to tell which of each pair was “the first” unless you also consider (e.g.) the commit dates of each commit.
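The awk step above decides that a patch ID has several commits by checking the string length against one 40-character SHA-1. A variant that counts commits instead might look like the sketch below; it should behave the same for SHA-1 repositories and not depend on the hash length. The function name group_by_patch_id is made up for this example, and the input is assumed to be the "<patch-id> <commit>" lines that git patch-id prints.

```shell
# Hypothetical helper: read "<patch-id> <commit>" lines on stdin and print,
# one group per line, the commits that share a patch ID with another commit.
group_by_patch_id() {
    awk '
        { n[$1]++; c[$1] = (n[$1] > 1 ? c[$1] " " : "") $2 }
        END { for (id in n) if (n[id] > 1) print c[id] }'
}

# Possible usage, mirroring the pipeline above:
#   git rev-list "$a...$b" | xargs git show | git patch-id | group_by_patch_id
```

The order of the printed groups is not defined, so pipe the result through sort if you need stable output.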

Commit Message Comparison

If your cherries were not picked with -x, or they are “dirty” (had conflicts, or had other changes added to them, i.e. with --no-commit plus staging extra changes, or with git commit --amend or another “history rewriting” mechanism), then you may have to fall back on the less reliable technique of comparing commit messages.

This technique works best if you can find some bit of the commit message that is likely to be unique to the commit and is unlikely to have changed in the commit that resulted from the cherry pick. The bit that would work best would depend on the style of commit messages used in your project.

Once you have picked out an “identifying part” of the message, you can use git log to find commits (also demonstrated in Jefromi’s answer).

git log --grep='unique part of the commit message' dev...master

The argument to --grep is actually a regular expression, so you might need to escape any regexp metacharacters ([]*?.\).
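If the identifying text contains such characters, one option is to escape it before passing it to --grep. The sketch below assumes the default basic regular expression syntax (no -E or -P) and therefore escapes only the characters that are special there; the function name escape_bre is invented for this example. Alternatively, git log accepts -F (--fixed-strings), which treats the pattern as a literal string.

```shell
# Hypothetical helper: backslash-escape basic-regex metacharacters in $1
# so that the text is matched literally.
escape_bre() {
    printf '%s' "$1" | sed 's/[][\.*^$]/\\&/g'
}

# Possible usage inside a repository:
#   git log --grep="$(escape_bre 'fix [bug] v1.2*')" dev...master
# or, without escaping:
#   git log -F --grep='fix [bug] v1.2*' dev...master
```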

If you are not sure which branches might hold the original commit and the new commit, you can use --all as Jefromi showed.

answered Oct 14 '22 13:10

Chris Johnsen
