I truly believe that to have one commit on one issue is a good practice. I'm sure I read it somewhere in an article like “Best practices”. As such, my workflow has been the following: <ul> <li>For a new issue, I create a new local branch with <code>git checkout -b new-issue</code>.</li> <li>Commit all changes into it. Sometimes this involves lots of commits.</li> <li>When done, I <code>squash</code> the commits and <code>rebase</code> them into current thematic branch.</li> <li>If something goes wrong, I can <code>git revert</code> the commit, find the bug, fix it, and commit new patch into the thematic branch. I won't change the remote repository’s history.</li> </ul> But today, I was surprised to hear the following workflow: <ul> <li>Create new branch for the new issue.</li> <li>Commit everything into it.</li> <li>Use <code>merge --no-ff</code> to merge the issue branch with thematic branch (so we’ll have “merge-commit” that we can <code>revert</code>).</li> <li>If something goes wrong, we can use <code>git bisect</code> to find the bug.</li> </ul> <hr> According to the 1st approach, we’ll have a clean git history, and no idea about overhead branches used during development. According to the 2nd approach, we’ll have a very messy history, with a lot of ugly, unnecessary merges and commits for just one issue. However, we can use <code>git bisect</code> to find bugs. (Perhaps this is better for refactoring?) <hr> <ul> <li>What pros and cons do you see for both approaches? </li> <li>Which approach do you use, and why?</li> <li>In practice, have you actually used <code>git bisect</code> to find bugs? (I haven't…)</li> </ul>

The second approach doesn't have to have a lot of ugly and unnecessary merges and commits. This is what I prefer to do: <ol> <li>create a new topic branch</li> <li>make a bunch of commits</li> <li>just before merging back to the parent branch, clean up the commits: <ul> <li>rebase onto the latest version of the parent branch</li> <li>squash typo fix commits</li> <li>split commits doing multiple things at once into separate commits</li> <li>reorder the commits to make it easier for a reviewer to understand the sequence of changes</li> <li>etc.</li> </ul> </li> <li>merge with <code>--no-ff</code> into the parent branch</li> </ol> The above steps result in a history that looks like this: <pre class="prettyprint"><code>* 354b644 Merge branch 'topic3' |\ | * 54527e0 remove foo now that it is no longer used | * 1ef3dad stop linking against foo | * 7dfc7e5 wrap lines longer than 80 characters, no other changes | * b45fbcf delete end-of-line whitespace, fix indendataion |/ * db13612 Merge branch 'topic2' |\ | * 961eebf unbreak build by adding a missing semicolon |/ * a5b6b16 Merge branch 'topic1' |\ ... (more history not shown) </code></pre> The above graph has all the same advantages of approach #1: <ul> <li> You can use the <code>--first-parent</code> argument to <code>git log</code> to get a concise summary that resembles what you would get with approach #1: <pre class="prettyprint"><code>* 354b644 Merge branch 'topic3' * db13612 Merge branch 'topic2' * a5b6b16 Merge branch 'topic1' ... (more history not shown) </code></pre> </li> <li>You can still easily examine the entirety of changes made in a topic branch. For example, <code>git diff 354b644^..354b644</code> will show you what was changed for topic #3.</li> </ul> But you get benefits that approach #1 can't give you: <ul> <li>The history is much easier to review: commits <code>b45fbcf</code> and <code>7dfc7e5</code> (for the <code>topic3</code> branch) introduce a lot of noise but no actual logic changes. Someone trying to answer the question, "What logic changes were made for topic #3?" might have a hard time digging through the noise if all of those commits were squashed into one.</li> <li>The merge commits nicely identify the context for the series of commits on the merged branch (e.g., this group of commits were made to address topic #3).</li> <li>The finer granularity of commits makes it easier to figure out why a particular change was made, which can help distinguish accidental changes from intentional-but-subtle.</li> <li>If multiple people collaborated on the branch, you can see who they all were and how much each person contributed.</li> <li>The number of commits on the merged topic branch gives you a rough idea about how much was changed.</li> <li>The time range of the commits can provide useful context.</li> <li>You can easily cherry-pick a specific change made onto a different branch (e.g., cherry-pick the minimal change needed to fix a bug onto a release branch).</li> </ul> There is one disadvantage I can think of: It may be hard to configure your software development tools to only follow the first-parent path and ignore all of those intermediate commits. For example, there is no <code>--first-parent</code> argument to <code>git bisect</code>. Also, I'm not familiar enough with Jenkins to know how easy it is to configure it to prioritize building and testing the first-parent path over all the other commits.

What git commit practice is better?

Tags:

git

merge

rebase

commit

bisect

I truly believe that to have one commit on one issue is a good practice. I'm sure I read it somewhere in an article like “Best practices”.

As such, my workflow has been the following:

For a new issue, I create a new local branch with git checkout -b new-issue.
Commit all changes into it. Sometimes this involves lots of commits.
When done, I squash the commits and rebase them into current thematic branch.
If something goes wrong, I can git revert the commit, find the bug, fix it, and commit new patch into the thematic branch. I won't change the remote repository’s history.

But today, I was surprised to hear the following workflow:

Create new branch for the new issue.
Commit everything into it.
Use merge --no-ff to merge the issue branch with thematic branch (so we’ll have “merge-commit” that we can revert).
If something goes wrong, we can use git bisect to find the bug.

According to the 1st approach, we’ll have a clean git history, and no idea about overhead branches used during development.

According to the 2nd approach, we’ll have a very messy history, with a lot of ugly, unnecessary merges and commits for just one issue. However, we can use git bisect to find bugs. (Perhaps this is better for refactoring?)

What pros and cons do you see for both approaches?
Which approach do you use, and why?
In practice, have you actually used git bisect to find bugs? (I haven't…)

833

asked Mar 21 '13 16:03

Viacheslav Kondratiuk

1 Answers

The second approach doesn't have to have a lot of ugly and unnecessary merges and commits. This is what I prefer to do:

create a new topic branch
make a bunch of commits
just before merging back to the parent branch, clean up the commits:
- rebase onto the latest version of the parent branch
- squash typo fix commits
- split commits doing multiple things at once into separate commits
- reorder the commits to make it easier for a reviewer to understand the sequence of changes
- etc.
merge with --no-ff into the parent branch

The above steps result in a history that looks like this:

*   354b644 Merge branch 'topic3'
|\
| * 54527e0 remove foo now that it is no longer used
| * 1ef3dad stop linking against foo
| * 7dfc7e5 wrap lines longer than 80 characters, no other changes
| * b45fbcf delete end-of-line whitespace, fix indendataion
|/
*   db13612 Merge branch 'topic2'
|\
| * 961eebf unbreak build by adding a missing semicolon
|/
*   a5b6b16 Merge branch 'topic1'
|\
... (more history not shown)

The above graph has all the same advantages of approach #1:

You can use the --first-parent argument to git log to get a concise summary that resembles what you would get with approach #1:

* 354b644 Merge branch 'topic3'
* db13612 Merge branch 'topic2'
* a5b6b16 Merge branch 'topic1'
... (more history not shown)

You can still easily examine the entirety of changes made in a topic branch. For example, git diff 354b644^..354b644 will show you what was changed for topic #3.

But you get benefits that approach #1 can't give you:

The history is much easier to review: commits b45fbcf and 7dfc7e5 (for the topic3 branch) introduce a lot of noise but no actual logic changes. Someone trying to answer the question, "What logic changes were made for topic #3?" might have a hard time digging through the noise if all of those commits were squashed into one.
The merge commits nicely identify the context for the series of commits on the merged branch (e.g., this group of commits were made to address topic #3).
The finer granularity of commits makes it easier to figure out why a particular change was made, which can help distinguish accidental changes from intentional-but-subtle.
If multiple people collaborated on the branch, you can see who they all were and how much each person contributed.
The number of commits on the merged topic branch gives you a rough idea about how much was changed.
The time range of the commits can provide useful context.
You can easily cherry-pick a specific change made onto a different branch (e.g., cherry-pick the minimal change needed to fix a bug onto a release branch).

There is one disadvantage I can think of: It may be hard to configure your software development tools to only follow the first-parent path and ignore all of those intermediate commits. For example, there is no --first-parent argument to git bisect. Also, I'm not familiar enough with Jenkins to know how easy it is to configure it to prioritize building and testing the first-parent path over all the other commits.

answered Oct 22 '22 17:10

Richard Hansen

Related questions
                            
                                Is a develop branch useless in git-flow?
                            
                                does Java have API wrappers around subversion and Git?
                            
                                Someone sent me a pull request, what do I do?
                            
                                Clone works, remote push doesn't. Remote repository over copssh
                            
                                Making a git project read only
                            
                                git commit - format?
                            
                                Using git filter-branch to remove commits by their commit message
                            
                                git revert in Egit
                            
                                Error when cloning git "shallow" repository
                            
                                Understanding Git and how to use EGit (git eclipse plugin)
                            
                                How can you tell if a git stash is no longer required?
                            
                                How to set specific path permissions using Git?
                            
                                Is there a way to force the format of a README.txt file on github?
                            
                                git commit auto add new folders or files?
                            
                                bitbucket stripped git revisions
                            
                                Is it safe to use the same ignores file for Git, Mercurial, and Bazaar?
                            
                                How does the git staging area store files?
                            
                                Can I destroy and recreate a Git remote branch in one command?
                            
                                Should Git be used to store continuous integration builds?
                            
                                git diff - show only what's new on the remote

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With