I have a file with unicode symbols (russian text).
When I fix some typo I use git diff --color-words=.
to see the changes I've done.
In case of unicode (cyrillic) symbols I get some mess with angle brackets like so:
$ cat p1
привет
$ cat p2
Привет
$ git diff --color-words=. --no-index p1 p2
diff --git 1/p1 2/p2
index d0f56e1..d84c480 100644
--- 1/p1
+++ 2/p2
@@ -1 +1 @@
<D0><BF><9F>ривет
It looks like git diff --color-words=.
is checking the difference between bytes and not between symbols as I expect.
Is there any way to tell git
to work properly with unicode symbols?
UPD about my environment: I get the same on Mac OS and on Linux host.
My shell vars are:
BASH=/bin/bash
HOSTTYPE=x86_64
LANG=ru_RU.UTF-8
OSTYPE=darwin10.0
PS1='\h:\W \u\$ '
SHELL=/bin/bash
SHELLOPTS=braceexpand:emacs:hashall:histexpand:history:interactive-comments:monitor
TERM=xterm-256color
TERM_PROGRAM=iTerm.app
_=-l
I have reset git config to default settings like so:
$ git config -l
core.repositoryformatversion=0
core.filemode=true
core.bare=false
core.logallrefupdates=true
core.ignorecase=true
git version
$ git --version
git version 1.7.3.5
For me less
— the git pager — was to blame (thanks @kostix). Experiment by disabling the pager altogether:
git --no-pager diff p1 p2
My case was commit messages containing emojis; it's fundamentally the same problem though.
$ git log --oneline
93a1866 <U+1F43C>
$ git --no-pager log --oneline
93a1866 🐼
$ export LESS='--raw-control-chars'
$ git log --oneline
93a1866 🐼
$ git config --global core.pager 'less --raw-control-chars'
$ git log --oneline
93a1866 🐼
NB: the --RAW-CONTROL-CHARS
option causes less
to pass through ANSI color escapes, but will still munge other control chars (emoji included). My less
is globally configured with --RAW-CONTROL-CHARS
and my git pager with --raw-control-chars
as above.
For me best solution to this is setting export LESSCHARSET=utf-8
.
In this case both git log -p
and git diff
shows unicode without problems.
The solution for me was to use git difftool.
I wrote this tool https://github.com/chestozo/dmp based on https://code.google.com/p/google-diff-match-patch/.
Sometimes it also gives better diff comparing to git diff --color-words=.
:)
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With