I have the same HTML file rendered in two different ways and want to compare it using git diff
, taking care of ignoring every white-space, tab, line-break, carriage-return, or anything that is not strictly the source code of my files.
I'm actually trying this:
git diff --no-index --color --ignore-all-space <file1> <file2>
but when some html tags are collapsed all on one line (instead of one per line and tabulated) git-diff detect is as a difference (while for me it is not).
<html><head><title>TITLE</title><meta ......
is different from
<html> <head> <title>TITLE</title> <meta ......
What option do I miss to accomplish what I need and threat as if it was the same?
The --ignore-trailing-space ( -Z ) option ignores white space at line end. Re: "-w or --ignore-all-space option does not ignore newline-related changes" So -w ignores all whitespace, except for the whitespace it doesn't ignore.
Use the git diff Command to Ignore Whitespaces in Git We use the git diff -w command to ignore all whitespace differences. It will ignore spaces at the beginning, middle, and end of lines. We use the git diff --ignore-space-at-eol command to ignore whitespace changes at the end of our lines.
^M represents carriage return. This diff means something removed a Unicode BOM from the beginning of the line and added a CR at the end.
git diff
supports comparing files line by line or word by word, and also supports defining what makes a word. Here you can define every non-space character as a word to do the comparison. In this way, it will ignore all spaces including white-spcae, tab, line-break and carrige-return as what you need.
To achieve it, there's a perfect option --word-diff-regex
, and just set it --word-diff-regex=[^[:space:]]
. Refer to doc for detail.
git diff --no-index --word-diff-regex=[^[:space:]] <file1> <file2>
Here's an example. I created two files, with a.html
as follows:
<html><head><title>TITLE</title><meta>
With b.html
as follows:
<html> <head> <title>TI==TLE</title> <meta>
By running
git diff --no-index --word-diff-regex=[^[:space:]] a.html b.html
It highlights the difference of TITLE
and TI{+==+}TLE
in the two files in plain
mode as follows. You can also specify --word-diff=<mode>
to display results in different modes. The mode
can be color
, plain
, porcelain
and none
, and with plain
as default.
diff --git a/d.html b/a.html index df38a78..306ed3e 100644 --- a/d.html +++ b/a.html @@ -1 +1,4 @@ <html> <head> <title>TI{+==+}TLE</title> <meta>
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With