hexdump confusion

Tags:

hexdump

I am playing with the Unix hexdump utility. My input file is UTF-8 encoded, containing a single character ñ, which is C3 B1 in hexadecimal UTF-8.

    hexdump test.txt
    0000000 b1c3
    0000002

Huh? This shows B1 C3 - the inverse of what I expected! Can someone explain?

To get the expected output, I run:

    hexdump -C test.txt
    00000000  c3 b1                                             |..|
    00000002

I thought I understood encodings.

312

asked May 17 '10 07:05

zedoo

1 Answer

This is because hexdump defaults to using 16-bit words and you are running on a little-endian architecture. The byte sequence c3 b1 in the file is thus read as one 16-bit word whose first byte is the low-order byte, which prints as the hex word b1c3. The -C option forces hexdump to work with bytes instead of words, so the bytes appear in file order.
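To see the effect without relying on the machine's byte order, here is a minimal Python sketch that reads the same two bytes both ways. The helper names `as_bytes` and `as_le_words` are made up for illustration, and the `<H` format forces little-endian 16-bit words, which is what hexdump's default output amounts to on a little-endian machine. Zero-padding an odd-length input is an assumption made to keep the sketch simple.

```python
import struct

# UTF-8 encoding of the single character from the question.
data = "ñ".encode("utf-8")  # two bytes: 0xc3 0xb1


def as_bytes(buf):
    """Format each byte in file order, like the hex column of `hexdump -C`."""
    return " ".join(f"{b:02x}" for b in buf)


def as_le_words(buf):
    """Format the buffer as little-endian 16-bit words.

    Assumption: an odd-length buffer is padded with one zero byte.
    """
    if len(buf) % 2:
        buf += b"\x00"
    words = struct.unpack(f"<{len(buf) // 2}H", buf)
    return " ".join(f"{w:04x}" for w in words)


byte_view = as_bytes(data)
word_view = as_le_words(data)

print(byte_view)  # c3 b1
print(word_view)  # b1c3
```

The bytes on disk never change. Only the grouping differs: the word view treats the first byte as the low-order half of a 16-bit number, so it is printed last.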

150

answered Sep 21 '22 11:09

Marcelo Cantos

