I was wondering how Windows interprets characters. I made a file with a hex editor with the 3 bytes <code>E3 81 81</code>. Those bytes are the <code>ぁ</code> character in UTF-8. I opened the notepad and it displayed <code>ぁ</code>. I didn't specify the encoding of the file, I just created the bytes and the notepad interpreted it correctly. Is notepad somehow guessing the encoding? Or is the hex editor saving those bytes with a specific encoding?

If the file only contains these three bytes, then there is no information at all about which encoding to use. A byte is just a byte, and there is no way to include any encoding information in it. Besides, the hex editor doesn't even know that you intended to decode the data as text. Notepad normally uses ANSI encoding, so if it reads the file as UTF-8 then it has to guess the encoding based on the data in the file. If you save a file as UTF-8, Notepad will put the BOM (byte order mark) <code>EF BB BF</code> at the beginning of the file.

How does Windows Notepad interpret characters?

1 Answers

If the file only contains these three bytes, then there is no information at all about which encoding to use.

A byte is just a byte, and there is no way to include any encoding information in it. Besides, the hex editor doesn't even know that you intended to decode the data as text.

Notepad normally uses ANSI encoding, so if it reads the file as UTF-8 then it has to guess the encoding based on the data in the file.

If you save a file as UTF-8, Notepad will put the BOM (byte order mark) EF BB BF at the beginning of the file.

121

answered Sep 19 '22 23:09

Guffa

Related questions
                            
                                What do "\\.\", "\??\", "\\?\", "\\" mean?
                            
                                How to find the full path to the Mercurial executable, when Windows is able to locate it?
                            
                                Compile a C++ program with only dependency on kernel32.dll and user32.dll?
                            
                                Can the physical USB port be identified programmatically for a device in Windows?
                            
                                Getting python to print in UTF8 on Windows XP with the console
                            
                                Convert DOC to PDF from Command Line [closed]
                            
                                How to get the rest of arguments in windows batch file?
                            
                                What is the relationship of CloseWindow and WM_CLOSE
                            
                                Windows Command Line Equivalent to "time" in Linux? [duplicate]
                            
                                How can I search for a string in the memory of another process?
                            
                                Cannot overwrite variable because it is readonly
                            
                                Docker Quickstart Terminal fails to start VirtualBox VM in Windows 10
                            
                                Running npm on PowerShell asks "How do you want to open this file?", command line is fine
                            
                                Testing network interrupts in software
                            
                                List of de facto standard keyboard shortcuts for Windows apps?
                            
                                How can I create a new process with another User Account on Windows?
                            
                                How to convert Microsoft Locale ID (LCID) into language code or Locale object in Java
                            
                                Scrolling & zooming SVG Viewer on Windows?
                            
                                Detecting User Activity
                            
                                C#: Glass Forms?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How does Windows Notepad interpret characters?

Tags:

windows

encoding

utf-8

hex-editors

notepad

nEAnnam

People also ask

1 Answers

Guffa

Recent Activity

Donate For Us