Why is File.ReadAllBytes result different than when using File.ReadAllText?

I have a UTF-8 encoded text file whose contents are "test". When I read the file into a byte array and convert that array to a string, the result contains one strange character at the start. I use the following code:

using System;
using System.IO;
using System.Text;

var path = @"C:\Users\Tester\Desktop\test\test.txt"; // UTF-8

// Read the raw bytes and decode them manually
var bytes = File.ReadAllBytes(path);
var contents1 = Encoding.UTF8.GetString(bytes);

// Let the framework read and decode the file
var contents2 = File.ReadAllText(path);

Console.WriteLine(contents1); // result is "?test"
Console.WriteLine(contents2); // result is "test"

contents1 is different from contents2. Why?

asked Sep 29 '14 14:09

Dragon

3 Answers

As explained in ReadAllText's documentation:

This method attempts to automatically detect the encoding of a file based on the presence of byte order marks. Encoding formats UTF-8 and UTF-32 (both big-endian and little-endian) can be detected.

So the file contains a BOM (byte order mark), and the ReadAllText method correctly interprets it, while ReadAllBytes just returns the raw bytes without interpreting them at all.

The documentation for Encoding.GetString says that it only:

decodes all the bytes in the specified byte array into a string

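(emphasis mine). This is of course not entirely conclusive, but your example shows that it is to be taken literally: the three BOM bytes (EF BB BF) are decoded as well, producing the character U+FEFF, which your console displays as "?".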
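To see this without a file, here is a minimal sketch that builds the same bytes in memory. The byte array is my own reconstruction of what such a file would contain (an assumption, since the actual file is not shown), and the code uses top-level statements, so it assumes C# 9 or later.

```csharp
using System;
using System.Text;

// Assumed file contents: the UTF-8 BOM (EF BB BF) followed by "test".
byte[] bytes = { 0xEF, 0xBB, 0xBF, 0x74, 0x65, 0x73, 0x74 };

// GetString decodes every byte, so the BOM becomes the character U+FEFF.
string decoded = Encoding.UTF8.GetString(bytes);

Console.WriteLine(decoded.Length);                     // 5
Console.WriteLine(((int)decoded[0]).ToString("X4"));   // FEFF
Console.WriteLine(decoded.Substring(1));               // test
```

The extra character is invisible in many editors, which is why the two strings can look identical and still compare as different.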

answered Oct 27 '22 01:10

BartoszKP

You are probably seeing the Unicode BOM (byte order mark) at the beginning of the file. File.ReadAllText knows how to strip this off, but Encoding.UTF8.GetString does not.
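If you need to start from a byte array, here is a rough sketch of two ways to drop the BOM yourself. DecodeUtf8SkippingBom is a hypothetical helper name of mine, not a framework method, and the sample bytes are assumed rather than taken from the real file. The code assumes C# 9 or later for top-level statements.

```csharp
using System;
using System.IO;
using System.Text;

// Assumed file contents: the UTF-8 BOM (EF BB BF) followed by "test".
byte[] bytes = { 0xEF, 0xBB, 0xBF, 0x74, 0x65, 0x73, 0x74 };

// Option 1: let StreamReader detect and consume the BOM.
string viaReader;
using (var reader = new StreamReader(new MemoryStream(bytes), Encoding.UTF8, true))
{
    viaReader = reader.ReadToEnd();
}

// Option 2: skip the preamble by hand before decoding.
string viaOffset = DecodeUtf8SkippingBom(bytes);

Console.WriteLine(viaReader);        // test
Console.WriteLine(viaOffset);        // test
Console.WriteLine(viaReader.Length); // 4

// Hypothetical helper: decodes UTF-8 bytes, ignoring a leading BOM if present.
static string DecodeUtf8SkippingBom(byte[] data)
{
    byte[] preamble = Encoding.UTF8.GetPreamble(); // EF BB BF
    bool hasBom = data.Length >= preamble.Length;
    for (int i = 0; hasBom && i < preamble.Length; i++)
    {
        hasBom = data[i] == preamble[i];
    }
    int offset = hasBom ? preamble.Length : 0;
    return Encoding.UTF8.GetString(data, offset, data.Length - offset);
}
```

The StreamReader route is closer to what ReadAllText does; the manual route avoids the extra stream when the bytes are already in memory.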

answered Oct 27 '22 01:10

recursive

It's the UTF-8 preamble, i.e. the byte order mark. It marks the file as UTF-8 encoded. ReadAllText doesn't return it because it is a decoding hint, not part of the text.
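For illustration, here is a small sketch of where that prefix comes from and how to avoid writing it. It uses a temporary file rather than the path from the question, and new UTF8Encoding(false) is my suggestion, not something the question mentions. The code assumes C# 9 or later for top-level statements.

```csharp
using System;
using System.IO;
using System.Text;

// Scratch file for the demo (not the file from the question).
string path = Path.GetTempFileName();

// Encoding.UTF8 writes the preamble EF BB BF in front of the text.
File.WriteAllText(path, "test", Encoding.UTF8);
string withBom = BitConverter.ToString(File.ReadAllBytes(path));
int textLength = File.ReadAllText(path).Length; // ReadAllText drops the preamble

// A UTF8Encoding created with 'false' writes no preamble.
File.WriteAllText(path, "test", new UTF8Encoding(false));
string withoutBom = BitConverter.ToString(File.ReadAllBytes(path));

File.Delete(path);

Console.WriteLine(withBom);    // EF-BB-BF-74-65-73-74
Console.WriteLine(textLength); // 4
Console.WriteLine(withoutBom); // 74-65-73-74
```

If you control how the file is written, saving it without the preamble makes ReadAllBytes plus GetString and ReadAllText agree.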

answered Oct 27 '22 00:10

PhillipH


Tags: string, c#, byte