What character encoding is used by StreamReader.ReadToEnd()?

Tags: .net, vb.net, encoding, utf-8, streamreader

What character encoding is used by StreamReader.ReadToEnd()?
What would be the reason to use (b) instead of (a) below?
Is there a risk of there being a character encoding problem if (a) is used instead of (b)?
Is there another method that is better than (a) and (b)?

(a)

Dim strWebResponse As String
Dim Request As HttpWebRequest = WebRequest.Create(Url)
Using Response As WebResponse = Request.GetResponse()
    Using reader As StreamReader = New StreamReader(Response.GetResponseStream())
        strWebResponse = reader.ReadToEnd()
    End Using
End Using

(b)

Dim encoding As New UTF8Encoding()
Dim strWebResponse As String
Dim Request As HttpWebRequest = WebRequest.Create(Url)
Using Response As WebResponse = Request.GetResponse()
    Dim responseBuffer(Response.ContentLength - 1) As Byte
    Response.GetResponseStream().Read(responseBuffer, 0, responseBuffer.Length)
    strWebResponse = encoding.GetString(responseBuffer)
End Using

asked Nov 12 '12 04:11

CJ7

1 Answer

The default encoding used by StreamReader is Encoding.UTF8. It is not Encoding.Default, which varies from machine to machine depending on your version of Windows and the locale that you have set.
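If you want to convince yourself of the default, a small console check along these lines should do it. This is only a sketch: the module name and the sample text are made up for illustration, and it reads from a MemoryStream rather than a real web response.

```vb
Imports System
Imports System.IO
Imports System.Text

Module DefaultEncodingSketch
    Sub Main()
        ' Sample text with one non-ASCII character (U+00E9), built with ChrW
        ' so the result does not depend on how this source file is saved.
        Dim original As String = "caf" & ChrW(&HE9)
        Dim utf8Bytes As Byte() = Encoding.UTF8.GetBytes(original)

        ' No encoding argument: StreamReader falls back to its default.
        Using reader As New StreamReader(New MemoryStream(utf8Bytes))
            Dim decoded As String = reader.ReadToEnd()
            Console.WriteLine(decoded = original)               ' True
            Console.WriteLine(reader.CurrentEncoding.WebName)   ' utf-8
        End Using
    End Sub
End Module
```

Note that this constructor also looks for a byte order mark at the start of the stream, so a stream that begins with a UTF-16 BOM would be decoded as UTF-16 rather than UTF-8.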

I have trouble remembering what the defaults are, so I prefer to use the StreamReader constructor that lets me specify the encoding. For example:

Using reader As StreamReader = New StreamReader(Response.GetResponseStream(), Encoding.UTF8)

See the constructor documentation for more info.

If you use that constructor in your example a, the results will be the same as for your example b.

Should you use UTF-8? That depends on the page you're downloading. If the page was encoded with UTF-8 then, yes, you should use UTF-8. UTF-8 is a reasonable default if no character set is declared in the HTTP headers, although the original HTTP/1.1 specification (RFC 2616) named ISO-8859-1 as the default for text types. Either way, you need to check the Content-Type header to determine whether the page uses some other encoding. For example, the Content-Type header might read:

 application/xml; charset=ISO-8859-2

You would have to examine the ContentType property of the HttpWebResponse, check to see if there is a charset field, and set the encoding properly based on that.
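A rough sketch of that approach follows. The module and the helper names EncodingFromContentType and DownloadString are hypothetical, and falling back to UTF-8 whenever the charset is missing or unrecognized is an assumption of this sketch, not a rule. It parses the header with System.Net.Mime.ContentType and resolves the name with Encoding.GetEncoding.

```vb
Imports System
Imports System.IO
Imports System.Net
Imports System.Net.Mime
Imports System.Text

Module ResponseEncodingSketch
    ' Hypothetical helper: choose an Encoding from a Content-Type header value.
    ' Falls back to UTF-8 (an assumption) when no usable charset is present.
    Function EncodingFromContentType(contentTypeHeader As String) As Encoding
        If String.IsNullOrWhiteSpace(contentTypeHeader) Then Return Encoding.UTF8
        Try
            Dim charset As String = New ContentType(contentTypeHeader).CharSet
            If String.IsNullOrEmpty(charset) Then Return Encoding.UTF8
            Return Encoding.GetEncoding(charset)
        Catch ex As FormatException
            ' The header could not be parsed as a content type.
            Return Encoding.UTF8
        Catch ex As ArgumentException
            ' The charset name is not known to this runtime.
            Return Encoding.UTF8
        End Try
    End Function

    ' Hypothetical helper: example (a) with the encoding taken from the response.
    Function DownloadString(url As String) As String
        Dim request As WebRequest = WebRequest.Create(url)
        Using response As WebResponse = request.GetResponse()
            Dim enc As Encoding = EncodingFromContentType(response.ContentType)
            Using reader As New StreamReader(response.GetResponseStream(), enc)
                Return reader.ReadToEnd()
            End Using
        End Using
    End Function
End Module
```

With the header shown above, EncodingFromContentType("application/xml; charset=ISO-8859-2") returns the ISO-8859-2 encoding on .NET Framework. On .NET Core and later, legacy code pages like that one are only available after registering CodePagesEncodingProvider, so without that step this sketch would quietly fall back to UTF-8.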

Or, just use UTF-8 and hope for the best.

answered Sep 18 '22 01:09

Jim Mischel
