I have a webservice that returns a config file to a low-level hardware device. The manufacturer of the device tells me it only supports single-byte character sets for this config file.
On this wiki page I found that the following should be single-byte character sets:
But when I call Encoding.GetMaxByteCount(1) on these character sets, it always returns 2.
I also tried various other encodings (for instance IBM437), but GetMaxByteCount returns 2 for those as well.
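Roughly what I am doing is this (a minimal sketch; the encoding names are just the ones mentioned here, and on .NET Core / .NET 5+ the legacy code pages such as windows-1252 and IBM437 additionally need the System.Text.Encoding.CodePages provider registered):

    using System;
    using System.Text;

    class Program
    {
        static void Main()
        {
            // On .NET Core / .NET 5+ the legacy code pages are only available after
            // registering the code pages provider (System.Text.Encoding.CodePages):
            // Encoding.RegisterProvider(CodePagesEncodingProvider.Instance);

            foreach (string name in new[] { "us-ascii", "windows-1252", "IBM437" })
            {
                Encoding enc = Encoding.GetEncoding(name);
                Console.WriteLine(
                    $"{name}: IsSingleByte={enc.IsSingleByte}, GetMaxByteCount(1)={enc.GetMaxByteCount(1)}");
                // All of these report IsSingleByte=True but GetMaxByteCount(1)=2.
            }
        }
    }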
The method Encoding.IsSingleByte seems unreliable according to this:
You should be careful in what your application does with the value for IsSingleByte. An assumption of how an Encoding will proceed may still be wrong. For example, Windows-1252 has a value of true for Encoding.IsSingleByte, but Encoding.GetMaxByteCount(1) returns 2. This is because the method considers potential leftover surrogates from a previous decoder operation.
Also, the method Encoding.GetMaxByteCount has some of the same issues, according to this:
Note that GetMaxByteCount considers potential leftover surrogates from a previous decoder operation. Because of the decoder, passing a value of 1 to the method retrieves 2 for a single-byte encoding, such as ASCII. Your application should use the IsSingleByte property if this information is necessary.
Because of this I am no longer sure what to use.
Further reading.
Single-byte characters are represented as a series of lowercase letters. The format for representing one single-byte character abstractly is "a"; here "a" stands for any single-byte character, not for the letter "a" itself. The letter "s" does not show up in examples that represent strings of single-byte characters.
One-byte character sets can contain at most 256 characters. The current standard, though, is Unicode, which represents the characters of all writing systems in the world in a single set and, in encodings such as UTF-16, uses two or more bytes per character.
A single-byte character set (SBCS) is a mapping of 256 individual characters to their identifying code values, implemented as a code page. An SBCS can correspond either to a Windows code page or an OEM code page. An SBCS code page can also include a non-native code page, for example, an EBCDIC code page.
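To make that "mapping of 256 individual characters" concrete, here is a small sketch (windows-1252 is just an example single-byte code page) that decodes every possible byte value back to the character it maps to:

    using System;
    using System.Text;

    class Program
    {
        static void Main()
        {
            // windows-1252 is one example of an SBCS: each byte value 0x00..0xFF
            // maps to at most one character (a few slots are unassigned).
            // On .NET Core / .NET 5+ this code page first needs
            // Encoding.RegisterProvider(CodePagesEncodingProvider.Instance).
            Encoding sbcs = Encoding.GetEncoding("windows-1252");

            for (int b = 0; b <= 0xFF; b++)
            {
                string ch = sbcs.GetString(new[] { (byte)b });
                Console.WriteLine($"0x{b:X2} -> U+{(int)ch[0]:X4}");
            }
        }
    }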
Basically, GetMaxByteCount considers an edge case that you will probably never need in regular code, specifically what it says about the decoder and surrogates. The point here is that some code points are encoded as surrogate pairs, which in unfortunate cases can mean that a pair straddles two calls to GetBytes() / GetChars() (on the encoder/decoder). As a consequence, the implementation may theoretically have a single byte/character still buffered and waiting to be processed, therefore GetMaxByteCount needs to warn about this.
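To see that buffering in action, here is a rough sketch that drives an Encoder for a single-byte encoding (ASCII) directly; the exact fallback output may vary, but the point is that a call with one character can produce two bytes because of a leftover surrogate from the previous call:

    using System;
    using System.Text;

    class Program
    {
        static void Main()
        {
            // U+20000 is stored in a .NET string as the surrogate pair D840 DC00;
            // 'A' is an ordinary follow-up character.
            char[] chars = { '\uD840', '\uDC00', 'A' };

            Encoder encoder = Encoding.ASCII.GetEncoder();
            byte[] buffer = new byte[8];

            // Pass only the high surrogate. With flush: false the encoder keeps it
            // in its internal buffer instead of emitting anything (expect 0 bytes).
            int first = encoder.GetBytes(chars, 0, 1, buffer, 0, flush: false);
            Console.WriteLine(first);

            // Now pass a single ordinary character. The leftover surrogate is
            // flushed through the fallback ('?') together with 'A', so one input
            // character yields two bytes - which is why GetMaxByteCount(1) is 2.
            int second = encoder.GetBytes(chars, 2, 1, buffer, 0, flush: true);
            Console.WriteLine(second);
        }
    }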
However! All of this only makes sense if you are using the encoder/decoder directly. If you are using operations on the Encoding, such as Encoding.GetBytes, then all of this is abstracted away from you and you will never need to know about it. In which case, just use IsSingleByte and you'll be fine.
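So for the config-file scenario in the question, a check like the following sketch should be enough (the helper name is made up; you could also swap in EncoderExceptionFallback if you would rather fail on characters the code page cannot represent than emit "?"):

    using System;
    using System.Text;

    static class ConfigEncoding
    {
        // Hypothetical helper: encodes the config text with the requested encoding,
        // after verifying that it really is a single-byte encoding.
        public static byte[] GetConfigBytes(string configText, string encodingName)
        {
            Encoding encoding = Encoding.GetEncoding(encodingName);

            // IsSingleByte is the right check here; GetMaxByteCount(1) == 2 is only
            // about the encoder/decoder edge case described above.
            if (!encoding.IsSingleByte)
                throw new ArgumentException(
                    $"'{encodingName}' is not a single-byte encoding.", nameof(encodingName));

            // A plain Encoding.GetBytes call never leaves surrogates buffered between
            // calls, so you get straightforward single-byte output; characters outside
            // the code page go through the fallback ('?' by default).
            return encoding.GetBytes(configText);
        }
    }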