UTF8 byte[] to string conversion

Tags:

I have UTF8 byte[] of infinite size (i.e. of very large size). I want to truncate it to 1024 bytes only and then convert it to string.

Encoding.UTF8.GetString(byte[], int, int) does that for me. It first shortens 1024 bytes and then gives me its converted string.

But in this conversion, if last character is of UTF8 character set, which is made of 2 bytes and whose first byte falls in range and another byte is out of range then it displays ? for that character in converted string.

Is there any way so that this ? does not come in converted string?

601

asked Apr 20 '16 09:04

pratik03

1 Answers

That's what the Decoder class is for. It allows you to stream byte data into char data, while maintaining enough state to handle partial code-points correctly:

Encoding.UTF8.GetDecoder().GetChars(buffer, 0, 1024, charBuffer, 0)

Of course, when the code-point is split in the middle, the Decoder is left with a "partial char" in its state, but that doesn't concern you in your case (and is desirable in all the other use cases :)).

197

answered Sep 28 '22 00:09

Luaan

Related questions
                            
                                Creating dynamic expression for entity framework
                            
                                Why is my DeflateStream not receiving data correctly over TCP?
                            
                                SqlConnection namespace not found
                            
                                MVC 6 HttpResponseException
                            
                                What is the difference between these awaitable methods?
                            
                                Cast object to method generic type
                            
                                Could not load file or assembly 'Microsoft.AI.Agent.Intercept' or one of its dependencies
                            
                                asp.net core 1.0 mvc. Get raw content from Request.Body
                            
                                C# Await Multiple Events in Producer/Consumer
                            
                                convert a png file to a pcx file using c#
                            
                                How to hide pdf importer popup in word automation
                            
                                Decode Html-encoded characters during Json deserialization
                            
                                AutoMapper, moving away from the Obsolete Static API
                            
                                How to remove @strin3http//schemas.microsoft.com/2003/10/Serialization/� received from service bus queue received in python script?
                            
                                Setting default CurrentCulture and CurrentUICulture (differences between .NET 4.5.2 and .NET 4.6)
                            
                                How to break on specific Guid using Visual Studio Conditional Breakpoint
                            
                                how to do Max Aggregation in LINQ query syntax?
                            
                                UWP Raspberry Pi Webserver issue
                            
                                Corrupted images when using Read and Write streams to save files
                            
                                Change from bitarray to enum

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

UTF8 byte[] to string conversion

Tags:

string

c#

type-conversion

utf-8

pratik03

People also ask

1 Answers

Luaan

Recent Activity

Donate For Us