check if javascript string is valid UTF-8

2 Answers

I think you misunderstand what "UTF-8 characters" means. UTF-8 is an encoding of Unicode which can represent pretty-much every single character and glyph that has ever existed in recorded human history, so that extent there are no "invalid" UTF-8 characters.

RTF is a formatting system which works independently of the underlying encoding system - you can use RTF with ASCII, UTF-8, UTF-16 and others. Textboxes in HTML only respect plain text, so any RTF formatting will be automatically stripped (unless you're using a "rich-edit" component, which I assume you're not).

But you do describe things like whitespace characters (like tabs: \t) are represented in Unicode (and so, UTF-8). A string containing those characters is still "valid UTF-8", it's just invalid as far as your business-requirements are concerned.

I suggest just stripping-out unwanted characters using a regular-expression that matches non-visible characters (from here: Match non printable/non ascii characters and remove from text )

textBoxContent = textBoxContent.replace(/[^\x20-\x7E]+/g, '');

The expression [^\x20-\x7E] matches any character NOT in the codepoint range 0x20 (32, a normal space character ' ') to 0x7E (127, the tidle '~' character), all others will be removed.

Unicode's first 127 codepoints are identical to ASCII and can be seen here: http://www.asciitable.com/

answered Oct 30 '22 16:10

Dai

Just an idea:

function checkUTF8(text) {
    var utf8Text = text;
    try {
        // Try to convert to utf-8
        utf8Text = decodeURIComponent(escape(text));
        // If the conversion succeeds, text is not utf-8
    }catch(e) {
        // console.log(e.message); // URI malformed
        // This exception means text is utf-8
    }   
    return utf8Text; // returned text is always utf-8
}

answered Oct 30 '22 18:10

Daniel Rodriguez

Related questions
                            
                                Import existing AMD module into ES6 module
                            
                                How to wait till the google maps API has loaded before loading a google.maps.OverlayView derived class
                            
                                AngularJS doesn't show specific errors in the Firebug console anymore
                            
                                Error while using unzip with 12mb file size
                            
                                Hapi/Joi validation with nested object
                            
                                Reentrancy in JavaScript
                            
                                CORS issue when getting a token in Azure AD B2C (Implict Flow)
                            
                                moment.js: how to get short date format?
                            
                                d3 accessing nested data in grouped bar chart
                            
                                Remove objects from array - Two different approaches, two different results when consulting the length of each array
                            
                                How to Fetch Serial Number from Android Mobile Browser using Java Scripting which will be from web application source running in mobile browser [closed]
                            
                                Parse a Json(with array and objects) and export the data into Excel file in Node.js
                            
                                How to determine if google auth2.signIn() window was closed by the user?
                            
                                Is it possible to generate an image (blob or data-url) in a web worker from a canvas context's getImageData?
                            
                                jQuery ajax request: how to access sent data in success function?
                            
                                Can you use absolute paths in Electron?
                            
                                Browserify, minifyify, conditional compilation
                            
                                Stubbing nested function calls in sinon
                            
                                Callback not fired on google-sign-in
                            
                                Achieving a preview of a PDF or hiding parts of a PDF in a web page from BLOB format -Angular

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

check if javascript string is valid UTF-8

Tags:

javascript

html

utf-8

eNddy

People also ask

2 Answers

Dai

Daniel Rodriguez

Recent Activity

Donate For Us