What is the point of the UTF-8 character literals proposed for C++17?

What exactly is the point of these, as proposed by N4267?

Their only function seems to be to prevent extended ASCII characters or partial UTF-8 code points from being specified. They still store in a fixed-width 8-bit char (which, as I understand it, is the correct and best way to handle UTF-8 anyway for almost all use cases), so they don't support non-ASCII characters at all. What is going on?

(Actually I'm not entirely sure I understand the need for UTF-8 string literals either. I guess it's the worry of compilers doing weird/ambiguous things with Unicode strings coupled with validation of the Unicode?)

asked Aug 12 '15 15:08

Muzer

1 Answer

The rationale is covered by Evolution Working Group issue 119: N4197 Adding u8 character literals, [tiny] Why no u8 character literals?, which tracked the proposal and says:

We have five encoding-prefixes for string-literals (none, L, u8, u, U) but only four for character literals -- the missing one is u8 for character literals.

This matters for implementations where the narrow execution character set is not ASCII. In such a case, u8 character literals would provide an ideal way to write character literals with guaranteed ASCII encoding (the single-code-unit u8 encodings are exactly ASCII), but... we don't provide them. Instead, the best one can do is something like this:
char x_ascii = { u'x' }; 
... where we'll get a narrowing error if the codepoint doesn't fit in a 'char'. (Note that this is not quite the same as u8'x', which would give us an error if the codepoint was not representable as a single code unit in UTF-8.)
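To make the difference concrete, here is a minimal sketch comparing the quoted workaround with a u8 character literal. It assumes a compiler in C++17 mode or later; the names x_u8, x_workaround and is_ascii_x are made up for illustration and do not come from the proposal.

```cpp
#include <cassert>

// u8 character literal (C++17): the value is the UTF-8 code unit, which for
// single-code-unit characters is exactly the ASCII value, whatever the
// narrow execution character set is. The type is char in C++17 and char8_t
// in C++20, so 'auto' keeps this line valid in both.
constexpr auto x_u8 = u8'x';

// The workaround quoted above: list-initialization rejects a narrowing
// conversion when the code point does not fit in char.
constexpr char x_workaround = { u'x' };

static_assert(x_u8 == 0x78, "u8'x' is the ASCII code for 'x'");
static_assert(x_workaround == 0x78, "u'x' has the same code point value");

// Hypothetical helper: compare a byte read from ASCII/UTF-8 data against 'x'
// without depending on the execution character set.
constexpr bool is_ascii_x(char c) {
    return c == static_cast<char>(u8'x');
}

// Ill-formed in C++17, because U+00E9 needs two UTF-8 code units:
//     constexpr auto bad = u8'\u00e9';
// The workaround only checks that the value fits in char, so on an
// implementation where char is unsigned, { u'\u00e9' } could be accepted
// even though 0xE9 on its own is not a valid UTF-8 sequence.

// Runtime check mirroring the static_asserts above.
inline void check_ascii_values() {
    assert(x_u8 == 0x78);
    assert(x_workaround == 0x78);
}
```

On an ASCII platform both forms give the same value; the u8 form only pays off where plain 'x' would be encoded differently (EBCDIC, for example), or when you want the compiler to reject anything that is not a single UTF-8 code unit.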

answered Oct 19 '22 00:10

Shafik Yaghmour
