Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Can a plain `char` possibly have trap values?

Tags:

README

A "trap value", or "trap representation" for type T, is a bit combination (of the underlying storage) that yields an invalid value of T. Trying to interpret the representation of an invalid value will cause undefined behavior.


Let the battle begin..

Another question has started a heated discussion regarding char, and the possibility of an implementation having trap representations for it.

Question:

  • Can char possibly have trap values?

Quotes that has been mentioned in the previous discussion:

These sections are the most quoted ones during the previous argumentation, are they contradicting?

3.9.1p1 Fundamental types [basic.fundamental]

It is implementation-defined whether a char can hold negative values. Characters can be explicitly declared signed or unsigned.

A char, a signed char, and an unsigned char occupy the same amount of storage and have the same alignment requirements (3.11); that is, they have the same object representation. For character types, all bits of the object representation participate in the value representation.

For unsigned character types, all possible bit patterns of the value representation represent numbers. These requirements do not hold for other types.

In any particular implementation, a plain char object can take on either the same values as a signed char or an unsigned char; which one is implementation-defined.

3.9p2 Types [basic.types]

For any object (other than a base-class subobject) of trivially copyable type T, whether or not the object holds a valid value of type T, the underlying bytes (1.7) making up the object can be copied into an array of char or unsigned char.

If the content of the array of char or unsigned char is copied back into the object, the object shall subsequently hold its original value.

like image 202
Filip Roséen - refp Avatar asked Jun 04 '14 09:06

Filip Roséen - refp


1 Answers

The standard tells us there must be:

  • char, signed char, unsigned char, all the same size
  • the sizeof(char) is 1
  • char has at least 8 bits
  • every bit combination is meaningful and valid
  • array of char is packed (or behaves is if it is).

There isn't much wiggle room.

Nevertheless there are suggestions that during certain kinds of operations such as loading uninitialised memory or conversions as trap might occur.

Yes, I think an implementation could have a trap representation where trap values could occur as a result of some kind of undefined or unspecified behaviour, including evaluating expressions that involve unspecified/uninitialised values. The actual bit pattern leading to a trap value would be invisible to the implementation.

Such a CPU could have 9 bit bytes where only 8 bits are visible to the compiler and runtime, and the 9th bit is used to detect uninitialised memory, and will trigger a trap if loaded by (unprivileged) instructions.

like image 114
david.pfx Avatar answered Oct 19 '22 23:10

david.pfx