
What are all of the allowable characters for people's names? [closed]

There are the standard A-Z, a-z characters, but also there are hyphens, em dashes, quotes, etc.

Plus, there are all of the international characters, like umlauts, etc.

So, for an English-based system, what's the complete set? What about sets for other languages? What about encodings such as UTF-8 and UTF-16?

Bonus question: How many name fields are needed, and what are their maximum lengths?

EDIT: There are definitely two different types of characters involved in people's names, those that are there as part of the context, and those that are there for structural reasons. I don't want to limit or interfere with the context characters, but I do need to deal with the structural ones.

For example, I had a name come in that was separated by an em dash, but it was hard to distinguish that from the minus character. To make the system easier to search, I want to take all five different types of dashes and map them onto one unique character (minus), so that the searcher doesn't need to know which symbol was originally entered.

The problem exists for dashes, probably quotes as well, but also how many other symbols?
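A minimal sketch of the mapping described above, assuming Python and assuming the original name is stored untouched while a separate normalized key is used only for searching. The function name `search_key` and the character lists are illustrative assumptions, not a complete inventory of dash-like or quote-like code points.

```python
import unicodedata

# Assumed, non-exhaustive lists of look-alike characters.
# U+2010 hyphen, U+2011 non-breaking hyphen, U+2012 figure dash,
# U+2013 en dash, U+2014 em dash, U+2015 horizontal bar, U+2212 minus sign.
_DASHES = "\u2010\u2011\u2012\u2013\u2014\u2015\u2212"
# U+2018/U+2019 curly single quotes, U+02BC modifier letter apostrophe.
_SINGLE_QUOTES = "\u2018\u2019\u02bc"
# U+201C/U+201D curly double quotes.
_DOUBLE_QUOTES = "\u201c\u201d"

# Translation table: every look-alike maps to its plain ASCII counterpart.
_SEARCH_TABLE = str.maketrans({
    **{ch: "-" for ch in _DASHES},
    **{ch: "'" for ch in _SINGLE_QUOTES},
    **{ch: '"' for ch in _DOUBLE_QUOTES},
})


def search_key(name: str) -> str:
    """Return a normalized copy of `name` intended for search only."""
    # NFC composes base letters with combining marks, so two encodings
    # of the same accented letter compare equal.
    composed = unicodedata.normalize("NFC", name)
    return composed.translate(_SEARCH_TABLE)
```

Applying the same function to both the stored names and the search term means the searcher can type a plain hyphen or apostrophe regardless of what was entered originally. Letters, including accented ones, are left as they are.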

Paul W Homer asked Jan 07 '09 16:01



1 Answer

There's a good article by the W3C called "Personal names around the world" that explains the problems (and possible solutions) pretty well. It was originally a two-part blog post by Richard Ishida (part 1 and part 2).

Personally I'd say: support every printable Unicode character and, to be safe, provide just a single field "name" that contains the full, formatted name. This way you can store pretty much every form of name. You might need more structured storage, but then don't expect to be able to store every single combination in a structured form, as there are simply too many different ones.
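As a rough illustration of the "every printable Unicode character, single field" idea, here is a sketch in Python. It is one possible reading of "printable", not a definitive rule: the helper name `is_acceptable_name` is hypothetical, and the choice to reject only control, surrogate, private-use, and unassigned code points is an assumption. Format characters (category Cf) are deliberately allowed because some scripts use zero-width joiners inside names.

```python
import unicodedata

# Assumed set of general categories treated as "not printable":
# Cc = control, Cs = surrogate, Co = private use, Cn = unassigned.
_REJECTED_CATEGORIES = {"Cc", "Cs", "Co", "Cn"}


def is_acceptable_name(name: str) -> bool:
    """Check a single free-form name field without restricting scripts."""
    # Leading and trailing whitespace is ignored; an empty name is rejected.
    stripped = name.strip()
    if not stripped:
        return False
    # Accept any character whose category is outside the rejected set.
    return all(
        unicodedata.category(ch) not in _REJECTED_CATEGORIES
        for ch in stripped
    )
```

Note that there is no length limit and no per-script whitelist here, in line with storing the full, formatted name as entered.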

Joachim Sauer answered Oct 02 '22 14:10