I want a UTF8 collation for supporting: <ul> <li>English</li> <li>Persian</li> <li>Arabic</li> <li>French</li> <li>Japanese</li> <li>Chinese</li> </ul> Does <code>UTF8_GENERAL_CI</code> support all these Languages?

As <code>UTF8_GENERAL_CI</code> was a good decision some time ago. It has some drawbacks now. MySQL's UTF8 actually uses 3 bytes instead of 4, which you need for symbols like emojis and new asian chars. So MySQL has a newer charset called utf8mb4 which actually complies with UTF8 definition. To be able fully support Asian languages you will need to choose utf8mb4. If you care about correct sorting in multiple languages, use <code>utf8mb4_unicode</code> or <code>utf8mb4_unicode_ci</code> instead general. A more detailed answer you can find in What's the difference between utf8_general_ci and utf8_unicode_ci

Which of utf8 collations is the best? [closed]

2 Answers

Yes, that is correct. UTF-8 is an encoding for the Unicode character set, which supports pretty much every language in the world.

I think the only difference comes with sorting your results, different letters might come in a different order in other languages (accents, umlauts, etc.). Also, comparing a to ä might behave differently in another collation.

The _ci suffix means sorting and comparison happens case insensitive.

http://www.collation-charts.org/ might be of interest to you.

answered Oct 06 '22 00:10

knittl

As UTF8_GENERAL_CI was a good decision some time ago. It has some drawbacks now.

MySQL's UTF8 actually uses 3 bytes instead of 4, which you need for symbols like emojis and new asian chars.

So MySQL has a newer charset called utf8mb4 which actually complies with UTF8 definition.

To be able fully support Asian languages you will need to choose utf8mb4.

If you care about correct sorting in multiple languages, use utf8mb4_unicode or utf8mb4_unicode_ci instead general.

A more detailed answer you can find in What's the difference between utf8_general_ci and utf8_unicode_ci

answered Oct 06 '22 01:10

Aistis

Related questions
                            
                                Equivalent of MySQL ON DUPLICATE KEY UPDATE in Sql Server
                            
                                Storing Social Security Numbers
                            
                                How can I set the max number of MySQL processes or threads?
                            
                                Performance of RegEx vs LIKE in MySql queries
                            
                                UTF8 Encoding problem - With good examples
                            
                                Call a stored procedure for each row returned by a query in MySQL
                            
                                MySQL - SELECT AS in WHERE
                            
                                Is there a performance hit using decimal data types (MySQL / Postgres)
                            
                                Performing a LIKE comparison on an INT field
                            
                                What does the := operator mean in mysql?
                            
                                What is the syntax to force the use of an index for a join in MySQL
                            
                                How to pass a list of IDs to MySQL stored procedure?
                            
                                MYSQL 8.0 - unsupported redo log format
                            
                                MySQL Views - When to use & when not to
                            
                                Using SqlDataAdapter to insert a row
                            
                                How to clear query cache in mysql?
                            
                                Syntax error or access violation: 1059 Identifier name is too long
                            
                                Error code 1064, SQL state 42000: You have an error in your SQL syntax;
                            
                                SQLSTATE[HY093]: Invalid parameter number: number of bound variables does not match number of tokens on line 102 [closed]
                            
                                PMA Database ... not OK in phpMyAdmin upgrade

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Which of utf8 collations is the best? [closed]

Tags:

mysql

collation

armin etemadi

People also ask

2 Answers

knittl

Aistis

Recent Activity

Donate For Us