Somewhere along the way, across all the imports and exports I have done, a lot of the text on a blog I run has ended up full of weird accented A characters. When I export the data using mysqldump and load it into a text editor, intending to use search-and-replace to clear out the bad characters, the search just matches every "a" character. Does anyone know a way to hunt down these characters and get rid of them, either directly in MySQL or by using mysqldump and then reimporting the content?

Removing strange characters from MySQL data

2 Answers

This is an encoding problem: the text contains a Unicode non-breaking space (HTML entity &nbsp;) stored as UTF-8, and it is being displayed as Latin1, where it shows up as Â.
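
To make the mechanism concrete, here is a small Python sketch (an illustration only, not part of the original answer, and run outside MySQL). It assumes the data really is a UTF-8 non-breaking space misread as Latin1; the variable names are made up for the example.

```python
# Illustration only: how a UTF-8 non-breaking space turns into "Â" plus an
# invisible character when its bytes are read as Latin-1.
NBSP = "\u00a0"  # non-breaking space, the character behind &nbsp;

utf8_bytes = NBSP.encode("utf-8")        # two bytes: C2 A0
misread = utf8_bytes.decode("latin-1")   # each byte becomes one character

print(utf8_bytes.hex())                  # the raw bytes in hex
print([hex(ord(ch)) for ch in misread])  # code points of the misread text

# Reversing the mistake (encode as Latin-1, decode as UTF-8) recovers
# the original single character.
repaired = misread.encode("latin-1").decode("utf-8")
```

So each visible Â is really half of a two-byte sequence: byte C2 renders as Â and byte A0 renders as a non-breaking space.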

You might try something like this... first we check to make sure the matching is working:

SELECT * FROM some_table WHERE some_field LIKE BINARY '%Â%'

This should return any rows in some_table where some_field has a bad character. Assuming that works properly and you find the rows you're looking for, try this:

UPDATE some_table SET some_field = REPLACE( some_field, BINARY 'Â', '' )

And that should remove those characters. Based on the page you linked, you don't really want an nbsp there: you would end up with three spaces in a row between sentences, when you should only have one.

If it doesn't work then you'll need to look at the encoding and collation being used.

EDIT: Just added BINARY to the strings; this should hopefully make it work regardless of encoding.

200

answered Sep 23 '22 19:09

kitti

The accepted answer did not work for me.

Following http://nicj.net/mysql-converting-an-incorrect-latin1-column-to-utf8/ I found that the bytes behind the Â character are c2a0 (by converting the column to VARBINARY and looking at what it turns into). Then http://www.oneminuteinfo.com/2013/11/mysql-replace-non-ascii-characters.html gave the actual solution to remove (replace) it:

update entry set english_translation = unhex(replace(hex(english_translation),'C2A0','20')) where entry_id = 4008;

The query above replaces it with a regular space; you can then apply a normal trim, or replace it with '' instead to remove it entirely.
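
For the mysqldump route mentioned in the question, the same idea can be sketched in Python on raw bytes. This is a hedged illustration, not the method from either linked page: the function names are hypothetical, and it assumes the dump is UTF-8 with the stray C2 A0 sequences described above. The second function mimics the hex-string replacement from the query so the two can be compared; as far as I can tell, matching on the hex text can also hit a "C2A0" that straddles byte boundaries, which the byte-level version avoids.

```python
def replace_nbsp_bytes(data: bytes, replacement: bytes = b" ") -> bytes:
    """Replace every UTF-8 non-breaking space (bytes C2 A0) in data."""
    return data.replace(b"\xc2\xa0", replacement)


def replace_via_hex(data: bytes) -> bytes:
    """Mimic unhex(replace(hex(col), 'C2A0', '20')) on a byte string."""
    return bytes.fromhex(data.hex().upper().replace("C2A0", "20"))


# A sentence break written as "." + NBSP + " " collapses to two plain spaces.
sample = "one.\u00a0 two".encode("utf-8")
print(replace_nbsp_bytes(sample))

# "l*" followed by a newline is 6C 2A 0A, whose hex text "6C2A0A" contains
# "C2A0" starting in the middle of a byte, so the hex approach rewrites it.
tricky = b"l*\n"
print(replace_via_hex(tricky))      # no longer equal to the input
print(replace_nbsp_bytes(tricky))   # unchanged
```

To apply this to a dump, read the file in binary mode, pass the bytes through replace_nbsp_bytes, and write the result to a new file before reimporting; keep the original dump as a backup.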

answered Sep 24 '22 19:09

user109764


Tags:

mysql

Andy Soell
