I have been trying to work on this issue for a while.I am trying to remove non ASCII characters form DB_user column and trying to replace them with spaces. But I keep getting some errors. This is how my data frame looks: <pre class="prettyprint"> +----------------------------------------------------------- | DB_user source count | +----------------------------------------------------------- | ???/"Ò|Z?)?]??C %??J A 10 | | ?D$ZGU ;@D??_???T(?) B 3 | | ?Q`H??M'?Y??KTK$?Ù&lsaquo;???Ð©JL4??*?_?? C 2 | +----------------------------------------------------------- </pre> I was using this function, which I had come across while researching the problem on SO. <pre class="prettyprint"><code>def filter_func(string): for i in range(0,len(string)): if (ord(string[i])< 32 or ord(string[i])>126 break return '' And then using the apply function: df['DB_user'] = df.apply(filter_func,axis=1) </code></pre> I keep getting the error: <pre class="prettyprint"> 'ord() expected a character, but string of length 66 found', u'occurred at index 2' </pre> However, I thought by using the loop in the filter_func function, I was dealing with this by inputing a char into 'ord'. Therefore the moment it hits a non-ASCII character, it should be replaced by a space. Could somebody help me out? Thanks!

you may try this: <pre class="prettyprint"><code>df.DB_user.replace({r'[^\x00-\x7F]+':''}, regex=True, inplace=True) </code></pre>

Remove non-ASCII characters from pandas column

Tags:

I have been trying to work on this issue for a while.I am trying to remove non ASCII characters form DB_user column and trying to replace them with spaces. But I keep getting some errors. This is how my data frame looks:

  +----------------------------------------------------------- |      DB_user                            source   count  |                                              +----------------------------------------------------------- | ???/"Ò|Z?)?]??C %??J                      A        10   |                                        | ?D$ZGU   ;@D??_???T(?)                    B         3   |                                        | ?Q`H??M'?Y??KTK$?Ù‹???Ð©JL4??*?_??        C         2   |                                         +-----------------------------------------------------------

I was using this function, which I had come across while researching the problem on SO.

def filter_func(string):    for i in range(0,len(string)):         if (ord(string[i])< 32 or ord(string[i])>126            break        return ''  And then using the apply function:  df['DB_user'] = df.apply(filter_func,axis=1)

I keep getting the error:

  'ord() expected a character, but string of length 66 found', u'occurred at index 2'

However, I thought by using the loop in the filter_func function, I was dealing with this by inputing a char into 'ord'. Therefore the moment it hits a non-ASCII character, it should be replaced by a space.

Could somebody help me out?

Thanks!

213

asked Mar 31 '16 18:03

red_devil

1 Answers

you may try this:

df.DB_user.replace({r'[^\x00-\x7F]+':''}, regex=True, inplace=True)

answered Sep 18 '22 15:09

MaxU - stop WAR against UA

Related questions
                            
                                How to use System.Windows.Forms in .NET Core class library
                            
                                Get second td of tr using jquery
                            
                                Schedule a .Net Core console application on windows using Task Scheduler
                            
                                How to extend/modify (customize) Bootstrap with SASS
                            
                                Cannot find Bitmap Class in Class Library (.NET Standard)
                            
                                VS Code Auto Indent / Code Formatting changes single quotation marks to double
                            
                                Access Configuration from a View in ASP.NET Core
                            
                                Why can’t this static inner class call a non-static method on its outer class?
                            
                                pull remote branch without merge
                            
                                Mysql 8 remote access
                            
                                How to create a transparent full screen dialog on top of activity - Flutter
                            
                                how to display animated gif in flutter?

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With