Remove punctuations in pandas [duplicate]

Tags:

code: df['review'].head()
        index         review
output: 0      These flannel wipes are OK, but in my opinion

I want to remove punctuations from the column of the dataframe and create a new column.

code: import string 
      def remove_punctuations(text):
          return text.translate(None,string.punctuation)

      df["new_column"] = df['review'].apply(remove_punctuations)

Error:
  return text.translate(None,string.punctuation)
  AttributeError: 'float' object has no attribute 'translate'

I am using python 2.7. Any suggestions would be helpful.

367

asked Sep 30 '16 01:09

data_person

2 Answers

Using Pandas str.replace and regex:

df["new_column"] = df['review'].str.replace('[^\w\s]','')

answered Oct 09 '22 17:10

Bob Haffner

You can build a regex using the string module's punctuation list:

df['review'].str.replace('[{}]'.format(string.punctuation), '')

answered Oct 09 '22 16:10

David C

Related questions
                            
                                Why does resharper say 'Catch clause with single 'throw' statement is redundant'?
                            
                                How do you cut off text after a certain amount of characters in PHP?
                            
                                mysql custom sort
                            
                                Generate a random number within range? [duplicate]
                            
                                C# list sort by two columns
                            
                                How to combine two NSString?
                            
                                webChromeClient opens link in browser
                            
                                Number of items in a list filtered AngularJS
                            
                                HTML5 Video is not working with AngularJS ng-src tag
                            
                                can I display parameters value ($_POST, $_GET, $_SESSION, $_SESSION) using twig template engine component
                            
                                UITabBar will hide the last cell of the UITableView
                            
                                Plotting multiple line graph using pandas and matplotlib

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With