I have text reviews in one column in Pandas dataframe and I want to count the N-most frequent words with their frequency counts (in whole column - NOT in single cell). One approach is Counting the words using a counter, by iterating through each row. Is there a better alternative? Representative data. <pre class="prettyprint"><code>0 a heartening tale of small victories and endu 1 no sophomore slump for director sam mendes w 2 if you are an actor who can relate to the sea 3 it's this memory-as-identity obviation that g 4 boyd's screenplay ( co-written with guardian </code></pre>

<pre class="prettyprint"><code>from collections import Counter Counter(" ".join(df["text"]).split()).most_common(100) </code></pre> im pretty sure would give you what you want (you might have to remove some non-words from the counter result before calling most_common)

Count most frequent 100 words from sentences in Dataframe Pandas

Tags:

I have text reviews in one column in Pandas dataframe and I want to count the N-most frequent words with their frequency counts (in whole column - NOT in single cell). One approach is Counting the words using a counter, by iterating through each row. Is there a better alternative?

Representative data.

0    a heartening tale of small victories and endu 1    no sophomore slump for director sam mendes  w 2    if you are an actor who can relate to the sea 3    it's this memory-as-identity obviation that g 4    boyd's screenplay ( co-written with guardian

933

asked Apr 27 '15 18:04

swati saoji

1 Answers

from collections import Counter Counter(" ".join(df["text"]).split()).most_common(100)

im pretty sure would give you what you want (you might have to remove some non-words from the counter result before calling most_common)

answered Oct 16 '22 14:10

Joran Beasley

Related questions
                            
                                Systemd script does ExecStop right after ExecStart
                            
                                Stripe Error 400 - Cannot use stripe token more than once
                            
                                How to set a different color to the largest bar in a seaborn barplot
                            
                                Laravel 5.1 PHP DOMDocument() class not found
                            
                                Angular Grid ag-grid columnDefs Dynamically change
                            
                                Colon at the beginning of line in docker entrypoint bash script [duplicate]
                            
                                Run mocha excluding paths
                            
                                Java DateTimeFormatter for time zone with an optional colon separator?
                            
                                Unsupported Configuration: This file is set to build for a version older than the deployment target. Functionality may be limited
                            
                                iOS 9.3 : An SSL error has occurred and a secure connection to the server cannot be made
                            
                                RDP session is slow
                            
                                angular 2 typescript An implementation cannot be declared in ambient contexts

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With