From Creating a subset of words from a corpus in R, the answerer can easily convert a term-document matrix
into a word cloud easily.
Is there a similar function from python libraries that takes either a raw word textfile or NLTK
corpus or Gensim
Mmcorpus into a word cloud?
The result will look somewhat like this:
The three steps are: Extract the review (text document) Create and generate a wordcloud image. Display the cloud using matplotlib.
from wordcloud import WordCloud, STOPWORDS import matplotlib.pyplot as plt stopwords = set(STOPWORDS) def show_wordcloud(data, title = None): wordcloud = WordCloud( background_color='white', stopwords=stopwords, max_words=200, max_font_size=40, scale=3, random_state=1 # chosen at random by flipping a coin; it was heads ).generate(str(data)) fig = plt.figure(1, figsize=(12, 12)) plt.axis('off') if title: fig.suptitle(title, fontsize=20) fig.subplots_adjust(top=2.3) plt.imshow(wordcloud) plt.show() show_wordcloud(Samsung_Reviews_Negative['Reviews']) show_wordcloud(Samsung_Reviews_positive['Reviews'])
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With