Using collections.Counter to count emojis with different colors

I would like to use the collections.Counter class to count emojis in a string. It generally works fine. However, when I introduce colored emojis, the color component is separated from the emoji, like so:

>>> import collections
>>> emoji_string = "👌🏻👌🏼👌🏽👌🏾👌🏿"
>>> emoji_counter = collections.Counter(emoji_string)
>>> emoji_counter.most_common()
[('👌', 5), ('🏻', 1), ('🏼', 1), ('🏽', 1), ('🏾', 1), ('🏿', 1)]

How can I make most_common() return something like this instead?

[('👌🏻', 1), ('👌🏼', 1), ('👌🏽', 1), ('👌🏾', 1), ('👌🏿', 1)]

I'm using Python 3.6.

asked May 08 '17 16:05

Toni Sučić

1 Answer

You'll have to split your string into separate clusters. Each of your emoji is really two codepoints, the base emoji followed by an EMOJI MODIFIER FITZPATRICK TYPE-X codepoint:

>>> print(emoji_string[0])
👌
>>> print(emoji_string[1])
🏻
>>> print(emoji_string[:2])
👌🏻
>>> print(ascii(emoji_string[:2]))
'\U0001f44c\U0001f3fb'
>>> import unicodedata
>>> unicodedata.name(emoji_string[1])
'EMOJI MODIFIER FITZPATRICK TYPE-1-2'
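
The same inspection can be done for a whole string at once. The following is only a sketch; the helper name describe_code_points is made up for illustration and is not part of the standard library:

```python
import unicodedata


def describe_code_points(text):
    """Return (code point, Unicode name) pairs for every code point in text.

    Hypothetical helper for illustration only. Python 3 strings are
    sequences of code points, which is also what Counter iterates over.
    """
    return [
        ("U+{:04X}".format(ord(ch)), unicodedata.name(ch, "<unnamed>"))
        for ch in text
    ]


for code_point, name in describe_code_points("👌🏻"):
    print(code_point, name)
```

Each visible emoji in the question's string contributes two entries here, which is why Counter reports the base emoji and the modifiers separately.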

You could use a regular expression to keep each modifier together with the emoji that precedes it:

import re

char_with_modifier = re.compile(r'(.[\U0001f3fb-\U0001f3ff]?)')
split_emoji = char_with_modifier.findall(emoji_string)

and count the result.

Demo:

>>> import re
>>> from collections import Counter
>>> emoji_string = "👌🏻👌🏼👌🏽👌🏾👌🏿"
>>> char_with_modifier = re.compile(r'(.[\U0001f3fb-\U0001f3ff]?)')
>>> Counter(char_with_modifier.findall(emoji_string))
Counter({'👌🏻': 1, '👌🏼': 1, '👌🏽': 1, '👌🏾': 1, '👌🏿': 1})
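
If you need this in more than one place, the pattern can be wrapped in a small function. This is a minimal sketch that reuses the regular expression from above; the function name count_emoji_with_modifiers is an assumption chosen for illustration. It only handles the five Fitzpatrick modifiers, so sequences joined with ZERO WIDTH JOINER (for example family emoji) would still be split into their parts:

```python
import re
from collections import Counter

# Any single character, optionally followed by one of the five
# EMOJI MODIFIER FITZPATRICK code points (U+1F3FB through U+1F3FF).
CHAR_WITH_MODIFIER = re.compile(r'.[\U0001f3fb-\U0001f3ff]?')


def count_emoji_with_modifiers(text):
    """Count characters, keeping a skin-tone modifier with its base emoji.

    Hypothetical helper for illustration. Because '.' does not match
    newlines by default, newline characters are skipped, not counted.
    """
    return Counter(CHAR_WITH_MODIFIER.findall(text))


counts = count_emoji_with_modifiers("👌🏻👌🏻👌🏿👌")
print(counts.most_common())
```

An emoji without a modifier is counted under its own key, separately from its skin-tone variants.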

answered Sep 25 '22 10:09

Martijn Pieters

Tags: python, unicode, emoji, counter
