I am parsing a webpage which has Unicode representations of fractions. I would like to be able to take those strings directly and convert them to floats. For example: "⅕" would become 0.2 Any suggestions of how to do this in Python?

You want to use the unicodedata module: <pre class="prettyprint"><code>import unicodedata unicodedata.numeric(u'⅕') </code></pre> This will print: <pre class="prettyprint"><code>0.20000000000000001 </code></pre> If the character does not have a numeric value, then <code>unicodedata.numeric(unichr[, default])</code> will return default, or if default is not given will raise ValueError.

How do I convert unicode characters to floats in Python?

2 Answers

You want to use the unicodedata module:

import unicodedata
unicodedata.numeric(u'⅕')

This will print:

0.20000000000000001

If the character does not have a numeric value, then unicodedata.numeric(unichr[, default]) will return default, or if default is not given will raise ValueError.

answered Sep 19 '22 21:09

Karl Voigtland

Those Unicode representations of floats are called Vulgar Fractions

You can covert them to floats using unicodedata.numeric(char)

However, numeric(char) won't work on something like 3¾. That takes a bit more effort:

from unicodedata import numeric

samples = ["3¼","19¼","3 ¼","10"]

for i in samples:
    if len(i) == 1:
        v = numeric(i)
    elif i[-1].isdigit():
        # normal number, ending in [0-9]
        v = float(i)
    else:
        # Assume the last character is a vulgar fraction
        v = float(i[:-1]) + numeric(i[-1])
    print(i, v)

Output:

3¼ 3.25
19¼ 19.25
3 ¼ 3.25
10 10.0

You might also be interested isolating these vulgar fractions from broader user input using regular expressions. You can do so using ranges of their unicode character codes:

/[\u2150-\u215E\u00BC-\u00BE]/g

Sample: https://regexr.com/3p8nd

answered Sep 20 '22 21:09

Jason Lewallen

Related questions
                            
                                python read csv file with row and column headers into dictionary with two keys
                            
                                NotImplementedError: Can't perform this operation for unregistered loader type
                            
                                Seaborn pairplot legend - how to control position
                            
                                Bar graph from dataframe groupby
                            
                                How to choose keys from a python dictionary based on weighted probability? [duplicate]
                            
                                What does -1 in numpy reshape mean? [duplicate]
                            
                                How to decode/deserialize Avro with Python from Kafka
                            
                                resize image canvas to maintain square aspect ratio in Python, OpenCv
                            
                                Calculate percentile of value in column
                            
                                How to omit module prefix?
                            
                                Pandas: Split a string and then create a new column?
                            
                                convert year and month name into datetime column for pandas dataframe
                            
                                How to iterate over lambda functions in Java
                            
                                tuples as function arguments
                            
                                Flask-Script: from flask._compat import text_type ModuleNotFoundError: No module named 'flask._compat'
                            
                                Passing on named variable arguments in python
                            
                                In Python, how can you easily retrieve sorted items from a dictionary?
                            
                                What's the most efficient way to insert thousands of records into a table (MySQL, Python, Django)
                            
                                Run a function every X minutes - Python
                            
                                Get first non-empty string from a list in python

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How do I convert unicode characters to floats in Python?

Tags:

python

floating-point

unicode

Paul

People also ask

2 Answers

Karl Voigtland

Jason Lewallen

Recent Activity

Donate For Us