Converting long integers to strings in pandas (to avoid scientific notation)

Tags:

I want the following records (currently displaying as 3.200000e+18 but actually (hopefully) each a different long integer), created using pd.read_excel(), to be interpreted differently:

ipdb> self.after['class_parent_ref']
class_id
3200000000000515954    3.200000e+18
3200000000000515951             NaN
3200000000000515952             NaN
3200000000000515953             NaN
3200000000000515955    3.200000e+18
3200000000000515956    3.200000e+18
Name: class_parent_ref, dtype: float64

Currently, they seem to 'come out' as scientifically notated strings:

ipdb> self.after['class_parent_ref'].iloc[0]
3.2000000000005161e+18

Worse, though, it's not clear to me that the number has been read correctly from my .xlsx file:

ipdb> self.after['class_parent_ref'].iloc[0] -3.2e+18
516096.0

The number in Excel (the data source) is 3200000000000515952.

This is not about the display, which I know I can change here. It's about keeping the underlying data in the same form it was in when read (so that if/when I write it back to Excel, it'll look the same and so that if I use the data, it'll look like it did in Excel and not Xe+Y). I would definitely accept a string if I could count on it being a string representation of the correct number.

You may notice that the number I want to see is in fact (incidentally) one of the labels. Pandas correctly read those in as strings (perhaps because Excel treated them as strings?) unlike this number which I entered. (Actually though, even when I enter ="3200000000000515952" into the cell in question before redoing the read, I get the same result described above.)

How can I get 3200000000000515952 out of the dataframe? I'm wondering if pandas has a limitation with long integers, but the only thing I've found on it is 1) a little dated, and 2) doesn't look like the same thing I'm facing.

Thank you!

481

asked Oct 27 '14 20:10

HaPsantran

1 Answers

Convert your column values with NaN into 0 then typcast that column as integer to do so.

df[['class_parent_ref']] = df[['class_parent_ref']].fillna(value = 0)
df['class_parent_ref'] = df['class_parent_ref'].astype(int)

Or in reading your file, specify keep_default_na = False for pd.read_excel() and na_filter = False for pd.read_csv()

190

answered Sep 29 '22 01:09

Joe

Related questions
                            
                                Gunicorn+flask+pymongo+gevent hangs on initialization
                            
                                SQLAlchemy: add a child in many-to-many relationship by IDs
                            
                                tests under tox don't necessarily use the installed code
                            
                                Tell how an argument was received by a function?
                            
                                Sqlite3 Module in Python far Slower SELECT than in Shell
                            
                                Assigning to slices of pandas DataFrames
                            
                                pyodbc.Error: ('IM002', '[IM002] [unixODBC][Driver Manager]Data source name not found, and no default driver specified (0) (SQLDriverConnect)')
                            
                                python cql driver - cassandra.ReadTimeout - "Operation timed out - received only 1 responses."
                            
                                Segmentation fault with Python/Chrome/Java (linux mint)
                            
                                Create csv file with metadata header followed by timeseries in Python / Pandas
                            
                                Why is whoosh commit so slow
                            
                                Error in `/usr/bin/python': double free or corruption (out): 0x00007f7c3c017260
                            
                                Appropriate approach for Message Queue / Scheduled tasks in Django
                            
                                Uploading a static project to google app engines
                            
                                How many times the finalizer method is called and zombies (PEP 442)
                            
                                raw_input() and sys.stdin misbehaves on CTRL-C
                            
                                Python: how to reload modules that have been imported with *
                            
                                Sphinx is not updating documentation properly
                            
                                Python/Hive interface slow with fetchone(), hangs with fetchall()
                            
                                Python patch decorator spilling into other methods

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Converting long integers to strings in pandas (to avoid scientific notation)

Tags:

python

pandas

long-integer

HaPsantran

People also ask

1 Answers

Joe

Recent Activity

Donate For Us