Specifying dtype float32 with pandas.read_csv on pandas 0.10.1

People also ask

What output type does pandas read_csv () return?

Read a CSV File In this case, the Pandas read_csv() function returns a new DataFrame with the data and labels from the file data. csv , which you specified with the first argument.

What does parse_dates in pandas do?

If True and parse_dates is enabled, pandas will attempt to infer the format of the datetime strings in the columns, and if it can be inferred, switch to a faster method of parsing them. In some cases this can increase the parsing speed by 5-10x.

0.10.1 doesn't really support float32 very much

see this http://pandas.pydata.org/pandas-docs/dev/whatsnew.html#dtype-specification

you can do this in 0.11 like this:

# dont' use dtype converters explicity for the columns you care about
# they will be converted to float64 if possible, or object if they cannot
df = pd.read_csv('test.csv'.....)

#### this is optional and related to the issue you posted ####
# force anything that is not a numeric to nan
# columns are the list of columns that you are interesetd in
df[columns] = df[columns].convert_objects(convert_numeric=True)


    # astype
    df[columns] = df[columns].astype('float32')

see http://pandas.pydata.org/pandas-docs/dev/basics.html#object-conversion

Its not as efficient as doing it directly in read_csv (but that requires
 some low-level changes)

I have confirmed that with 0.11-dev, this DOES work (on 32-bit and 64-bit, results are the same)

In [5]: x = pd.read_csv(StringIO.StringIO(data), dtype={'a': np.float32}, delim_whitespace=True)

In [6]: x
Out[6]: 
         a        b
0  0.76398  0.81394
1  0.32136  0.91063

In [7]: x.dtypes
Out[7]: 
a    float32
b    float64
dtype: object

In [8]: pd.__version__
Out[8]: '0.11.0.dev-385ff82'

In [9]: quit()
vagrant@precise32:~/pandas$ uname -a
Linux precise32 3.2.0-23-generic-pae #36-Ubuntu SMP Tue Apr 10 22:19:09 UTC 2012 i686 i686 i386 GNU/Linux

In [22]: df.a.dtype = pd.np.float32

In [23]: df.a.dtype
Out[23]: dtype('float32')

the above works fine for me under pandas 0.10.1

Related questions
                            
                                How to suppress pip upgrade warning?
                            
                                Shortest way of creating an object with arbitrary attributes in Python?
                            
                                Convert string into Date type on Python [duplicate]
                            
                                error: could not create '/Library/Python/2.7/site-packages/xlrd': Permission denied
                            
                                How do you alias a python class to have another name without using inheritance?
                            
                                What is the best stemming method in Python?
                            
                                scikit's GridSearch and Python in general are not freeing memory
                            
                                Emacs Inferior Python shell shows the send message with each python-shell-send-region command
                            
                                AppEngine bulkloader, high replication store and python27 runtime
                            
                                Logistic Regression PMML won't Produce Probabilities
                            
                                Out-of-core processing of sparse CSR arrays
                            
                                How can I define algebraic data types in Python?
                            
                                Python setuptools: how to include a config file for distribution into <prefix>/etc
                            
                                SQLAlchemy: Hybrid expression with relationship
                            
                                Can I write native iPhone, Android, Windows, Blackberry apps using Python? [duplicate]
                            
                                Return results from multiple models with Django REST Framework
                            
                                Why isn't __new__ in Python new-style classes a class method?
                            
                                Plug in django-allauth as endpoint in django-rest-framework
                            
                                Difference between different ways to create celery task
                            
                                Flask App: Update progress bar while function runs

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Specifying dtype float32 with pandas.read_csv on pandas 0.10.1

Tags:

python

pandas

numpy

People also ask

Recent Activity

Donate For Us