I'm using the excellent read_csv()
function from pandas, which gives:
In [31]: data = pandas.read_csv("lala.csv", delimiter=",")
In [32]: data
Out[32]:
<class 'pandas.core.frame.DataFrame'>
Int64Index: 12083 entries, 0 to 12082
Columns: 569 entries, REGIONC to SCALEKER
dtypes: float64(51), int64(518)
but when I apply a function from scikit-learn, I lose the information about the columns:
from sklearn import preprocessing
preprocessing.scale(data)
which returns a plain NumPy array.
Is there a way to apply scikit-learn or NumPy functions to DataFrames without losing this information?
Generally, scikit-learn works on any numeric data stored as NumPy arrays or SciPy sparse matrices. Other types that are convertible to numeric arrays, such as a pandas DataFrame, are also acceptable.
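That conversion is exactly why the labels disappear: scikit-learn only ever sees the underlying numeric array. A minimal sketch, using a small toy frame, of roughly what happens to the input internally:

import numpy as np
import pandas as pd

# A small toy DataFrame; scikit-learn only sees the underlying numeric array
df = pd.DataFrame({"a": [1.0, 2.0, 3.0], "b": [4.0, 5.0, 6.0]})

arr = np.asarray(df)   # roughly the conversion scikit-learn applies to its input
print(type(arr))       # <class 'numpy.ndarray'>
print(arr.shape)       # (3, 2) -- the index and column labels are gone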
Pandas is built on top of NumPy: the pandas package depends on the NumPy package, and it is designed to interoperate with many other third-party libraries. So NumPy is required for pandas to operate at all.
Pandas tends to perform better when the number of rows is 500K or more, while NumPy performs better with 50K rows or fewer. Indexing a pandas Series is also slow compared to indexing a NumPy array.
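You can check the indexing overhead yourself with a rough micro-benchmark (a sketch only; the exact numbers depend on your pandas/NumPy versions and hardware):

import timeit
import numpy as np
import pandas as pd

arr = np.arange(1000000)
ser = pd.Series(arr)

# Element access through the Series layer goes through extra indirection,
# so it is typically slower than raw ndarray access
print(timeit.timeit(lambda: arr[500000], number=100000))
print(timeit.timeit(lambda: ser[500000], number=100000))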
You could roughly define a Series as a wrapper around a NumPy array, and a DataFrame as a collection of Series with a shared index.
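A quick sketch of that relationship (Series.to_numpy() assumes pandas 0.24+; on older versions use .values instead):

import pandas as pd

s = pd.Series([10, 20, 30], index=["x", "y", "z"])
print(s.to_numpy())   # [10 20 30] -- the wrapped NumPy array, labels stripped

# A DataFrame is roughly a dict of Series sharing one index
df = pd.DataFrame({"col1": s, "col2": s * 2})
print(df.index)       # Index(['x', 'y', 'z'], dtype='object')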
This can be done by wrapping the returned array in a DataFrame, restoring the original index and columns information:
import pandas as pd
from sklearn import preprocessing

pd.DataFrame(preprocessing.scale(data), index=data.index, columns=data.columns)
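If you need this pattern often, you could factor it into a small helper (a sketch; rewrap is a hypothetical name, not part of pandas or scikit-learn, and it assumes the wrapped function preserves the input's shape):

import pandas as pd

def rewrap(func, df, *args, **kwargs):
    # Hypothetical helper: apply an array-returning function to a DataFrame
    # and restore the original index and columns (assumes shape is preserved)
    return pd.DataFrame(func(df, *args, **kwargs),
                        index=df.index, columns=df.columns)

# Usage: scaled keeps the same labels as data
# scaled = rewrap(preprocessing.scale, data)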