How to construct pandas dataframe from series of arrays

Tags:

Hi I have the following pandas Series of numpy arrays:

 datetime
    03-Sep-15     [53.5688348969, 31.2542494769, 18.002043765]
    04-Sep-15     [46.845084292, 27.0833015735, 15.5997887379]
    08-Sep-15    [52.8701581666, 30.7347431703, 17.6379377917]
    09-Sep-15    [47.9535624339, 27.7063099999, 15.9126963643]
    10-Sep-15     [51.2900606534, 29.600945626, 16.8756260105]

Do you know how I could convert it into a dataframe with 3 columns? Thanks!

912

asked Sep 15 '15 19:09

NickD1

2 Answers

Feeding a list of lists to pd.DataFrame is a more efficient approach:

s = pd.Series([np.array([53.5688348969, 31.2542494769, 18.002043765]),
               np.array([46.845084292, 27.0833015735, 15.5997887379]),
               np.array([52.8701581666, 30.7347431703, 17.6379377917]),
               np.array([47.9535624339, 27.7063099999, 15.9126963643]),
               np.array([51.2900606534, 29.600945626, 16.8756260105])],
              index=['03-Sep-15', '04-Sep-15', '08-Sep-15', '09-Sep-15', '10-Sep-15'])

df = pd.DataFrame(s.values.tolist(), index=s.index)

print(df)

                   0          1          2
03-Sep-15  53.568835  31.254249  18.002044
04-Sep-15  46.845084  27.083302  15.599789
08-Sep-15  52.870158  30.734743  17.637938
09-Sep-15  47.953562  27.706310  15.912696
10-Sep-15  51.290061  29.600946  16.875626

Benchmarking on Python 3.6 / Pandas 0.19:

%timeit pd.DataFrame(s.values.tolist(), index=s.index)  # 448 µs per loop
%timeit s.apply(pd.Series)                              # 1.5 ms per loop

answered Oct 17 '22 12:10

jpp

It won't be super-performant, but you should be able to apply(pd.Series):

>>> ser
03-Sep-15     [53.5688348969, 31.2542494769, 18.002043765]
04-Sep-15     [46.845084292, 27.0833015735, 15.5997887379]
08-Sep-15    [52.8701581666, 30.7347431703, 17.6379377917]
09-Sep-15    [47.9535624339, 27.7063099999, 15.9126963643]
10-Sep-15     [51.2900606534, 29.600945626, 16.8756260105]
dtype: object
>>> type(ser.values[0])
<class 'numpy.ndarray'>
>>> ser.apply(pd.Series)
                   0          1          2
03-Sep-15  53.568835  31.254249  18.002044
04-Sep-15  46.845084  27.083302  15.599789
08-Sep-15  52.870158  30.734743  17.637938
09-Sep-15  47.953562  27.706310  15.912696
10-Sep-15  51.290061  29.600946  16.875626

answered Oct 17 '22 12:10

DSM

Related questions
                            
                                How can I serialize a MongoDB ObjectId with Marshmallow?
                            
                                What does the Python InsecureRequestWarning really mean?
                            
                                flake8 doesn't report mixed-case function names
                            
                                'utf-8' codec can't decode byte reading a file in Python3.4 but not in Python2.7
                            
                                Given n tuples representing pairs, return a list with connected tuples
                            
                                Django migration file in an other app?
                            
                                Python pyqtgraph how to set x and y axis limits on graph, no autorange
                            
                                pkg_resources.resource_stream fails on python3
                            
                                Why do some includes in Django need strings, and others variable names?
                            
                                Save image with matplotlib.pyplot [duplicate]
                            
                                Create "The Economist" style graphs from python
                            
                                Index Error: list index out of range in Django
                            
                                Refering to a directory in a Flask app doesn't work unless the path is absolute
                            
                                Plot circular gradients using PIL in Python
                            
                                Flatten numpy array but also keep index of value positions?
                            
                                Parse XML Sitemap with Python
                            
                                How to add newline to end of file.write()?
                            
                                Why does python allow spaces between an object and the method name after the "."
                            
                                Python Fuzzy Matching (FuzzyWuzzy) - Keep only Best Match
                            
                                GitPython: Get current tag (detached head)

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

How to construct pandas dataframe from series of arrays

Tags:

python

arrays

pandas

dataframe

numpy

NickD1

People also ask

2 Answers

jpp

DSM

Recent Activity

Donate For Us