Python - pandas - Append Series into Blank DataFrame

Tags:

Say I have two pandas Series in python:

import pandas as pd
h = pd.Series(['g',4,2,1,1])
g = pd.Series([1,6,5,4,"abc"])

I can create a DataFrame with just h and then append g to it:

df = pd.DataFrame([h])
df1 = df.append(g, ignore_index=True)

I get:

>>> df1
   0  1  2  3    4
0  g  4  2  1    1
1  1  6  5  4  abc

But now suppose that I have an empty DataFrame and I try to append h to it:

df2 = pd.DataFrame([])
df3 = df2.append(h, ignore_index=True)

This does not work. I think the problem is in the second-to-last line of code. I need to somehow define the blank DataFrame to have the proper number of columns.

By the way, the reason I am trying to do this is that I am scraping text from the internet using requests+BeautifulSoup and I am processing it and trying to write it to a DataFrame one row at a time.

651

asked May 31 '14 21:05

bill999

1 Answers

So if you don't pass an empty list to the DataFrame constructor then it works:

In [16]:

df = pd.DataFrame()
h = pd.Series(['g',4,2,1,1])
df = df.append(h,ignore_index=True)
df
Out[16]:
   0  1  2  3  4
0  g  4  2  1  1

[1 rows x 5 columns]

The difference between the two constructor approaches appears to be that the index dtypes are set differently, with an empty list it is an Int64 with nothing it is an object:

In [21]:

df = pd.DataFrame()
print(df.index.dtype)
df = pd.DataFrame([])
print(df.index.dtype)
object
int64

Unclear to me why the above should affect the behaviour (I'm guessing here).

UPDATE

After revisiting this I can confirm that this looks to me to be a bug in pandas version 0.12.0 as your original code works fine:

In [13]:

import pandas as pd
df = pd.DataFrame([])
h = pd.Series(['g',4,2,1,1])
df.append(h,ignore_index=True)

Out[13]:
   0  1  2  3  4
0  g  4  2  1  1

[1 rows x 5 columns]

I am running pandas 0.13.1 and numpy 1.8.1 64-bit using python 3.3.5.0 but I think the problem is pandas but I would upgrade both pandas and numpy to be safe, I don't think this is a 32 versus 64-bit python issue.

170

answered Sep 22 '22 03:09

EdChum

Related questions
                            
                                what does exclude in the meta class of django mean?
                            
                                How to resize column to content in ReportLab?
                            
                                Parallel I/O - why does it work?
                            
                                How do I convert a pandas pivot table to a dataframe
                            
                                How to solve matrix equation with sympy?
                            
                                How to group elements of a numpy array with the same value in separate numpy arrays
                            
                                Python clear the screen
                            
                                python matplotlib: how to automatically save figures in .fig format?
                            
                                Ran Pycharm debug which ended with exit code -1
                            
                                Is there a built-in function like Perl's splice in Python?
                            
                                Matplotlib not listening to font choices
                            
                                Paring Down a Dictionary of Lists in Python
                            
                                Reading large text files with Pandas [duplicate]
                            
                                Can't seem to remove "ns0:" namespace declaration [duplicate]
                            
                                Difference in tornado.gen.engine v/s tornado.gen.coroutine
                            
                                Importing from main app in a flask blueprint
                            
                                CSS parsing error when creating pdf with xhtml2pdf pisa.CreatePDF()
                            
                                <sqlite3.Row object at 0x1017fe3f0> instead of database contents
                            
                                How to make pytest display a custom string representation for fixture parameters?
                            
                                Using Flask-Security to authenticate REST API

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Python - pandas - Append Series into Blank DataFrame

Tags:

python

pandas

dataframe

matrix

bill999

People also ask

1 Answers

EdChum

Recent Activity

Donate For Us