Add a new row to a Pandas DataFrame with specific index name

I'm trying to add a new row to the DataFrame with a specific index name 'e'.

    number   variable       values
a    NaN       bank          true   
b    3.0       shop          false  
c    0.5       market        true   
d    NaN       government    true

I have tried the following but it's creating a new column instead of a new row.

new_row = [1.0, 'hotel', 'true']
df = df.append(new_row)

I still don't understand how to insert a row with a specific index label. I'd be grateful for any suggestions.

295

asked Oct 07 '17 15:10

samba

3 Answers

You can use df.loc[_not_yet_existing_index_label_] = new_row.

Demo:

In [3]: df.loc['e'] = [1.0, 'hotel', 'true']

In [4]: df
Out[4]:
   number    variable values
a     NaN        bank   True
b     3.0        shop  False
c     0.5      market   True
d     NaN  government   True
e     1.0       hotel   true

PS: with this method you can't add a second row under an index label that already exists. If the label is already in the index, the existing row is overwritten instead.
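To make the add-versus-overwrite behaviour concrete, here is a minimal sketch. The frame is rebuilt from the table in the question (the boolean dtype of `values` is an assumption), and the `hostel` row is made up purely for illustration.

```python
import pandas as pd

# Rebuild the question's frame; the exact dtypes are assumed.
df = pd.DataFrame(
    {
        "number": [float("nan"), 3.0, 0.5, float("nan")],
        "variable": ["bank", "shop", "market", "government"],
        "values": [True, False, True, True],
    },
    index=list("abcd"),
)

# 'e' is not in the index yet, so .loc adds a new row (setting with enlargement).
df.loc["e"] = [1.0, "hotel", True]
rows_after_add = len(df)

# 'e' exists now, so the same kind of assignment overwrites that row.
df.loc["e"] = [2.0, "hostel", False]
rows_after_overwrite = len(df)

# Checking the index first is one way to avoid an accidental overwrite.
label = "e"
already_present = label in df.index
```

If overwriting would be a bug in your code, testing `label in df.index` before the assignment is a cheap guard.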

UPDATE:

This might not work in recent Pandas/Python 3 if the index is a DatetimeIndex and the new row's index label doesn't exist yet.

It will work if we specify a correct index value, for example a proper datetime rather than an arbitrary label.

Demo (using pandas: 0.23.4):

In [17]: ix = pd.date_range('2018-11-10 00:00:00', periods=4, freq='30min')

In [18]: df = pd.DataFrame(np.random.randint(100, size=(4,3)), columns=list('abc'), index=ix)

In [19]: df
Out[19]:
                      a   b   c
2018-11-10 00:00:00  77  64  90
2018-11-10 00:30:00   9  39  26
2018-11-10 01:00:00  63  93  72
2018-11-10 01:30:00  59  75  37

In [20]: df.loc[pd.to_datetime('2018-11-10 02:00:00')] = [100,100,100]

In [21]: df
Out[21]:
                       a    b    c
2018-11-10 00:00:00   77   64   90
2018-11-10 00:30:00    9   39   26
2018-11-10 01:00:00   63   93   72
2018-11-10 01:30:00   59   75   37
2018-11-10 02:00:00  100  100  100

In [22]: df.index
Out[22]: DatetimeIndex(['2018-11-10 00:00:00', '2018-11-10 00:30:00', '2018-11-10 01:00:00', '2018-11-10 01:30:00', '2018-11-10 02:00:00'], dtype='datetime64[ns]', freq=None)
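As a self-contained variation on the demo, the sketch below derives the new label from the last timestamp in the index instead of typing it out. The zero-filled data and the `new_label` name are assumptions made for illustration only.

```python
import numpy as np
import pandas as pd

ix = pd.date_range("2018-11-10 00:00:00", periods=4, freq="30min")
# Zeros stand in for the random integers used in the demo.
df = pd.DataFrame(np.zeros((4, 3), dtype=int), columns=list("abc"), index=ix)

# Build the next label as a pd.Timestamp, 30 minutes after the last row.
new_label = ix[-1] + pd.Timedelta(minutes=30)
df.loc[new_label] = [100, 100, 100]
```

Because the label is a `pd.Timestamp`, the index should remain a `DatetimeIndex` after the row is added.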

200

answered Oct 15 '22 20:10

MaxU - stop WAR against UA

Use append after converting the list to a DataFrame, in case you want to add multiple rows at once, i.e.

df = df.append(pd.DataFrame([new_row],index=['e'],columns=df.columns))

Or for single row (Thanks @Zero)

df = df.append(pd.Series(new_row, index=df.columns, name='e'))

Output:

   number    variable values
a     NaN        bank   True
b     3.0        shop  False
c     0.5      market   True
d     NaN  government   True
e     1.0       hotel   true
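Note that `DataFrame.append` was deprecated in pandas 1.4 and removed in pandas 2.0. On newer versions the same idea can be written with `pd.concat`. This is a minimal sketch; the two-row frame is a cut-down stand-in for the question's data.

```python
import pandas as pd

# Small stand-in for the question's frame.
df = pd.DataFrame(
    {"number": [3.0, 0.5], "variable": ["shop", "market"], "values": [False, True]},
    index=["b", "c"],
)
new_row = [1.0, "hotel", True]

# Wrap the list in a one-row DataFrame that carries the desired index label.
row_df = pd.DataFrame([new_row], index=["e"], columns=df.columns)

# concat returns a new frame; the originals are left unchanged.
df = pd.concat([df, row_df])
```

To add several rows at once, pass more inner lists and more labels when building `row_df`.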

answered Oct 15 '22 20:10

Bharath

If it's the first row you need:

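import pandas as pd

# Column names must be quoted strings, and the class is pd.DataFrame.
df = pd.DataFrame(columns=['number', 'variable', 'values'])
df.loc['e', ['number', 'variable', 'values']] = [1.0, 'hotel', 'true']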

answered Oct 15 '22 19:10

Kim Miller


Tags:

python

pandas

dataframe