Pandas: Add an empty row after every index in a MultiIndex dataframe

Tags:

python

python-3.x

pandas

dataframe

multi-index

Consider the df below:

              IA1  IA2  IA3
Name Subject               
Abc  DS        45   43   34
     DMS       43   23   45
     ADA       32   46   36
Bcd  BA        45   35   37
     EAD       23   45   12
     DS        23   35   43
Cdf  EAD       34   33   23
     ADA       12   34   25

How can I add an empty row after each Name index?

Expected output:

              IA1  IA2  IA3
Name Subject               
Abc  DS        45   43   34
     DMS       43   23   45
     ADA       32   46   36

Bcd  BA        45   35   37
     EAD       23   45   12
     DS        23   35   43

Cdf  EAD       34   33   23
     ADA       12   34   25

asked Jan 12 '21 07:01

Mayank Porwal

2 Answers

Use a custom function that adds an empty row to each group, and call it through GroupBy.apply:

def f(x):
    x.loc[('', ''), :] = ''
    return x

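Or, building the empty row explicitly (DataFrame.append was removed in pandas 2.0, so pd.concat is used here):

def f(x):
    # one all-empty row keyed by (group name, '')
    empty = pd.DataFrame('', columns=x.columns, index=pd.MultiIndex.from_tuples([(x.name, '')], names=x.index.names))
    return pd.concat([x, empty])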

df = df.groupby(level=0, group_keys=False).apply(f)
print(df)
             IA1 IA2 IA3
Name Subject            
Abc  DS       45  43  34
     DMS      43  23  45
     ADA      32  46  36
                        
Bcd  BA       45  35  37
     EAD      23  45  12
     DS       23  35  43
                        
Cdf  EAD      34  33  23
     ADA      12  34  25
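
For readers who want something they can paste and run, here is a minimal sketch of the same idea. It rebuilds the sample frame from the question and uses a plain loop over the groups with pd.concat instead of GroupBy.apply, so it is a variation on the answer rather than the answer's exact code. The helper name with_blank_rows is hypothetical.

```python
import pandas as pd

# Rebuild the sample frame from the question
index = pd.MultiIndex.from_tuples(
    [("Abc", "DS"), ("Abc", "DMS"), ("Abc", "ADA"),
     ("Bcd", "BA"), ("Bcd", "EAD"), ("Bcd", "DS"),
     ("Cdf", "EAD"), ("Cdf", "ADA")],
    names=["Name", "Subject"],
)
df = pd.DataFrame(
    {"IA1": [45, 43, 32, 45, 23, 23, 34, 12],
     "IA2": [43, 23, 46, 35, 45, 35, 33, 34],
     "IA3": [34, 45, 36, 37, 12, 43, 23, 25]},
    index=index,
)


def with_blank_rows(frame):
    """Return a copy of frame with an all-empty row after each level-0 group.

    Hypothetical helper: loops over the groups instead of using GroupBy.apply.
    """
    blank = pd.DataFrame(
        "",
        columns=frame.columns,
        index=pd.MultiIndex.from_tuples([("", "")], names=frame.index.names),
    )
    pieces = []
    for _, group in frame.groupby(level=0, sort=False):
        pieces.extend([group, blank])
    return pd.concat(pieces)


out = with_blank_rows(df)
print(out)
```

Because the filler is an empty string, the numeric columns become object dtype, so this is best treated as a display or export step rather than something to compute on afterwards.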

answered Oct 23 '22 16:10

jezrael

Another way is df.reindex with fill_value=''. Build one (Name, '') key per Name with pd.MultiIndex.from_product, add those keys to the existing index with Index.union, sort the combined keys by Name, and reindex on the result.

idx = df.index.union(pd.MultiIndex.from_product((df.index.levels[0], [''])), sort=False)
out = df.reindex(sorted(idx, key=lambda x: x[0]), fill_value='')

print(out)

             IA1 IA2 IA3
Name Subject            
Abc  DS       45  43  34
     DMS      43  23  45
     ADA      32  46  36
                        
Bcd  BA       45  35  37
     EAD      23  45  12
     DS       23  35  43
                        
Cdf  EAD      34  33  23
     ADA      12  34  25

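Passing sort=False to Index.union keeps the original order and appends the new keys at the end. Python's sorted is stable, so sorting on the first element only moves each empty key to the end of its Name group without reordering the subjects:

sorted(idx, key=lambda x: x[0])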

[('Abc', 'DS'),
 ('Abc', 'DMS'),
 ('Abc', 'ADA'),
 ('Abc', ''),
 ('Bcd', 'BA'),
 ('Bcd', 'EAD'),
 ('Bcd', 'DS'),
 ('Bcd', ''),
 ('Cdf', 'EAD'),
 ('Cdf', 'ADA'),
 ('Cdf', '')]
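
The reindex approach can be tried end to end with the sketch below. It rebuilds the sample frame from the question; the variable names blank_keys and ordered are illustrative, and wrapping the sorted keys in pd.MultiIndex.from_tuples with explicit names is an added precaution, not part of the original answer.

```python
import pandas as pd

# Rebuild the sample frame from the question
index = pd.MultiIndex.from_tuples(
    [("Abc", "DS"), ("Abc", "DMS"), ("Abc", "ADA"),
     ("Bcd", "BA"), ("Bcd", "EAD"), ("Bcd", "DS"),
     ("Cdf", "EAD"), ("Cdf", "ADA")],
    names=["Name", "Subject"],
)
df = pd.DataFrame(
    {"IA1": [45, 43, 32, 45, 23, 23, 34, 12],
     "IA2": [43, 23, 46, 35, 45, 35, 33, 34],
     "IA3": [34, 45, 36, 37, 12, 43, 23, 25]},
    index=index,
)

# One (Name, '') key per Name value
blank_keys = pd.MultiIndex.from_product(
    [df.index.levels[0], [""]], names=df.index.names
)

# sort=False keeps the existing keys in place and appends the new ones
idx = df.index.union(blank_keys, sort=False)

# Stable sort on Name only: each blank key ends up last in its group
ordered = pd.MultiIndex.from_tuples(
    sorted(idx, key=lambda key: key[0]), names=df.index.names
)

# Keys missing from df (the blank ones) are filled with ''
out = df.reindex(ordered, fill_value="")
print(out)
```

Note that sorting by Name puts the groups in alphabetical order, which matches the sample data but would reorder a frame whose Name values are not already sorted.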

answered Oct 23 '22 17:10

anky
