Split columns into MultiIndex with missing columns in pandas

Tags: python, pandas, dataframe, multi-index

This is similar to the problem I asked here. However, I found that the data I am working with is not always consistent. For example:

import pandas as pd

df = pd.DataFrame([[1,2,3,4],[5,6,7,8],[9,10,11,12]],columns=["X_a","Y_c","X_b","Y_a"])

   X_a  Y_c  X_b  Y_a
0    1    2    3    4
1    5    6    7    8
2    9   10   11   12

As you can see, X has no corresponding c column and Y has no corresponding b column. When I create the multi-level index, I want the dataframe to look like this:

     X             Y
     a    b   c    a    b   c
0    1    3   -1   4   -1   2
1    5    7   -1   8   -1   6
2    9   11   -1  12   -1  10

As you can see, I want the split done so that every upper-level column has the same set of lower-level columns. Since the dataset is positive, I am thinking of filling the missing columns with -1, although I am open to suggestions on this. The closest thing I found to my problem was this answer. However, I cannot get it to work with a MultiIndex like in my previous question. Any help is appreciated.


asked Sep 16 '17 06:09

Gambit1614

1 Answer

Split the column names on '_' to create a MultiIndex, then assign it to df.columns.

idx = df.columns.str.split('_', expand=True)
idx
MultiIndex(levels=[['X', 'Y'], ['a', 'b', 'c']],
           labels=[[0, 1, 0, 1], [0, 2, 1, 0]])

df.columns = idx
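
As a self-contained sketch of this first step, the split can be checked by listing the resulting column tuples. The frame is rebuilt from the question's data, and split_cols is a name introduced here only for illustration.

```python
import pandas as pd

# Data taken from the question
df = pd.DataFrame(
    [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12]],
    columns=["X_a", "Y_c", "X_b", "Y_a"],
)

# With expand=True, splitting an Index returns a MultiIndex
# with one level per split part
split_cols = df.columns.str.split("_", expand=True)

print(list(split_cols))
# [('X', 'a'), ('Y', 'c'), ('X', 'b'), ('Y', 'a')]
```

The column order is unchanged at this point; only the labels have become tuples.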

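Now build a new index from the Cartesian product of the existing MultiIndex levels, and use it to reindex the original DataFrame. The fill_value argument supplies the value for the columns that were missing.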

idx = pd.MultiIndex.from_product([idx.levels[0], idx.levels[1]])
idx
MultiIndex(levels=[['X', 'Y'], ['a', 'b', 'c']],
       labels=[[0, 0, 0, 1, 1, 1], [0, 1, 2, 0, 1, 2]])

df.reindex(columns=idx, fill_value=-1)
   X          Y       
   a   b  c   a  b   c
0  1   3 -1   4 -1   2
1  5   7 -1   8 -1   6
2  9  11 -1  12 -1  10
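
Putting the two steps together, a minimal sketch might look like the following. The helper name complete_columns and its sep and fill_value parameters are hypothetical, introduced here for illustration; the sketch also assumes every column name contains the separator.

```python
import pandas as pd


def complete_columns(df, sep="_", fill_value=-1):
    """Split flat 'upper<sep>lower' column names into a MultiIndex and add
    every missing (upper, lower) combination, filled with fill_value.

    Hypothetical helper; assumes every column name contains `sep`.
    """
    out = df.copy()
    out.columns = out.columns.str.split(sep, expand=True)
    # Cartesian product of the distinct labels found on each level
    full = pd.MultiIndex.from_product(list(out.columns.levels))
    # Columns absent from `out` are created and filled with fill_value
    return out.reindex(columns=full, fill_value=fill_value)


df = pd.DataFrame(
    [[1, 2, 3, 4], [5, 6, 7, 8], [9, 10, 11, 12]],
    columns=["X_a", "Y_c", "X_b", "Y_a"],
)
result = complete_columns(df)
print(result)
```

Since the question is open to alternatives to -1, one option is to pass fill_value=float("nan"), which marks the added columns as missing rather than giving them a numeric value; those columns would then hold floats instead of integers.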

answered Oct 11 '22 05:10

cs95
