Multiindex only some of columns in Pandas

Tags:

I have a csv which is generated in a format that I can not change. The file has a multi index. The file looks like this.

enter image description here

The end goal is to turn the top row (hours) into an index, and index it with the "ID" column, so that the data looks like this.

enter image description here

I have imported the file into pandas...

Click to copy

myfile = 'c:/temp/myfile.csv'
df = pd.read_csv(myfile, header=[0, 1], tupleize_cols=True)
pd.set_option('display.multi_sparse', False)
df.columns = pd.MultiIndex.from_tuples(df.columns, names=['hour', 'field'])
df

But that gives me three unnamed fields:

enter image description here

My final step is to stack on hour:

Click to copy

df.stack(level=['hour'])

But I a missing what comes before that, where I can index the other columns, even though there's a blank multiindex line above them.

550

asked Mar 11 '16 23:03

Sir Larry Wildman

1 Answers

I believe the lines you are missing may be # 3 and 4:

Click to copy

df = pd.io.parsers.read_csv('temp.csv', header = [0,1], tupleize_cols = True)
df.columns = [c for _, c in df.columns[:3]] + [c for c in df.columns[3:]]
df = df.set_index(list(df.columns[:3]), append = True)
df.columns = pd.MultiIndex.from_tuples(df.columns, names = ['hour', 'field'])

Convert the tuples to strings by dropping the first value for first 3 col. headers.
Shelter these headers by placing them in an index.

After you perform the stack, you may reset the index if you like.

e.g.

Before

Click to copy

  (Unnamed: 0_level_0, Date)  (Unnamed: 1_level_0, id)  \
0                  3/11/2016                         5   
1                  3/11/2016                         6   

  (Unnamed: 2_level_0, zone)  (100, p1)  (100, p2)  (200, p1)  (200, p2)  
0                        abc      0.678      0.787      0.337      0.979  
1                        abc      0.953      0.559      0.776      0.520

After

Click to copy

field                        p1     p2
  Date      id zone hour              
0 3/11/2016 5  abc  100   0.678  0.787
                    200   0.337  0.979
1 3/11/2016 6  abc  100   0.953  0.559
                    200   0.776  0.520

160

answered Sep 28 '22 10:09

hilberts_drinking_problem

Related questions
                            
                                From Matlab to Python - Solve function
                            
                                reshape numpy 3D array to 2D
                            
                                Using Django with virtualenv, get error ImportError: No module named 'django.core.servers.fastcgi'
                            
                                Find word not surrounded by alpha char
                            
                                Strange behavior of capturing group in regular expression
                            
                                How to access individual predictions in Spark RandomForest?
                            
                                Python: How to mock class attribute initializer function
                            
                                How to clear a plot in a `while` loop when using PyQtGraph?
                            
                                Python sorting dictionaries: Key [Ascending] and then Value [Descending]
                            
                                Matplotlib normalize colorbar (Python)
                            
                                Summary statistics on Large csv file using python pandas
                            
                                Count of unequal elements across numpy arrays
                            
                                Replacing punctuation except intra-word dashes with a space
                            
                                Should I generate *.pyc files when deploying?
                            
                                Scrapy + Splash + ScrapyJS
                            
                                Changing multiple characters by other characters in a string [duplicate]
                            
                                How can I enumerate rows in groups with Spark/Python?
                            
                                How can I get the Python compiler string programmatically?
                            
                                Keras - is it possible to view the weights and biases of models in Tensorboard
                            
                                Wrapping around a list as a slice operation

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Multiindex only some of columns in Pandas

Tags:

python

pandas

dataframe

multi-index

Sir Larry Wildman

People also ask

1 Answers

hilberts_drinking_problem

Recent Activity

Donate For Us