Pandas DataFrame stack multiple column values into single column

Tags:

Assuming the following DataFrame:

  key.0 key.1 key.2  topic 1   abc   def   ghi      8 2   xab   xcd   xef      9

How can I combine the values of all the key.* columns into a single column 'key', that's associated with the topic value corresponding to the key.* columns? This is the result I want:

   topic  key 1      8  abc 2      8  def 3      8  ghi 4      9  xab 5      9  xcd 6      9  xef

Note that the number of key.N columns is variable on some external N.

999

asked Dec 19 '15 22:12

borice

1 Answers

You can melt your dataframe:

>>> keys = [c for c in df if c.startswith('key.')] >>> pd.melt(df, id_vars='topic', value_vars=keys, value_name='key')     topic variable  key 0      8    key.0  abc 1      9    key.0  xab 2      8    key.1  def 3      9    key.1  xcd 4      8    key.2  ghi 5      9    key.2  xef

It also gives you the source of the key.

From v0.20, melt is a first class function of the pd.DataFrame class:

>>> df.melt('topic', value_name='key').drop('variable', 1)     topic  key 0      8  abc 1      9  xab 2      8  def 3      9  xcd 4      8  ghi 5      9  xef

129

answered Sep 21 '22 19:09

Alexander

Related questions
                            
                                Recursion: how to avoid Python set changed set during iteration RuntimeError
                            
                                write numpy ndarray to Image
                            
                                How to convert tuple of tuples to pandas.DataFrame in Python?
                            
                                How to run only one test in tox?
                            
                                Python multiprocessing installation: Command "python setup.py egg_info" failed with error code 1
                            
                                Pytorch: how to add L1 regularizer to activations?
                            
                                aws lambda Unable to import module 'lambda_function': No module named 'requests'
                            
                                How do you override vim options via comments in a python source code file?
                            
                                django content types - how to get model class of content type to create a instance?
                            
                                How to handle urllib's timeout in Python 3?
                            
                                What is a reference cycle in python?
                            
                                class variables is shared across all instances in python? [duplicate]
                            
                                How do I change button size in Python?
                            
                                InvalidRequestError: VARCHAR requires a length on dialect mysql
                            
                                Python OrderedDict iteration
                            
                                Trouble installing private github repository using pip
                            
                                How to make Ipython output a list without line breaks after elements?
                            
                                Overloading Addition, Subtraction, and Multiplication Operators
                            
                                Transpose nested list in python
                            
                                Pandas Correlation Groupby

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Pandas DataFrame stack multiple column values into single column

Tags:

python

pandas

dataframe

melt

borice

People also ask

1 Answers

Alexander

Recent Activity

Donate For Us