I have a table like this <pre class="prettyprint"><code> user company company2 company3 company4 1 Mac Lenovo Hp null 2 Mac MSI Sony </code></pre> And using pandas I would like it to be <pre class="prettyprint"><code> user company 1 Mac 1 Lenovo 1 Hp 2 Mac </code></pre> and so on Here I tried it but didnt work with pandas pivot. <pre class="prettyprint"><code>dataframe = pd.read_csv('data.csv') dataframe.fillna(value='', inplace=True) #dataframe.pivot(index='user', columns='company') </code></pre> Above code doesnt work and gives error.

you can use pd.melt method: <pre class="prettyprint"><code>In [211]: pd.melt(df, id_vars='user', value_vars=df.columns.drop('user').tolist()) Out[211]: user variable value 0 1 company Mac 1 2 company Mac 2 1 company2 Lenovo 3 2 company2 MSI 4 1 company3 Hp 5 2 company3 Sony 6 1 company4 null 7 2 company4 NaN </code></pre> or <pre class="prettyprint"><code>In [213]: pd.melt(df, id_vars='user', value_vars=df.columns.drop('user').tolist(), value_name='Company') \ .drop('variable',1) Out[213]: user Company 0 1 Mac 1 2 Mac 2 1 Lenovo 3 2 MSI 4 1 Hp 5 2 Sony 6 1 null 7 2 NaN </code></pre> UPDATE: dropping NaN's and sorting resulting DF by <code>user</code>: <pre class="prettyprint"><code>In [218]: pd.melt(df, ...: id_vars='user', value_vars=df.columns.drop('user').tolist(), ...: value_name='Company') \ ...: .drop('variable',1) \ ...: .dropna() \ ...: .sort_values('user') ...: Out[218]: user Company 0 1 Mac 2 1 Lenovo 4 1 Hp 6 1 null 1 2 Mac 3 2 MSI 5 2 Sony </code></pre> PS if you want to get rid of <code>null</code> values - use <code>df.replace('null', np.nan)</code> instead of <code>df</code>: <pre class="prettyprint"><code>In [219]: pd.melt(df.replace('null', np.nan), ...: id_vars='user', value_vars=df.columns.drop('user').tolist(), ...: value_name='Company') \ ...: .drop('variable',1) \ ...: .dropna() \ ...: .sort_values('user') ...: Out[219]: user Company 0 1 Mac 2 1 Lenovo 4 1 Hp 1 2 Mac 3 2 MSI 5 2 Sony </code></pre>

Change table to tall format using panda (UNPIVOT)

   user         company company2 company3 company4
    1           Mac     Lenovo    Hp      null              
    2           Mac       MSI     Sony

And using pandas I would like it to be

     user    company
     1          Mac
     1          Lenovo
     1          Hp
     2         Mac

and so on Here I tried it but didnt work with pandas pivot.

dataframe = pd.read_csv('data.csv')
dataframe.fillna(value='', inplace=True)
#dataframe.pivot(index='user', columns='company')

Above code doesnt work and gives error.

486

asked Apr 14 '17 19:04

Aurora

1 Answers

you can use pd.melt method:

In [211]: pd.melt(df, id_vars='user', value_vars=df.columns.drop('user').tolist())
Out[211]:
   user  variable   value
0     1   company     Mac
1     2   company     Mac
2     1  company2  Lenovo
3     2  company2     MSI
4     1  company3      Hp
5     2  company3    Sony
6     1  company4    null
7     2  company4     NaN

In [213]: pd.melt(df,
                  id_vars='user', value_vars=df.columns.drop('user').tolist(),
                  value_name='Company') \
            .drop('variable',1)
Out[213]:
   user Company
0     1     Mac
1     2     Mac
2     1  Lenovo
3     2     MSI
4     1      Hp
5     2    Sony
6     1    null
7     2     NaN

UPDATE: dropping NaN's and sorting resulting DF by user:

In [218]: pd.melt(df,
     ...:         id_vars='user', value_vars=df.columns.drop('user').tolist(),
     ...:         value_name='Company') \
     ...:   .drop('variable',1) \
     ...:   .dropna() \
     ...:   .sort_values('user')
     ...:
Out[218]:
   user Company
0     1     Mac
2     1  Lenovo
4     1      Hp
6     1    null
1     2     Mac
3     2     MSI
5     2    Sony

PS if you want to get rid of null values - use df.replace('null', np.nan) instead of df:

In [219]: pd.melt(df.replace('null', np.nan),
     ...:         id_vars='user', value_vars=df.columns.drop('user').tolist(),
     ...:         value_name='Company') \
     ...:   .drop('variable',1) \
     ...:   .dropna() \
     ...:   .sort_values('user')
     ...:
Out[219]:
   user Company
0     1     Mac
2     1  Lenovo
4     1      Hp
1     2     Mac
3     2     MSI
5     2    Sony

answered Sep 28 '22 12:09

MaxU - stop WAR against UA

Related questions
                            
                                Django - Generating random, unique slug field for each model object
                            
                                Showing total on stacked bar Plotly
                            
                                How does Python ensure the return value of __len__ is an integer when len is called?
                            
                                Add a bookmark to a PDF with PyPDF2
                            
                                Python 3D Plots over non-rectangular domain
                            
                                Remove redundant square brackets in a list python [duplicate]
                            
                                Creating gist directly from Jupyper notebook?
                            
                                Python OpenCV - Extrapolating the largest rectangle off of a set of contour points
                            
                                Incremental Word2Vec Model Training in gensim
                            
                                Python - How to generate the Pairwise Hamming Distance Matrix
                            
                                Django CreateView success message not shown
                            
                                Formatting consecutive numbers
                            
                                How do I receive the data coming from IBs API in Python?
                            
                                Pandas .dt.hour formatting
                            
                                Pandas: How to do a boxplot bases in rows values instead of column values?
                            
                                aws CLI unable to be used due to module colorama
                            
                                sqlalchemy table schema autoload
                            
                                Python pandas -> select by condition in columns name
                            
                                How can I use psycopg2.extras in sqlalchemy?
                            
                                Sum of previous rows values

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Change table to tall format using panda (UNPIVOT)

Tags:

python

pandas

dataframe

unpivot

Aurora

People also ask

1 Answers

MaxU - stop WAR against UA

Recent Activity

Donate For Us