Populating a Pandas DataFrame frome another DataFrame based on column names

Tags:

pandas

I have a DataFrame of the following form:

And I have a list of column names that I need to use to create a new DataFrame using the columns of the first DataFrame that correspond to each label. For example, if my list of columns is ['a', 'b', 'b', 'a', 'c'], the resulting DataFrame should be:

    a b b a c
0   1 4 4 1 6   
1   3 2 2 3 4
2   4 1 1 4 5

I've been trying to figure out a fast way of performing this operations because I'm dealing with extremly large DataFrames and I don't think looping is a reasonable option.

778

asked Apr 07 '14 13:04

c_david

1 Answers

You can just use the list to select them:

In [44]:

cols = ['a', 'b', 'b', 'a', 'c']
df[cols]
Out[44]:
   a  b  b  a  c
0  1  4  4  1  6
1  3  2  2  3  4
2  4  1  1  4  5

[3 rows x 5 columns]

So no need for a loop, once you have created your dataframe df then using a list of column names will just index them and create the df you want.

179

answered Sep 30 '22 15:09

EdChum

Related questions
                            
                                Classifying text documents with random forests
                            
                                get all parents of xml node using python
                            
                                Writing webapps in python without Django or any framework [closed]
                            
                                How to format with a UNICODE string to JINJA's variable in a template?
                            
                                How to efficiently apply an operator to the cartesian product of two arrays?
                            
                                wxpython - One Frame, Multiple Panels, Modularized Code
                            
                                Scikit learn - How to use SVM and Random Forest for text classification?
                            
                                Python lists: indices of heapq.nlargest with repeating values in list
                            
                                Text alignment: extracting matching sequence using python
                            
                                Neat way to conditionally select elements from two lists?
                            
                                Add multiple line in python single table cell
                            
                                How do I 'force' python to use a specific version of a module?
                            
                                Web scraping a website with dynamic javascript content
                            
                                PyQt: How to set Combobox Items be Checkable?
                            
                                How do I quit a Django development server with a bash script?
                            
                                Why does setdefault evaluate default when key is set?
                            
                                is there a preferred way to test callbacks with pytest?
                            
                                dropping duplicates randomly
                            
                                Buffer to Image with PIL
                            
                                How to check if there exists a row with a certain column value in pandas dataframe

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With