Filter pandas dataframe with specific column names in python

Tags: python, pandas, dataframe

I have a pandas dataframe and a list as follows:

mylist = ['nnn', 'mmm', 'yyy']
mydata =
   xxx  yyy  zzz  nnn  ddd  mmm
0    0   10    5    5    5    5
1    1    9    2    3    4    4
2    2    8    8    7    9    0

Now, I want to keep only the columns listed in mylist and save them as a CSV file.

i.e.

   yyy  nnn  mmm
0   10    5    5
1    9    3    4
2    8    7    0

My current code is as follows.

mydata = pd.read_csv( input_file, header=0)

for item in mylist:
    mydata_new = mydata[item]

print(mydata_new)
mydata_new.to_csv(file_name)

My new dataframe does not contain the expected result. Where am I going wrong? Please help me!

asked Jan 11 '18 00:01

J Cena

1 Answer

Just pass a list of column names to index df:

df[['nnn', 'mmm', 'yyy']]

   nnn  mmm  yyy
0    5    5   10
1    3    4    9
2    7    0    8
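
For completeness, here is a minimal end-to-end sketch applied to the frame from the question. Two things in it are assumptions made so the snippet runs on its own: the data is rebuilt inline rather than read from input_file, and the CSV is written to an in-memory buffer with index=False (drop that argument if the row index should be kept in the file).

```python
import io

import pandas as pd

# Rebuild the question's frame inline (stand-in for pd.read_csv(input_file)).
mydata = pd.DataFrame({
    "xxx": [0, 1, 2],
    "yyy": [10, 9, 8],
    "zzz": [5, 2, 8],
    "nnn": [5, 3, 7],
    "ddd": [5, 4, 9],
    "mmm": [5, 4, 0],
})
mylist = ["nnn", "mmm", "yyy"]

# Indexing with a list returns a DataFrame with the columns in list order.
# The loop in the question instead overwrote mydata_new on every pass,
# so only the last column ('yyy') survived, as a Series.
mydata_new = mydata[mylist]

# Written to an in-memory buffer here; pass a file path to to_csv to save to disk.
buffer = io.StringIO()
mydata_new.to_csv(buffer, index=False)
csv_text = buffer.getvalue()
print(csv_text)
```

In the original code, replacing the for loop with the single line mydata_new = mydata[mylist] should be enough.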

If your list may contain column names that do not exist in df, filter with df.columns.isin instead. Missing names are ignored, and the columns come back in the frame's original order:

df.loc[:, df.columns.isin(['nnn', 'mmm', 'yyy', 'zzzzzz'])]

   yyy  nnn  mmm
0   10    5    5
1    9    3    4
2    8    7    0
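
If the order of the list should be preserved while still skipping missing names, one option is to filter the list itself before indexing. This is a sketch rather than part of the original answer; the names wanted and present are illustrative, and the frame is rebuilt inline from the question's data.

```python
import pandas as pd

# Same data as in the question, rebuilt inline for a self-contained example.
df = pd.DataFrame({
    "xxx": [0, 1, 2],
    "yyy": [10, 9, 8],
    "zzz": [5, 2, 8],
    "nnn": [5, 3, 7],
    "ddd": [5, 4, 9],
    "mmm": [5, 4, 0],
})
wanted = ["nnn", "mmm", "yyy", "zzzzzz"]  # 'zzzzzz' is not a column of df

# Keep only the names that exist in df, in the order given by the list.
present = [col for col in wanted if col in df.columns]
result = df[present]

# For comparison: the boolean mask keeps the frame's own column order.
masked = df.loc[:, df.columns.isin(wanted)]

print(result)
print(masked)
```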

answered Sep 18 '22 17:09

cs95
