Customizing the separator in pandas read_csv

I am reading many different data files into pandas DataFrames. The columns in these files are separated by spaces, but the number of spaces differs from file to file (some use one space, others two, and so on). As a result, every time I import a file I have to open it, count the spaces used, and pass exactly that many spaces to sep:

import pandas as pd
df = pd.read_csv('myfile.dat', sep='    ')

Is there any way I can tell pandas to assume "any number of spaces" as the separator? Also, is there any way I can tell pandas to use either tab (\t) or spaces as the separator?

709

asked Dec 20 '16 04:12

Peaceful

1 Answer

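Yes, you can use a simple regular expression such as sep='\s+', which matches one or more whitespace characters. Because \s covers tabs as well as spaces, the same separator also handles files that use tabs, or a mix of tabs and spaces.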
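
As a minimal sketch of how that separator behaves, the snippet below parses a small in-memory sample instead of a real file. The sample text and its column names are made up for illustration; the pattern is written as a raw string (r'\s+') so Python does not treat \s as a string escape.

```python
import io

import pandas as pd

# Hypothetical sample: columns separated by a mix of single spaces,
# multiple spaces, and tabs (stands in for a file such as 'myfile.dat').
raw = "a  b\tc\n1 2   3\n4\t5 6\n"

# r"\s+" matches any run of one or more whitespace characters,
# so every row splits into the same three columns.
df = pd.read_csv(io.StringIO(raw), sep=r"\s+")

print(df)
```

For a file on disk, the same call should work with the path in place of the StringIO object, for example pd.read_csv('myfile.dat', sep=r'\s+').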

answered Sep 17 '22 02:09

Ted Petrou
