I am somehow having difficulty reading in this file into python with pandas read_table function. http://www.ssc.wisc.edu/~bhansen/econometrics/invest.dat
This is my code:
pd.read_table(f,skiprows=[0], sep="")
Which yields error:
TypeError: ord() expected a character, but string of length 0 found
If the DAT file is inside a system folder, you shouldn't attempt to open it, because it could be in use by one of your apps as a configuration file. You can also use the trial-and-error method by trying to open it with several apps, or you can contact the creator of the file.
dat file in Windows using a text editor, right-click on the file you want to open, and select Open With. Select the text editor you want to use, and click OK. You'll be able to read the file's contents if it's a text-based . dat file.
dat Explorer is a free app to open those 'winmail. dat' attachments. This app is free of charge, giving you access to the original attachment files without any need for further in-app purchases. The optional in-app-purchase will remove ads and help fund further development of new features.
Dont know about read_table, but you can read this file directly as follows:
import pandas as pd
with open('/tmp/invest.dat','r') as f:
next(f) # skip first row
df = pd.DataFrame(l.rstrip().split() for l in f)
print(df)
Prints:
0 1 2 3
0 17.749000 0.66007000 0.15122000 0.33150000
1 3.9480000 0.52889000 0.11523000 0.56233000
2 14.810000 3.7480300 0.57099000 0.12111000
...
...
The same can be obtained as follows:
df = pd.read_csv('/tmp/invest.dat', sep='\s+', header=None, skiprows=1)
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With