Suppose I have a CSV file with 400 columns. I cannot load the entire file into a DataFrame (it won't fit in memory). However, I only really want 50 columns, and those will fit in memory. I don't see any built-in pandas way to do this. What do you suggest? I'm open to using the PyTables interface, or pandas.io.sql.
The best-case scenario would be a function like pandas.read_csv(...., columns=['name', 'age', ..., 'income']), i.e. we pass a list of column names (or numbers) that will be loaded.
You can create a new DataFrame with a specific column by using the DataFrame.assign() method. The assign() method assigns new columns to a DataFrame, returning a new object (a copy) with the new columns added to the original ones.
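A minimal sketch of assign(); the DataFrame contents and the new column name are made up for illustration:

    import pandas as pd

    # Toy data; names and values are hypothetical.
    df = pd.DataFrame({"name": ["Ann", "Bob"], "age": [34, 29]})

    # assign() returns a new DataFrame (a copy) with the extra
    # column added; the original df is left unchanged.
    df2 = df.assign(age_in_months=df["age"] * 12)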
Use pandas.read_csv() to read specific columns from a CSV file: call pd.read_csv(file_name, usecols=cols_list), with file_name as the name of the CSV file and cols_list as the list of columns to read (pass delimiter as well if the file is not comma-separated).
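For example, a short sketch (the file name people.csv and the column list are hypothetical):

    import pandas as pd

    cols_list = ["name", "age", "income"]

    # Only the listed columns are parsed; the other columns in the
    # file are skipped entirely, which keeps memory usage down.
    df = pd.read_csv("people.csv", usecols=cols_list)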
Step 1) To read data from a CSV file with the standard library, use the csv.reader() function to create a reader object. The reader takes each row of the file and turns it into a list of column values; from each row you then pick out the column you want the data for.
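A sketch of that approach with the standard csv module (the file name and column name are hypothetical):

    import csv

    with open("people.csv", newline="") as f:
        reader = csv.reader(f)        # each row becomes a list of column values
        header = next(reader)         # the first row holds the column names
        idx = header.index("income")  # position of the wanted column
        income = [row[idx] for row in reader]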
Ian, I implemented a usecols option which does exactly what you describe. It will be in the upcoming pandas 0.10; a development version will be available soon.
Since 0.10, you can use usecols like
df = pd.read_csv(...., usecols=['name', 'age',..., 'income'])
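As a runnable sketch (wide.csv and the column choices are hypothetical), usecols accepts either column labels or zero-based positions, which matches the "names (or numbers)" wish in the question:

    import pandas as pd

    # Select by label...
    df = pd.read_csv("wide.csv", usecols=["name", "age", "income"])

    # ...or by zero-based column position.
    df = pd.read_csv("wide.csv", usecols=[0, 3, 42])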