How to get the number of rows in a Pandas chunk?

Tags: python, pandas, csv

I'm reading a huge CSV file by iterating over chunks. How can I get the size of the chunk currently being processed? In particular, the last chunk may have fewer rows than the number set with the chunksize parameter.

import pandas as pd
reader = pd.read_table('myFile.csv', sep=';', chunksize=100)

339

asked Jan 07 '17 17:01

Jonathan Roth

1 Answer

You need to check the length of each chunk, which is a DataFrame:

for x in reader:
    # Each chunk x is a DataFrame; all three expressions give its row count
    print(len(x.index))
    print(len(x))
    print(x.shape[0])
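
To see that the last chunk can be smaller, here is a minimal self-contained sketch. It assumes a made-up in-memory file of 250 rows (the names csv_text and chunk_sizes are only for illustration, not from the question) and uses pd.read_csv with the same sep and chunksize settings:

```python
import io

import pandas as pd

# Hypothetical in-memory data standing in for 'myFile.csv': 250 data rows.
csv_text = "a;b\n" + "\n".join(f"{i};{i * 2}" for i in range(250))

reader = pd.read_csv(io.StringIO(csv_text), sep=";", chunksize=100)

# Collect the row count of every chunk; the last one holds the remainder.
chunk_sizes = [len(chunk) for chunk in reader]
print(chunk_sizes)  # [100, 100, 50]
```

With 250 rows and chunksize=100, the first two chunks are full and the last one has only the remaining 50 rows, so len(chunk) is the safe way to get the actual size inside the loop.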

129

answered Oct 23 '22 08:10

jezrael
