
What is the equivalent to iloc for dask dataframe?

Tags: python, dask

I have a situation where I need to index a dask dataframe by position. I see that no .iloc method is available. Is there an alternative, or am I required to use label-based indexing?

For example, I would like to do the following:

import dask.dataframe as dd
import numpy as np
import pandas as pd
df = dd.from_pandas(pd.DataFrame({k:np.random.random(10) for k in ['a', 'b']}), npartitions=2)
inds = [1, 4, 6, 8]
df.iloc[inds]

Is this not possible with dask? (Perhaps a positional location is not well-defined?) If so, what can I do when I only know the positional indices (not the labels) of the values I need to access?

Tim Morton asked Oct 16 '17 15:10


1 Answer

Positional indexing is not available for dask dataframe, nor is it likely to be available in the near future.

MRocklin answered Oct 29 '22 08:10