I have a situation where I need to index a dask dataframe by location. I see that there is not an .iloc
method available. Is there an alternative? Or am I required to use label-based indexing?
For example, I would like to do something like this:
import dask.dataframe as dd
import numpy as np
import pandas as pd
df = dd.from_pandas(pd.DataFrame({k:np.random.random(10) for k in ['a', 'b']}), npartitions=2)
inds = [1, 4, 6, 8]
df.iloc[inds]
Is this not possible with dask (e.g., perhaps because a positional location is not well-defined)? If so, what can I do if I only know the positional indices (not the labels) of the values I need to access?
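Positional row indexing is not available for dask dataframes, nor is it likely to be available in the near future. Dask does not keep track of the length of each partition, so it cannot tell which partition holds a given row position without first computing over the data.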