I am trying to fit Blaze data object to scikit kmeans function.
from blaze import *
from sklearn.cluster import KMeans
data_numeric = Data('data.csv')
data_cluster = KMeans(n_clusters=5)
data_cluster.fit(data_numeric)
Data Sample:
A B C
1 32 34
5 57 92
89 67 21
Its throwing error :
I have been able to do it with Pandas Dataframe. Any way to feed blaze object to this function ?
I think you need to convert your pandas dataframe into an numpy array before you fit.
from blaze import *
import numpy
from sklearn.cluster import KMeans
data_numeric = numpy.array(data('data.csv'))
data_cluster = KMeans(n_clusters=5)
data_cluster.fit(data_numeric)
sklearn.cluster.KMeans
don't support input data with type blaze.interactive._Data
which is the type of data_numeric in your code.
You can use data_cluster.fit(data_numeric.peek())
to fit the transferred data_numeric with type DataFrame
supported by sklearn.cluster.KMeans
.
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With