Does CountVectorizer support partial_fit?
I would like to train the CountVectorizer using different batches of data.
CountVectorizer converts a collection of text documents to a matrix of token counts. This implementation produces a sparse representation of the counts using scipy.sparse.
CountVectorizer is a great tool provided by the scikit-learn library in Python. It is used to transform a given text into a vector on the basis of the frequency (count) of each word that occurs in the entire text.
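For instance, a minimal sketch (the documents here are toy examples):

from sklearn.feature_extraction.text import CountVectorizer

docs = ["whey protein shake", "whey protein powder"]
vectorizer = CountVectorizer()
X = vectorizer.fit_transform(docs)          # scipy.sparse matrix of token counts
print(vectorizer.get_feature_names_out())   # ['powder' 'protein' 'shake' 'whey']
print(X.toarray())                          # [[0 1 1 1] [1 1 0 1]]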
The default tokenization in CountVectorizer removes all special characters, punctuation and single characters. If this is not the behavior you desire, and you want to keep punctuation and special characters, you can provide a custom tokenizer to CountVectorizer.
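For example, here is a sketch of a whitespace-only tokenizer that preserves punctuation and single characters (the function name and regex are illustrative):

import re
from sklearn.feature_extraction.text import CountVectorizer

def keep_punct_tokenizer(text):
    # Split on whitespace only, so punctuation and single characters survive
    return re.findall(r"\S+", text)

# token_pattern=None silences the warning that the default pattern is unused
vectorizer = CountVectorizer(tokenizer=keep_punct_tokenizer, token_pattern=None)
vectorizer.fit(["Wait... what?!", "A B C"])
print(vectorizer.get_feature_names_out())  # ['a' 'b' 'c' 'wait...' 'what?!']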
CountVectorizer will tokenize the data and split it into chunks called n-grams, whose length we can define by passing a tuple to the ngram_range argument. For example, (1, 1) would give us unigrams (1-grams) such as "whey" and "protein", while (2, 2) would give us bigrams (2-grams), such as "whey protein".
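To illustrate, a small sketch comparing the two settings on a toy sentence:

from sklearn.feature_extraction.text import CountVectorizer

doc = ["whey protein is great"]
unigrams = CountVectorizer(ngram_range=(1, 1)).fit(doc)
bigrams = CountVectorizer(ngram_range=(2, 2)).fit(doc)
print(unigrams.get_feature_names_out())  # ['great' 'is' 'protein' 'whey']
print(bigrams.get_feature_names_out())   # ['is great' 'protein is' 'whey protein']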
No, CountVectorizer does not support partial_fit.
But you can write a simple method to accomplish your goal:
def partial_fit(self, data):
    # Keep the vocabulary learned from earlier batches, if any
    if hasattr(self, 'vocabulary_'):
        vocab = self.vocabulary_
    else:
        vocab = {}
    # Fit on the new batch (this overwrites self.vocabulary_)...
    self.fit(data)
    # ...then merge the old and new vocabularies and reindex the terms
    vocab = list(set(vocab.keys()).union(set(self.vocabulary_)))
    self.vocabulary_ = {vocab[i]: i for i in range(len(vocab))}
from sklearn.feature_extraction.text import CountVectorizer

# Attach the helper as a method on the class
CountVectorizer.partial_fit = partial_fit

# df is assumed to be a pandas DataFrame whose column 15 holds the raw text
vectorizer = CountVectorizer(stop_words='english')
vectorizer.fit(df[15].values[0:100])            # first batch
vectorizer.partial_fit(df[15].values[100:200])  # second batch
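Note that each call to fit resets the learned vocabulary, so only call transform once all batches have been merged; the merge also reindexes every term, which invalidates any count matrix produced earlier. A quick self-contained check on toy data (assuming the helper above has been attached to the class):

docs = ["whey protein shake", "casein protein powder"]
v = CountVectorizer()
v.fit(docs[:1])               # vocabulary from the first batch only
v.partial_fit(docs[1:])       # merged vocabulary now covers both batches
print(sorted(v.vocabulary_))  # ['casein', 'powder', 'protein', 'shake', 'whey']
print(v.transform(docs).toarray().sum())  # 6 -- every token is counted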