Python pickle error: UnicodeDecodeError

I'm trying to do some text classification using TextBlob. I first train the model and then serialize it with pickle, as shown below.

    import pickle
    from textblob.classifiers import NaiveBayesClassifier

    with open('sample.csv', 'r') as fp:
        cl = NaiveBayesClassifier(fp, format="csv")

    f = open('sample_classifier.pickle', 'wb')
    pickle.dump(cl, f)
    f.close()

When I then try to load the pickled classifier:

    import pickle

    f = open('sample_classifier.pickle', encoding="utf8")
    cl = pickle.load(f)
    f.close()

I get this error:

UnicodeDecodeError: 'utf-8' codec can't decode byte 0x80 in position 0: invalid start byte

Here are the contents of my sample.csv:

My SQL is not working correctly at all. This was a wrong choice, SQL

I've issues. Please respond immediately, Support

Where am I going wrong here? Please help.

asked Oct 05 '15 20:10

90abyss

1 Answer

By opening the file in mode wb, you chose to write raw binary data; no character encoding is applied. The byte 0x80 in the error message is the first byte of the pickle stream, which is not valid UTF-8.

To read the file back, simply open it in mode rb, without the encoding argument.
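
As a rough illustration of the round trip, here is a minimal sketch. It assumes a plain dict as a stand-in for the trained classifier (so it runs without TextBlob) and writes to a temporary directory; the names model, path, first_byte and restored are made up for the example.

```python
import os
import pickle
import tempfile

# Stand-in for the trained classifier (assumption: any picklable
# object goes through the same write/read path).
model = {"labels": ["SQL", "Support"], "trained": True}

path = os.path.join(tempfile.mkdtemp(), "sample_classifier.pickle")

# Write in binary mode: pickle produces bytes, not text.
with open(path, "wb") as f:
    pickle.dump(model, f)

# Peek at the first byte of the file. It is 0x80, the byte the
# UTF-8 decoder rejected at position 0 in the question.
with open(path, "rb") as f:
    first_byte = f.read(1)

# Read back in binary mode, with no encoding argument.
with open(path, "rb") as f:
    restored = pickle.load(f)

print(first_byte, restored == model)
```

Using with blocks for both the write and the read also closes the files automatically, so the explicit f.close() calls are no longer needed.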

answered Sep 23 '22 19:09

donkopotamus

Tags: python, pickle, textblob
