First, you need to decode the file contents, not encode them. Second, the csv module doesn't handle unicode strings in Python 2.7, so having decoded your data you need to convert it back to UTF-8. Finally, csv.reader expects an iterable of the file's lines, not one big string with linebreaks in it.
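A minimal Python 2.7 sketch of that sequence, assuming a UTF-8 encoded file named 'info.csv' (the name is just illustrative):

import csv

raw = open('info.csv', 'rb').read()
# decode the file contents first ('utf-8-sig' also strips a BOM, if one is present)...
text = raw.decode('utf-8-sig')
# ...then re-encode to UTF-8 bytes, because the Python 2.7 csv module does not accept unicode
lines = text.encode('utf-8').splitlines()
# csv.reader wants an iterable of lines, not one big string with linebreaks
for row in csv.reader(lines):
    print row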
When opening a file for reading, Python needs to know exactly how the file should be opened. For reading there are two access modes: text mode and binary mode. The respective flags are 'r' and 'rb', and they are specified when opening a file with the built-in open() method.
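For example, a two-line sketch (Python 3 semantics, with 'info.txt' as a placeholder name):

text = open('info.txt', 'r').read()   # text mode: returns a decoded str
data = open('info.txt', 'rb').read()  # binary mode: returns raw, undecoded bytes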
I can't find a duplicate of this for Python 3, which handles encodings differently from Python 2. So here's the answer: instead of opening the file with the default encoding (which on most systems is 'utf-8'), use 'utf-8-sig', which expects and strips off the UTF-8 Byte Order Mark, which is what shows up as the stray '\ufeff' at the start of your data.
That is, instead of
data = open('info.txt')
Do
data = open('info.txt', encoding='utf-8-sig')
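For instance, reading the first line back then shows the BOM has been stripped (a small sketch, still using the 'info.txt' name from above):

with open('info.txt', encoding='utf-8-sig') as data:
    first = data.readline()
# first no longer begins with the invisible '\ufeff' character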
Note that if you're on Python 2, you should see e.g. Python, Encoding output to UTF-8 and Convert UTF-8 with BOM to UTF-8 with no BOM in Python. You'll need to do some shenanigans with codecs or with str.decode for this to work right in Python 2. But in Python 3, all you need to do is set the encoding= parameter when you open the file.
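A rough Python 2 sketch of those two routes, again assuming a file named 'info.txt':

import codecs

# via codecs: open the file with the BOM-stripping codec
with codecs.open('info.txt', encoding='utf-8-sig') as f:
    text = f.read()              # a unicode string, BOM already removed

# or via str.decode: read the raw bytes and decode them yourself
raw = open('info.txt', 'rb').read()
text = raw.decode('utf-8-sig')   # likewise unicode, without the BOM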
I had a very similar problem when dealing with Excel CSV files. Initially I had saved my file from the drop-down choices as a .csv UTF-8 (comma delimited) file. Then I saved it as just a .csv (comma delimited) file and all was well. Perhaps a similar issue applies to a .txt file.