Should source code be saved in UTF-8 format?

Tags: java, eclipse, encoding, utf-8

How important is it to save your source code in UTF-8 format?

Eclipse on Windows uses the CP1252 (Windows-1252) character encoding by default. With CP1252, characters can be saved as bytes that are not valid UTF-8, and I have seen this happen when someone copies and pastes from a Word document into a comment.
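To make the Word paste problem concrete, here is a minimal sketch of what happens to one such character. It assumes the pasted character is a curly "smart" quote (U+201C), which is my guess at a typical culprit, and that the JRE provides the `windows-1252` charset. The class and method names are made up for illustration.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

class SmartQuoteBytes {
    // Returns the bytes of the text in the given charset as lowercase hex, e.g. "e2 80 9c".
    static String hex(String text, Charset charset) {
        StringBuilder sb = new StringBuilder();
        for (byte b : text.getBytes(charset)) {
            if (sb.length() > 0) {
                sb.append(' ');
            }
            sb.append(String.format("%02x", b & 0xff));
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // U+201C LEFT DOUBLE QUOTATION MARK, written as an escape so this file stays pure ASCII.
        String quote = "\u201C";
        Charset cp1252 = Charset.forName("windows-1252"); // assumed to be available

        System.out.println("windows-1252: " + hex(quote, cp1252));
        System.out.println("UTF-8:        " + hex(quote, StandardCharsets.UTF_8));

        // The single CP1252 byte is not a valid UTF-8 sequence, so decoding it as UTF-8
        // substitutes the replacement character U+FFFD.
        String misread = new String(quote.getBytes(cp1252), StandardCharsets.UTF_8);
        System.out.println("misread as:   U+" + String.format("%04X", (int) misread.charAt(0)));
    }
}
```

The same character is one byte (93) in CP1252 and three bytes (e2 80 9c) in UTF-8, so a file saved in one encoding and read in the other cannot round-trip that character.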

The reason I ask is that, out of habit, I set up the Maven encoding as UTF-8, and recently it has caught a few unmappable character errors.
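For anyone wanting to reproduce that kind of check outside the build, the following is a rough sketch of a strict UTF-8 validator. It is not what Maven or javac do internally, just the same idea: decode with errors reported instead of silently replaced. `Utf8Check` and `isValidUtf8` are hypothetical names.

```java
import java.nio.ByteBuffer;
import java.nio.charset.CharacterCodingException;
import java.nio.charset.CharsetDecoder;
import java.nio.charset.CodingErrorAction;
import java.nio.charset.StandardCharsets;

class Utf8Check {
    // Returns true only if the bytes form well-formed UTF-8.
    static boolean isValidUtf8(byte[] bytes) {
        CharsetDecoder decoder = StandardCharsets.UTF_8.newDecoder()
                .onMalformedInput(CodingErrorAction.REPORT)
                .onUnmappableCharacter(CodingErrorAction.REPORT);
        try {
            decoder.decode(ByteBuffer.wrap(bytes));
            return true;
        } catch (CharacterCodingException e) {
            return false;
        }
    }

    public static void main(String[] args) {
        byte[] ascii = "// plain comment".getBytes(StandardCharsets.US_ASCII);
        // 0x93 and 0x94 are the windows-1252 curly quotes; on their own they are not valid UTF-8.
        byte[] cp1252Comment = { '/', '/', ' ', (byte) 0x93, 'h', 'i', (byte) 0x94 };

        System.out.println(isValidUtf8(ascii));
        System.out.println(isValidUtf8(cp1252Comment));
    }
}
```

Running something like this over the bytes of each source file would flag the files that were saved in CP1252 with non-ASCII content before the compiler does.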

(update) Please add any reasons for doing so, and why. Are there some common gotchas that should be known?

(update) "What is your goal?" To find the best practice, so that when I am asked why we should use UTF-8 I have a good answer. Right now I don't.

asked Feb 01 '10 16:02

JARC

2 Answers

What is your goal? Balance your needs against the pros and cons of this choice.

UTF-8 Pros

- allows use of all character literals without \uHHHH escaping
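For comparison, this is roughly what the escaped alternative looks like. The sketch below converts a string to the \uHHHH form so that it could be stored in a pure ASCII source file. `UnicodeEscaper` and `toAsciiEscapes` are hypothetical names, not a JDK API.

```java
class UnicodeEscaper {
    // Replaces every UTF-16 code unit above 0x7F with a six-character backslash-u escape.
    // Characters outside the BMP come out as two escapes (a surrogate pair).
    static String toAsciiEscapes(String text) {
        StringBuilder sb = new StringBuilder();
        for (int i = 0; i < text.length(); i++) {
            char c = text.charAt(i);
            if (c < 0x80) {
                sb.append(c);
            } else {
                sb.append(String.format("\\u%04x", (int) c));
            }
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        // "cafe" with an acute accent on the e, itself written as an escape.
        String cafe = "caf\u00e9";
        System.out.println(toAsciiEscapes(cafe));
    }
}
```

The escaped form survives any ASCII-compatible encoding, but it is much harder to read and review than the literal character, which is the trade-off this list is about.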

UTF-8 Cons

- using non-ASCII character literals without \uHHHH increases risk of character corruption
  - font and keyboard issues can arise
  - need to document and enforce use of UTF-8 in all tools (editors, compilers, build scripts, diff tools)
- beware the byte order mark
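On the byte order mark: some editors write the three bytes EF BB BF at the start of a UTF-8 file, and Java's UTF-8 decoder keeps them as a U+FEFF character rather than dropping them. A minimal sketch of stripping it before decoding, with hypothetical names:

```java
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

class BomStripper {
    // Returns the input without a leading UTF-8 byte order mark (EF BB BF), if one is present.
    static byte[] stripUtf8Bom(byte[] bytes) {
        boolean hasBom = bytes.length >= 3
                && (bytes[0] & 0xff) == 0xEF
                && (bytes[1] & 0xff) == 0xBB
                && (bytes[2] & 0xff) == 0xBF;
        return hasBom ? Arrays.copyOfRange(bytes, 3, bytes.length) : bytes;
    }

    public static void main(String[] args) {
        byte[] withBom = { (byte) 0xEF, (byte) 0xBB, (byte) 0xBF, 'c', 'l', 'a', 's', 's' };

        // Without stripping, the decoded text starts with an invisible U+FEFF character.
        String raw = new String(withBom, StandardCharsets.UTF_8);
        String clean = new String(stripUtf8Bom(withBom), StandardCharsets.UTF_8);

        System.out.println(raw.length());
        System.out.println(clean.length());
    }
}
```

Configuring editors not to write a BOM in the first place is usually simpler than stripping it afterwards.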

ASCII Pros

- character/byte mappings are shared by a wide range of encodings
  - makes source files very portable
  - often obviates the need for specifying encoding meta-data (since the files would be identical if they were re-encoded as UTF-8, Windows-1252, ISO 8859-1 and most things short of UTF-16 and/or EBCDIC)
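The "identical if re-encoded" claim is easy to check. This sketch compares the encoded bytes of an ASCII-only snippet across charsets. It assumes `windows-1252` is available in the running JRE, and the class name is made up.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

class AsciiPortability {
    // True if the text encodes to exactly the same bytes in both charsets.
    static boolean sameBytes(String text, Charset a, Charset b) {
        return Arrays.equals(text.getBytes(a), text.getBytes(b));
    }

    public static void main(String[] args) {
        String source = "int x = 42; // ASCII only";
        Charset cp1252 = Charset.forName("windows-1252"); // assumed to be available

        System.out.println(sameBytes(source, StandardCharsets.UTF_8, cp1252));
        System.out.println(sameBytes(source, StandardCharsets.UTF_8, StandardCharsets.ISO_8859_1));
        // UTF-16 uses two bytes per character (plus a BOM when encoding), so it differs.
        System.out.println(sameBytes(source, StandardCharsets.UTF_8, StandardCharsets.UTF_16));
    }
}
```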

ASCII Cons

- limited character set
- this isn't the 1960s

Note: ASCII is 7-bit, not "extended" and not to be confused with Windows-1252, ISO 8859-1, or anything else.
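To illustrate why the 7-bit distinction matters, here is a small sketch that decodes the first byte outside the ASCII range (0x80) under three charsets. The class and method names are hypothetical, and `windows-1252` is assumed to be available in the JRE.

```java
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

class HighBitByte {
    // Decodes a single byte in the given charset and returns the resulting code point.
    static int decode(byte b, Charset charset) {
        return new String(new byte[] { b }, charset).codePointAt(0);
    }

    public static void main(String[] args) {
        byte highBit = (byte) 0x80; // first value outside the 7-bit ASCII range

        // Euro sign in windows-1252, a control character in ISO 8859-1,
        // and not a character at all in US-ASCII (decoded as the replacement character).
        System.out.printf("windows-1252: U+%04X%n", decode(highBit, Charset.forName("windows-1252")));
        System.out.printf("ISO-8859-1:   U+%04X%n", decode(highBit, StandardCharsets.ISO_8859_1));
        System.out.printf("US-ASCII:     U+%04X%n", decode(highBit, StandardCharsets.US_ASCII));
    }
}
```

The three charsets agree on every byte below 0x80 and disagree on this one, which is why "extended ASCII" is not a safe description of any of them.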

answered Oct 14 '22 13:10

McDowell

The important thing, at the least, is to be consistent with the encoding you use, to avoid red herrings. That means not X here, Y there and Z elsewhere. Save source code in encoding X. Set code input to encoding X. Set code output to encoding X. Set character-based FTP transfer to encoding X. And so on.
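In Java code, "encoding X everywhere" mostly comes down to never relying on the platform default. A minimal sketch, assuming X is UTF-8 and using in-memory streams in place of real files (all names here are illustrative):

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStreamReader;
import java.io.OutputStreamWriter;
import java.io.Reader;
import java.io.UncheckedIOException;
import java.io.Writer;
import java.nio.charset.Charset;
import java.nio.charset.StandardCharsets;

class ExplicitCharsetIo {
    // The one encoding used for both output and input.
    static final Charset ENCODING = StandardCharsets.UTF_8;

    // Writes the text through a Writer with an explicit charset and returns the bytes.
    static byte[] write(String text, Charset charset) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        try (Writer writer = new OutputStreamWriter(out, charset)) {
            writer.write(text);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return out.toByteArray();
    }

    // Reads all characters through a Reader with an explicit charset.
    static String read(byte[] bytes, Charset charset) {
        StringBuilder sb = new StringBuilder();
        try (Reader reader = new InputStreamReader(new ByteArrayInputStream(bytes), charset)) {
            int c;
            while ((c = reader.read()) != -1) {
                sb.append((char) c);
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        return sb.toString();
    }

    public static void main(String[] args) {
        String text = "na\u00efve"; // contains one non-ASCII character
        byte[] stored = write(text, ENCODING);

        // Same encoding on both sides: the text round-trips.
        System.out.println(read(stored, ENCODING).equals(text));
        // X here, Y there: the non-ASCII character is corrupted.
        System.out.println(read(stored, StandardCharsets.ISO_8859_1).equals(text));
    }
}
```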

Nowadays UTF-8 is a good choice, as it covers every character the human world is aware of and is supported almost everywhere. So yes, I would set the workspace encoding to it as well. That is how I use it myself.

answered Oct 14 '22 12:10

BalusC
