Reading any text file having strange encoding?

Tags:

I have a text file with a strange encoding "UCS-2 Little Endian" that I want to read its contents using Java.

Opening the text file using NotePad++

As you can see in the above screenshot the file contents appear fine in Notepad++, but when i read it using this code, just garbage is being printed in the console:

String textFilePath = "c:\strange_file_encoding.txt"
BufferedReader reader = new BufferedReader( new InputStreamReader( new FileInputStream( filePath ), "UTF8" ) );
String line = "";

while ( ( line = reader.readLine() ) != null ) {
    System.out.println( line );  // Prints garbage characters 
}

The main point is that the user selects the file to read, so it can be of any encoding, and since I can't detect the file encoding I decode it using "UTF8" but as in the above example it fails to read it right.

Is there away to read such strange files in a right way ? Or at least can i detect if my code will fail to read it right ?

325

asked Mar 19 '13 22:03

Brad

1 Answers

You are using UTF-8 as your encoding in the InputStreamReader constructor, so it will try to interpret the bytes as UTF-8 instead of UCS-LE. Here is the documentation: Charset

I suppose you need to use UTF-16LE according to it.

Here is more info on the supported character sets and their Java names: Supported Encodings

answered Nov 10 '22 13:11

tempoc

Related questions
                            
                                Running code before and after all tests in a surefire execution
                            
                                Seed a random generator without time in java cross-plateform-ably
                            
                                Java Array in Jruby
                            
                                writing image to a jsp from database
                            
                                Does Google Talk support XMPP Multi-User Chat?
                            
                                Get output of terminal command using Java
                            
                                Example Maven pom.xml for Java based Selenium WebDriver project for Firefox
                            
                                Trying to show arabic characters in Java
                            
                                Can't import org.springframework.jdbc.core with maven
                            
                                Java generics : wildcards
                            
                                How to save ip address to a DB from authenticated user with Spring security?
                            
                                Custom hashcode/equals operation for HashMap
                            
                                NPE in clojure.lang.Compiler when trying to load a resource
                            
                                How can I handle multiple clients connected to a server using sockets?
                            
                                How can I transform SolrQuery(SOLRJ) to URL?
                            
                                Intellij-IDEA: How to put path to a method in my clipboard
                            
                                Generating BPEL files programmatically?
                            
                                How to Run a specific Main class from a jar
                            
                                Why I must override clone if i want cloneable class?
                            
                                Android selector with image and text

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With

Reading any text file having strange encoding?

Tags:

java

text-files

bufferedreader

fileinputstream

Brad

People also ask

1 Answers

tempoc

Recent Activity

Donate For Us