What's an "ignorable character in a Java identifier"

Tags:

intellij-idea

I stumbled across this doc and wondered what that was all about. Apparently you can have certain control characters inside identifiers and they are ignored:

public static void main(String[] args) throws Exception {
    int dummy = 123;
    System.out.println(dummy); // Has U+200B after the `d` before the `u`
}

I couldn't find anything about this in the JLS. IntelliJ IDEA gives an error in the editor saying "dummy" is an undeclared identifier (but nevertheless it compiles and runs). I guess that's an error in IntelliJ? What purpose do these "ignoreable characters" serve?

(Note: StackOverflow seems to remove my control characters from the question)

489

asked Jun 22 '17 14:06

Klitos Kyriacou

1 Answers

There is an open issue for this contradiction.

In summary, these characters are indeed ignored for identifier name matching by the compiler but JLS doesn't mention this. Instead JLS says:

Two identifiers are the same only if they are identical, that is, have the same Unicode character for each letter or digit.

Also

A "Java letter-or-digit" is a character for which the method Character.isJavaIdentifierPart(int) returns true

The contradiction is obvious as:

Character.isJavaIdentifierPart('\u0001')  -> true, so used to compare identifier names
Character.isIdentifierIgnorable('\u0001') -> true, should be ignored actually

I speculate that Intellij IDEA follows the JLS or they are simply unaware of ignorable characters. I don't see a bug report for this here.

As to what is the purpose of these ignorables, unicode specifies some Layout and Format Control Characters. It is suggested that these characters should be ignored in identifier names as

the effects they represent are stylistic or otherwise out of scope for identifiers, and second because the characters themselves often have no visible display

Apparently the purpose of isIdentifierIgnorable is to identify characters of this category. For instance it's mentioned in the isIdentifierIgnorable documentation that it returns true for characters that have the FORMAT general category value which are characters with unicode General_Category value of Cf which are included in the Layout and Format Control Characters

136

answered Oct 15 '22 18:10

Manos Nikolaidis

Related questions
                            
                                Where is project.properties in Android Studio project
                            
                                Spring boot No default constructor found on @SpringBootApplication class
                            
                                Add custom data source to Jaspersoft Studio
                            
                                Sending a stream of documents to a Jersey @POST endpoint
                            
                                Types in a LambdaMetaFactory
                            
                                ExecutorService that executes tasks sequentially but takes threads from a pool
                            
                                How to choose what implementation get's injected in to an autowired constructor
                            
                                Why does MockMvc always return empty content()?
                            
                                Splunk HttpEventCollectorLogbackAppender how to set source and host?
                            
                                How to configure logback-access.xml with Spring Boot
                            
                                Profiling Java application in kubernetes
                            
                                Spring Data REST + JPA remove from OneToMany collection [not owner side]
                            
                                Final field marked @NotNull is not initialized
                            
                                Vaadin: How to add META-INF/services to the war?
                            
                                Class inheritance: generic extends generic
                            
                                Maven - Transitive dependencies are not resolved for artifact deployed on Artifactory
                            
                                android testBuildType not working
                            
                                IntelliJ IDEA - "Imported project refers to unknown jdks JavaSE-1.8" while import Eclipse projects
                            
                                How glassfish domains are separated from each other?
                            
                                Error in hadoop jobs due to hive query error

Donate For Us

If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!

Donate Us With