Scintilla Supports Unicode? What about SCI_GETCHARAT?

Question

Does Scintilla really support Unicode? If so, why does SCI_GETCHARAT return a char value (casted to LRESULT)?

Martin Stone · Accepted Answer

From the SCI_SETCODEPAGE docs...

Code page SC_CP_UTF8 (65001) sets Scintilla into Unicode mode with the document treated as a sequence of characters expressed in UTF-8. The text is converted to the platform's normal Unicode encoding before being drawn by the OS and thus can display Hebrew, Arabic, Cyrillic, and Han characters.

You will have to examine the byte you retrieve with SCI_GETCHARAT(pos) and, depending on the top bits of that, maybe read SCI_GETCHARAT(pos+1) and beyond in order to get the Unicode code point. (See here.)

Edit:

For some C++ code that does this, see below (search for SciMoz::GetWCharAt):

http://vacuproj.googlecode.com/svn/trunk/npscimoz/npscimoz/oldsrc/trunk.nsSciMoz.cxx

sorin · Answer

I was long time ago but if I remember well Scintilla is not a native Unicode application. Still it has some Unicode support.

First, the function name should SCI_GETBYTEAT, because it returns a byte from UTF-8 internal buffer.

Also, the application has Unicode support for keybaord, so it has some Unicode support :)

Scintilla Supports Unicode? What about SCI_GETCHARAT?

Tags:

unicode

scintilla

user541686

2 Answers

Martin Stone

sorin

Recent Activity

Donate For Us

Scintilla Supports Unicode? What about SCI_GETCHARAT?

Tags:

unicode

scintilla

user541686

2 Answers

Martin Stone

sorin

Related questions

Recent Activity

Donate For Us