I wonder whether this kind of library exists.
lua] supports all 5.3 string functions for UTF-8.
In order to convert a String into UTF-8, we use the getBytes() method in Java. The getBytes() method encodes a String into a sequence of bytes and returns a byte array. where charsetName is the specific charset by which the String is encoded into an array of bytes.
slnunicode is part of the collection of general purpose lua support libraries developed for the Selene database project.
It's also available as a luarock
Lua strings are a sequence of bytes. When you store UTF8 text in them, you're already storing "UTF8 bytes". You can get the bytes like with all other strings, using string.byte(s,i,j)
:
local bytes = { string.byte(unicodeString, 1,-1) }
Now bytes is a table containing your "UTF8 bytes".
More information about string.byte
and UTF8 in Lua is available at:
Standard Lua string library
Lua 5.3 standard utf8 library
Presentation by Roberto Ierusalimschy (one of the creators of Lua) on the future of Lua, which talks about many things and one of them is UTF8 support. It was released before UTF8 support was built into Lua.
Lua 5.3 has UTF-8 support in the standard library now.
For example, to get a UTF-8 string's code points:
for p, c in utf8.codes("瑞&于") do
print(c)
end
Output:
29790
38
20110
If you love us? You can donate to us via Paypal or buy me a coffee so we can maintain and grow! Thank you!
Donate Us With