I cannot find documentation for the actual fields of the UnicodeData.txt file.
The data is available here. The document describing it is available here, but it doesn't list the actual field numbers and what each field is (as the document did around version 3.0).
I've searched the site and must be missing something that is right in front of my eyes, but I can't find it.
Can someone point out where this information is now?
Each line of data consists of fields separated by semicolons. The fields are numbered starting with zero. The first field (0) of each line in the Unicode Character Database files represents a code point or range. The remaining fields (1..n) are properties associated with that code point.
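The layout described above can be sketched as follows. This is a minimal illustration, not an official parser: the field names in `FIELD_NAMES` are my reading of the UnicodeData.txt section of the UCD documentation and should be checked against it, `parse_line` is a hypothetical helper, and the sample line is the entry for U+0041.

```python
# Field names are an assumption based on the UnicodeData.txt section of the
# UCD documentation; verify them against the current version of that document.
FIELD_NAMES = [
    "code_point",                 # field 0
    "name",                       # field 1
    "general_category",           # field 2
    "canonical_combining_class",  # field 3
    "bidi_class",                 # field 4
    "decomposition",              # field 5 (type and mapping)
    "decimal_value",              # field 6
    "digit_value",                # field 7
    "numeric_value",              # field 8
    "bidi_mirrored",              # field 9
    "unicode_1_name",             # field 10
    "iso_comment",                # field 11
    "simple_uppercase_mapping",   # field 12
    "simple_lowercase_mapping",   # field 13
    "simple_titlecase_mapping",   # field 14
]


def parse_line(line):
    """Split one UnicodeData.txt line on semicolons and label each field."""
    fields = line.rstrip("\n").split(";")
    return dict(zip(FIELD_NAMES, fields))


# The UnicodeData.txt entry for U+0041; empty fields are empty strings.
sample = "0041;LATIN CAPITAL LETTER A;Lu;0;L;;;;;N;;;;0061;"
record = parse_line(sample)

# Print each field with its zero-based number.
for number, (name, value) in enumerate(record.items()):
    print(number, name, repr(value))
```

Field 0 is a hexadecimal string, so `int(record["code_point"], 16)` converts it to an integer code point.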
This document describes the format and content of the UnicodeData.txt file in the Unicode Character Database (UCD). The file and the files described herein are part of the Unicode Character Database and governed by the UCD Terms of Use given below.
The data files define the Unicode character properties and mappings between Unicode characters (such as case mappings). This annex describes the UCD and provides a guide to the various documentation files associated with it. Additional information about character properties and their use is contained in the Unicode Standard and its annexes.
From the Python 3.9.6 documentation for the unicodedata module (Unicode Database): This module provides access to the Unicode Character Database (UCD) which defines character properties for all Unicode characters. The data contained in this database is compiled from the UCD version 13.0.0.
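As a quick way to inspect the same properties without parsing the file, the standard-library `unicodedata` module exposes several of them through functions. The sketch below uses only functions that exist in that module; the mapping of each function to a UnicodeData.txt field number in the comments is my own reading, not something the Python documentation states.

```python
import unicodedata

ch = "A"  # U+0041

# Each call looks up one property of the character. The field numbers in the
# comments are an assumed correspondence to UnicodeData.txt.
properties = {
    "name": unicodedata.name(ch),                    # field 1
    "category": unicodedata.category(ch),            # field 2
    "combining": unicodedata.combining(ch),          # field 3
    "bidirectional": unicodedata.bidirectional(ch),  # field 4
    "decomposition": unicodedata.decomposition(ch),  # field 5
    "mirrored": unicodedata.mirrored(ch),            # field 9
}

for key, value in properties.items():
    print(key, repr(value))

# Numeric properties raise ValueError for non-numeric characters unless a
# default is supplied as the second argument.
print(unicodedata.decimal("9"))
print(unicodedata.numeric(ch, None))

# Version of the UCD that this Python build was compiled against.
print(unicodedata.unidata_version)
```

The module reports whichever UCD version ships with the interpreter, so results for recently added characters can differ from the latest UnicodeData.txt.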
Update
Sorry, I misread the question. Still, I think the information is in the link you provided, under the section UnicodeData.txt. For each field, a link inside the document lists its values, if applicable. It seems to be the same list as in the 3.0 version.