Logo Questions Linux Laravel Mysql Ubuntu Git Menu
 

Where are the fields documented for the unicode.org file "UnicodeData.txt"? [closed]

Tags:

unicode

I cannot find documentation for the actual fields of the UnicodeData.txt file.

The data is available here. The document describing it is available here but it doesn't list the actual field numbers and what the field is (like used to be in the document around version 3.0).

I've searched the site and must be missing something that is right in front of my eyes, but I can't find it.

Can someone point out where this information is now?

like image 695
brofield Avatar asked Apr 16 '09 06:04

brofield


People also ask

What are the fields in the Unicode character database?

Each line of data consists of fields separated by semicolons. The fields are numbered starting with zero. The first field (0) of each line in the Unicode Character Database files represents a code point or range. The remaining fields (1..n) are properties associated with that code point.

What's the unicodedata file?

This document describes the format and content of the UnicodeData.txt file in the Unicode Character Database (UCD). The file and the files described herein are part of the Unicode Character Database and governed by the UCD Terms of Use given below.

What is the Unicode data file (CD)?

The data files define the Unicode character properties and mappings between Unicode characters (such as case mappings). This annex describes the UCD and provides a guide to the various documentation files associated with it. Additional information about character properties and their use is contained in the Unicode Standard and its annexes.

Is there a Unicode database in Python?

unicodedata — Unicode Database — Python 3.9.6 documentation unicodedata — Unicode Database ¶ This module provides access to the Unicode Character Database (UCD) which defines character properties for all Unicode characters. The data contained in this database is compiled from the UCD version 13.0.0.


1 Answers

update

sorry, I misread the question. Still, I think the information is in the link you provided, under section UnicodeData.txt. For each field, a link inside the document lists its values if applicable. Seems to be the same list as in the 3.0 version.

like image 50
devio Avatar answered Sep 22 '22 00:09

devio