REBOL 3.0

Collation table and sorting

Carl Sassenrath, CTO
REBOL Technologies
13-Feb-2008 17:02 GMT

Article #0006
Main page || Index || Prior Article [0005] || Next Article [0007] || 7 Comments || Send feedback

One of the final parts of the R3 Unicode implementation is to define the sort order for strings.

The sort order will be determined by a simple collation table that for 3.0 assumes strings are normalized into single codepoint representations. Later, for 3.* we can add other methods.

The standard R3 distribution will contain the collation table for the "western" charsets, including latin-1, etc. We can do the same for some of the other smaller eastern charsets, and I am open to others, as proposed and defined by the REBOL community (RC). I will admit I know nothing about the sort order of Chinese or other Asian codepoints. (But, I know that some of you do!)

Once booted, R3 will allow programs to set their own collation table.

Question:

Do you think this method is sufficient for 3.0 or am I missing something important?

7 Comments

REBOL 3.0
Updated 16-Jul-2013 - Edit - Copyright REBOL Technologies - REBOL.net