On 30 Dec 2005, at 18:49, Eric Baumgartner wrote:
Theo wrote:
By the way Eric, UnicodeData.txt doesn't contain all the unicode
data. There is some extra files needed for other text operations.
For the purposes of this thread -- filtering non-alphanumerics -- I
believe UnicodeData.txt is sufficient.
OK :)
If you have evidence otherwise, please cite the specific files that
would be needed.
No argument in this specific case. But now you've changed the case of
the conversation.
The sentance I replied to, you were talking about a more general
case. My Unicode upper casing code for example needed
SpecialCasing.txt and UnicodeData.txt. This project of mine
(UnicodeStuff.rb) can uppercase ß into SS, something RB can't do (as
well as a few hundred other characters not mapped by RB). Mine does
case operations around the same speed as RB, only it does more than
RB does, it's closer to the Unicode standard. (Although I don't know
Unicode well enough to say that it is exactly to the standard.)
_______________________________________________
Unsubscribe or switch delivery mode:
<http://www.realsoftware.com/support/listmanager/>
Search the archives of this list here:
<http://support.realsoftware.com/listarchives/lists.html>
|