Re: UTF-8
- Posted by mattlewis (admin) Mar 10, 2011
- 2250 views
I think 31 bits is OK, but i will look at it again and report back in detail. As a quick comment, the absence of the higher-most bit in 4 bytes might only affect some (hopefully minor) East Asian languages. Of course with 63 or 64 bits we will be able to accommodate all the Planetary and many of the trans-universe languages
To be clear, inside a sequence (or a declared atom), the promotion from integer to double is automatic and mostly transparent.
I was wondering if you have looked at Microsoft's intermediate solution (tchar) and now wchar and things like ...MessageW() etc, as a good migration solution.
I will try and look at the Unicode branch you mentioned above, and see what goodies you have for me there. I want more than a kid gets going halooweening!
I have used some of Microsoft's Unicode stuff when using COM, since you pretty much have to. My library does all of the conversions for you, but that would be simple enough not too, especially now that we have things like poke2() built-in.
At one point in the past, I actually did play around with win32lib, changing from the ANSI to Wide character routines, and displayed non-ASCII text. From the coding standpoint, it was pretty straight forward.
Matt