Re: UTF-8

Vinoba said...

I think 31 bits is OK, but I will look at it again and report back in detail. As a quick comment, the absence of the highest bit in 4 bytes might only affect some (hopefully minor) East Asian languages. Of course, with 63 or 64 bits we will be able to accommodate all the planetary and many of the trans-universe languages.

To be clear, inside a sequence (or a declared atom), the promotion from integer to double is automatic and mostly transparent.
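For anyone who wants to check the ranges, here is a rough sketch in Python (not Euphoria). It assumes the 31-bit integer type spans -2^30 to 2^30 - 1 and that an atom is an IEEE 754 double; the helper names are made up for illustration.

```python
# Assumption: the 31-bit integer type spans -2**30 .. 2**30 - 1.
EU_INT_MIN = -(2 ** 30)
EU_INT_MAX = 2 ** 30 - 1

# Highest code point defined by Unicode (needs only 21 bits).
UNICODE_MAX = 0x10FFFF


def fits_eu_integer(value):
    """True if value fits the assumed 31-bit integer range."""
    return EU_INT_MIN <= value <= EU_INT_MAX


def survives_double(value):
    """True if an integer round-trips through a double unchanged."""
    return int(float(value)) == value


if __name__ == "__main__":
    print(fits_eu_integer(UNICODE_MAX))  # every code point fits without promotion
    print(fits_eu_integer(2 ** 30))      # one past the assumed maximum
    print(survives_double(2 ** 31))      # still exact after promotion to a double
```

Under those assumptions, every Unicode code point fits in the integer range with room to spare, and anything larger stays exact as a double up to 2^53.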

Vinoba said...

I was wondering if you have looked at Microsoft's intermediate solution (tchar), and now wchar and things like ...MessageW() etc., as a good migration solution.

I will try to look at the Unicode branch you mentioned above, and see what goodies you have for me there. I want more than a kid gets going Halloweening!

I have used some of Microsoft's Unicode stuff when using COM, since you pretty much have to. My library does all of the conversions for you, but that would be simple enough not to, especially now that we have things like poke2() built in.
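To give an idea of what such a conversion involves, here is a minimal sketch in Python (not Euphoria, and not the code from my library). It turns code points into 16-bit UTF-16 units and packs them little-endian, which is roughly the job a run of poke2() calls would do. The function names are hypothetical, and struct.pack stands in for writing to memory.

```python
import struct


def utf16_units(code_point):
    """Return the UTF-16 code units (one or two) for a single code point."""
    if not 0 <= code_point <= 0x10FFFF or 0xD800 <= code_point <= 0xDFFF:
        raise ValueError("not a Unicode scalar value: %r" % (code_point,))
    if code_point < 0x10000:
        return [code_point]  # Basic Multilingual Plane: one unit
    offset = code_point - 0x10000
    # Supplementary planes: high surrogate, then low surrogate.
    return [0xD800 + (offset >> 10), 0xDC00 + (offset & 0x3FF)]


def to_wide_buffer(text):
    """Pack text as little-endian 16-bit units with a terminating zero."""
    units = []
    for ch in text:
        units.extend(utf16_units(ord(ch)))
    units.append(0)  # wide-character routines expect a 16-bit terminator
    return struct.pack("<%dH" % len(units), *units)


if __name__ == "__main__":
    print(to_wide_buffer("Hi"))
    print([hex(u) for u in utf16_units(0x1F600)])
```

The only non-obvious part is the surrogate pair for code points above 0xFFFF; everything else is one 16-bit write per character.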

At one point in the past, I actually did play around with win32lib, changing from the ANSI to the wide-character routines, and displayed non-ASCII text. From the coding standpoint, it was pretty straightforward.

Matt
