Re: euphoria text processing
- Posted by DerekParnell (admin) Jun 02, 2013
- 2040 views
Hello all.
I am considering stepping into euphoria .. i have a 64 bit machine, so instead of linking the 32 bit stuff, i iwll go with the bleeding edge version.
but before i do so, would anyone like to give me a taste of text processing (e.g. processing raw LaTex text, or mathematical expressions, written with unicode) in euphoria?
thanks a bunch. s
What do mean by "text processing"? That term can mean a whole lot of things to different people.
In Euphoria, all characters are stored as UTF-32 integers, however the standard library routines to read and write text from files still only work in 8-bit characters. You would need to get hold of routines to read/write Unicode characters from files.
However, once you have the text data in a sequence, you can easily "process" it. The big exceptions to this at the moment are that certain standard library routines don't recognise language specific characteristics when it comes to collating, upper-lower case conversion, right-to-left text direction, etc ... What is needed is an API to be developed for Euphoria to use established libraries for Unicode processing, such as IBM's ICU