Accented characters in identifiers
- Posted by CChris <christian.cuvier at agricultur?.?ouv.fr> May 27, 2008
- 797 views
Currently, if you use characters in the 128-255 range in identifiers, you will get incongruous error messages, like "Result of a function must be assigned" because you used a รณ. This comes from the shrouding method Euphoria had been using long ago. Rob himself admitted supporting was becoming obsolte. Implementationwise, the move is simple: change the character class of all those chars from KEYWORD or BUILTIN to LETTER in the scanner. Nothing else (a couple if branches and constants will become dead code). Since characters that display as a letter in some code page may display differently on another, I think including the whole 128..255 range as valid characters is better than restricting it. If a char is valid somewhere, it must be valid anywhere, even if it displays funny. What do you think? CChris