Re: Mac text files and gets()
- Posted by CChris <christian.cuvier at agricul?ure.gouv?fr> Sep 14, 2007
- 533 views
Derek Parnell wrote: > > CChris wrote: > > The point is to make isolated \r (and possibly ASCII 11 and 12) > > be _also_ recognised as line terminators, not instead, not > > anything more. > . . . > > This is the only thing I'm attempting to fix. All text editors > > transparently read all three formats nowadays. > > I'm with you on this one Chris. The Macintosh line ending is a commonly used > method and will become even more apparent. Because we are stuck with the > inconsistent > ways that ASCII text is line-delimited, it makes sense to have Euphoria deal > with this to ease the burden (not to mention re-inventing the wheel aspect). > > I'm not so sure that we need to tackle Unicode text or EBCDIC text yet though > > > -- > Derek Parnell > Melbourne, Australia > Skype name: derek.j.parnell There is probably a need for an Unicode variant of get.e, say uget.e; but this is a completely different matter, which my current proposal does not address at all. This would go with allowing Unicode chars in identifiers when there's a BOM at start of file, etc. For EBCDIC... is it still in use at all? Since EBCDIC is - as far as I know - another 8-bit encoding of mostly the same characters as ASCII, reading EBCDIC files just requires a transcoding routine
function from_EBCDIC(integer c) return translator[c] end function
, which could at most become standard, but wouldn't gain in efficiency by being builtin. Unless I misssed some key specificity. CChris