Re: Mac text files and gets()


Derek Parnell wrote:
> 
> CChris wrote:
> > The point is to make isolated \r (and possibly ASCII 11 and 12) 
> > be _also_ recognised as line terminators, not instead, not
> > anything more.
>  . . .
> > This is the only thing I'm attempting to fix. All text editors
> > transparently read all three formats nowadays.
> 
> I'm with you on this one Chris. The Macintosh line ending is a commonly used
> method and will become even more apparent. Because we are stuck with the
> inconsistent ways that ASCII text is line-delimited, it makes sense to have
> Euphoria deal with this to ease the burden (not to mention re-inventing the
> wheel aspect).
> 
> I'm not so sure that we need to tackle Unicode text or EBCDIC text yet though ;-)
> 
> -- 
> Derek Parnell
> Melbourne, Australia
> Skype name: derek.j.parnell

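To make the proposal quoted above concrete, here is a minimal sketch of a line reader that accepts \n, \r\n and an isolated \r as terminators. It is only an illustration built on getc(), not the actual change to gets(): the name gets_any and the single shared lookahead variable are my own assumptions, and unlike gets() it returns the line without its terminator.

```euphoria
-- Hypothetical helper, not part of the standard library.
-- Returns the next line without its terminator, or -1 at end of file.
-- Limitation: one shared lookahead, so read only one file at a time.
integer pending      -- character read ahead after a '\r'; -2 means "none"
pending = -2

function gets_any(integer fn)
    integer c
    sequence line
    line = ""
    if pending != -2 then
        c = pending          -- reuse the character consumed after a '\r'
        pending = -2
    else
        c = getc(fn)
    end if
    if c = -1 then
        return -1            -- end of file, nothing left to return
    end if
    while c != -1 and c != '\n' and c != '\r' do
        line &= c
        c = getc(fn)
    end while
    if c = '\r' then
        c = getc(fn)         -- look one character ahead
        if c != '\n' then
            pending = c      -- isolated \r: keep the character for the next call
        end if
    end if
    return line
end function
```

ASCII 11 and 12 could be treated the same way by adding them to the while condition.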
There is probably a need for a Unicode variant of get.e, say uget.e; but this
is a completely different matter, which my current proposal does not address at
all. This would go with allowing Unicode chars in identifiers when there's a BOM
at the start of the file, etc.

For EBCDIC... is it still in use at all?
Since EBCDIC is - as far as I know - another 8-bit encoding of mostly the same
characters as ASCII, reading EBCDIC files just requires a transcoding routine
such as

    function from_EBCDIC(integer c)
        return translator[c + 1]  -- sequences are 1-based, byte values start at 0
    end function

where translator is a 256-element lookup sequence. Such a routine could at most
become part of the standard library, but wouldn't gain in efficiency by being
built in. Unless I missed some key specificity.

CChris
