parsing HTML

new topic     » topic index » view thread      » older message » newer message

Can I parse HTML pages with Thomas Parslows's XML library? Will it work
cause HTML pages don't have so strict syntax like XML, for example: <P> tag
doesn't need closing </P> tag ...
That XML library would be ideal because it allows that you pass it XML data
by parts and I am not reading whole HTML site but reading it by pieces.

Or is there some other library for parsing HTML pages?
I need to get title of page and extract all links on page, that's for now,
something else might come up later.

Tone ©koda

new topic     » topic index » view thread      » older message » newer message

Search



Quick Links

User menu

Not signed in.

Misc Menu