document updated 13 years ago, on Jun 23, 2011
Modules that do some sort of HTML ⇒ text conversion.
(there are lots of ways to do this sort of thing, including piping it through w3m/lynx.... but there many different approaches, some of them rather smart, some rather dumb)
Extract targetted data
Ways to extract bits of an HTML file, without obliterating all tags.