On Wed, Aug 29, 2001 at 05:58:15PM +0900, ts wrote:
> 
> pigeon% cat aa.html
> <!DOCTYPE HTML PUBLIC "-//IETF//DTD HTML//EN">
> <HTML>
> <HEAD>
>  <TITLE>aa</TITLE>
> </HEAD>
> <BODY>
> 
> Numbers 2<1
> 
> </BODY>
> </HTML>
> pigeon% 
> 
>  There are 2 errors in this script
>   * he wants to validate an HTML document, and a regexp is not the right tool
> for this 

validating:     not really

using a regexp: Until minutes ago I didn't know the above was valid
                HTML. If the "2<1" weren't legal, I wouldn't know
                what is so bad about a regexp. Of course it is not a
                validator. But the fuzziness the regexp introduces
                resembles the fuzziness most browsers use when
                interpreting webpages quite well (even though not
                perfectly).
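
To make the "2<1" case concrete, here is a minimal sketch of what a
naive tag-stripping regexp does with the body of aa.html. The pattern
`<[^>]*>` and the name `strip_tags` are assumptions chosen for
illustration; the script under discussion is not quoted in this message
and may use a different pattern.

```python
import re

# Hypothetical naive pattern: "<", then anything up to the next ">".
# This is an illustration, not the pattern from the original script.
TAG = re.compile(r"<[^>]*>")


def strip_tags(html: str) -> str:
    """Remove everything the naive pattern considers a tag."""
    return TAG.sub("", html)


# Body of the aa.html example quoted above.
sample = "<BODY>\n\nNumbers 2<1\n\n</BODY>\n"

# The bare "<" in "2<1" opens a bogus "tag" that runs on to the ">" of
# "</BODY>", so the "1" is swallowed together with the closing tag.
print(repr(strip_tags(sample)))
```

With this input the sketch keeps "Numbers 2" and loses the "<1", which
is the kind of fuzziness meant above: wrong for this valid document,
but similar in spirit to how a forgiving browser guesses at markup.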

>   * it tries to separate comments from the rest of the document, i.e. the
> biggest error is here
> 
> #########################################################
> # first we'll shoot all the <!-- comments -->
> #########################################################

What is so bad about that?

-- 
marko schulz