[Gllug] sed question: hyperlinking URLs

Tethys sta296 at astradyne.co.uk
Thu Jan 25 01:40:54 UTC 2007


Joel Bernstein writes:

>I'm not convinced that's correct. The first regex I wrote used a (?!\.)
>negative lookahead assertion (in Perl) but I realised it didn't need to
>be anything like as complex. Requiring the last character to be a word
>character is enough for this particular example, since no TLD that I am
>aware of ends in anything else. He won't get [,=+/% ...] in the domain
>name either, so the original character class is overly complex.

True, no domain name should end in anything but [-0-9A-Za-z] according
to RFC1035. However, with the (shortsighted IMHO) onset of full i18n
domain names, that may no longer be true for much longer. Also, a URL
doesn't have to end at the domain name. Many will also have a path
component.

Tet
-------------- next part --------------
-- 
Gllug mailing list  -  Gllug at gllug.org.uk
http://lists.gllug.org.uk/mailman/listinfo/gllug


More information about the GLLUG mailing list