Probably the first step would be a good, non-regular-expression (IIRC, URLs are not regular) based link parser that's ported to every possible language on the planet.
Albeit the typical first reaction might be that of XKCD 927 (I must confess it was mine, at least), I believe that the regular expressions area is territory that could see some innovation. If told to pick just one feature from eggex, I'd choose the ability to parse statically!
FWIW I think this deserves a submission of its own, so here it is:
It's been a long time coming, but I believe very strongly in it.
"pain in the ass to integrate" -- I can sympathize. I'm working tirelessly to make this as easy to integrate as regexes are in Perl, Python, Ruby, etc.
http://perl.plover.com/NPC/NPC-3SAT.html
reply