chiark / gitweb /
fix regexp not to loop stupidly:
authorIan Jackson <ijackson@chiark.greenend.org.uk>
Mon, 9 Sep 2013 11:36:33 +0000 (12:36 +0100)
committerIan Jackson <ijackson@chiark.greenend.org.uk>
Mon, 9 Sep 2013 11:36:33 +0000 (12:36 +0100)
commitea225eac1b0e3d2b85e092b8ea457031efcd5ee4
tree7fd36113c940c0955e02cb5ccfd7851696f32dc0
parentddcf3796d38339cec7473d9189a97c41e5feb02e
fix regexp not to loop stupidly:
1. the regexp was too loose and matched /in/ not just /\bin\b/.
2. chiark.peer.fu-berlin.de consists mostly of stopwords by this rule
3. A bug meant that when it got to the end, it didn't stop, but always ate the TLD as if it were a stopword.
cgi