Bryan Hackney
bbh@xenodata.com
----- Begin Included Message -----
From: Jaakko Hyvätti <Jaakko.Hyvatti@iki.fi>
Subject: Re: More Robot Talk
Nice to have something of a technical nature for a change here.
The heuristics for URL canonicalization presented here do not yet take
into account the new HTTP/1.1 Host: request header. Matters get more
complicated once servers actually run multiple virtual hosts on a
single IP address, differentiated by Host: headers or by absolute
URIs in the request line (see RFC 2068).
...
----- End Included Message -----
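
To illustrate the point about Host: headers, here is a rough sketch, not
anything from the original posts. It assumes a robot should key its
canonical URLs on the hostname rather than on the resolved IP address,
since two names on one address may be different virtual hosts. The names
canonical_key and build_request are hypothetical, and port 80 as the
default is an assumption for plain http URLs.

```python
from urllib.parse import urlsplit

def canonical_key(url, default_port=80):
    """Return a (host, port, target) key that keeps the hostname.

    Assumption: URLs that share an IP address but differ in hostname may
    belong to different virtual hosts, so the hostname identifies the site.
    """
    parts = urlsplit(url)
    host = (parts.hostname or "").lower()   # hostnames are case-insensitive
    port = parts.port or default_port
    target = parts.path or "/"
    if parts.query:
        target += "?" + parts.query
    return (host, port, target)

def build_request(url):
    """Build a minimal HTTP/1.1 GET request that carries a Host: header."""
    host, port, target = canonical_key(url)
    # The port is included in Host: only when it is not the default.
    host_header = host if port == 80 else f"{host}:{port}"
    return (f"GET {target} HTTP/1.1\r\n"
            f"Host: {host_header}\r\n"
            "Connection: close\r\n\r\n")
```

With this keying, http://a.example.com/x and http://b.example.com/x stay
distinct even if both names resolve to the same address, which a
canonicalizer that collapses hostnames to IP addresses would get wrong.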
_________________________________________________
This message was sent by the robots mailing list. To unsubscribe, send mail
to robots-request@webcrawler.com with the word "unsubscribe" in the body.
For more info see http://info.webcrawler.com/mak/projects/robots/robots.html