Re: Just when you thought it might be interesting to standardize robots.txt...

Klaus Johannes Rusch (e8726057@student.tuwien.ac.at)
Wed, 4 Dec 1996 23:26:42 CET


In <2.2.32.19961204195829.006c2074@intrinsa.com>, dbakin@intrinsa.com (Dave Bakin) writes:
> Haven't heard this discussed yet, but check out
> http://www.packet.com/packet/96/49/index2a.html for a discussion of a
> proposed "sitelist.txt" from InfoSeek. -- Dave

Similar ideas have been discussed before. I think robots.txt and sitelist.txt
have different goals and can coexist.

robots.txt is for robots that want to index pages themselves, following their
own design principles, and need guidance on where not to go. There is not much
potential for abuse in robots.txt.
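
As a rough illustration of the "where not to go" guidance, here is a minimal
sketch of how a robot might consult robots.txt before fetching a page. It uses
Python's standard urllib.robotparser module; the robots.txt contents, the
"ExampleBot" agent name, the example.com URLs and the allowed() helper are all
made up for the example.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt for an imaginary site: every robot is asked
# to stay out of /private/.
ROBOTS_TXT = """\
User-agent: *
Disallow: /private/
"""


def allowed(url, agent="ExampleBot"):
    """Return True if the rules above permit `agent` to fetch `url`."""
    parser = RobotFileParser()
    # Parse the rules from memory; a real robot would download
    # /robots.txt from the site it is about to visit.
    parser.parse(ROBOTS_TXT.splitlines())
    return parser.can_fetch(agent, url)


print(allowed("http://www.example.com/index.html"))
print(allowed("http://www.example.com/private/notes.html"))
```

The file only tells a robot what to skip; the robot still decides for itself
what to index from the pages it does fetch.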

sitelist.txt is for robots that trust the owner of a site to compile relevant
summary information. There is plenty of potential for abuse in sitelist.txt
(similar to adding unrelated keywords all over the pages).

Klaus Johannes Rusch

--
e8726057@student.tuwien.ac.at, KlausRusch@atmedia.net
http://www.atmedia.net/KlausRusch/
_________________________________________________
This message was sent by the robots mailing list. To unsubscribe, send mail
to robots-request@webcrawler.com with the word "unsubscribe" in the body.
For more info see http://info.webcrawler.com/mak/projects/robots/robots.html