Re: Anyone doing a Java-based robot yet?

John D. Pritchard (jdp@cs.columbia.edu)
Mon, 19 Feb 1996 17:02:35 -0500


> It has ftp and http. It does not have (since the BETA):
>

> HTML parsing

net.www.html.Parser
net.www.html.Document

> WWW date handling
> robots.txt parsing/handling
> a UserAgent concept

i think this could be nicely constructed under the net package
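
a rough sketch of what the robots.txt piece might look like. RobotsRules and its methods are hypothetical names, not anything that ships in the JDK, and it assumes only User-agent and Disallow lines with plain prefix matching. it also lumps the "*" record together with any record naming the robot, which a real implementation would probably want to separate:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Locale;

// Hypothetical helper, not part of any JDK package.
class RobotsRules {
    private final List<String> disallowed = new ArrayList<>();

    // Collect Disallow prefixes from every record that applies to userAgent or to "*".
    RobotsRules(String robotsTxt, String userAgent) {
        String agent = userAgent.toLowerCase(Locale.ROOT);
        boolean applies = false;       // does the current record apply to us?
        boolean lastWasAgent = false;  // consecutive User-agent lines share one record
        for (String raw : robotsTxt.split("\r?\n")) {
            int hash = raw.indexOf('#');                       // strip trailing comments
            String line = (hash >= 0 ? raw.substring(0, hash) : raw).trim();
            int colon = line.indexOf(':');
            if (colon < 0) continue;                           // blank or malformed line
            String field = line.substring(0, colon).trim().toLowerCase(Locale.ROOT);
            String value = line.substring(colon + 1).trim();
            if (field.equals("user-agent")) {
                boolean matches = value.equals("*")
                        || (!value.isEmpty() && agent.contains(value.toLowerCase(Locale.ROOT)));
                applies = lastWasAgent ? (applies || matches) : matches;
                lastWasAgent = true;
            } else {
                if (field.equals("disallow") && applies && !value.isEmpty()) {
                    disallowed.add(value);
                }
                lastWasAgent = false;
            }
        }
    }

    // A path is allowed unless it starts with one of the collected prefixes.
    boolean isAllowed(String path) {
        for (String prefix : disallowed) {
            if (path.startsWith(prefix)) return false;
        }
        return true;
    }
}
```

a robot would fetch /robots.txt once per host, build one of these with its own agent name, and call isAllowed(path) before each request.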

> NNTP, Gopher et al.

net.nntp

> It does not allow the UserAgent control over HTTP so one can not
> utilize the information in that protocol. Its URL support is also
> pretty basic.

i would guess that it's good for subclassing and extending

> BTW - It does have urlencode (but not urldecode) see :
> http://www.javasoft.com/JDK-1.0/api/java.net.URLEncoder.html#_top_

java.net.URL.toString()?
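
if nothing in the library does the decoding, it is short enough to write by hand. a minimal sketch, assuming the input is ASCII, that "+" means a space, and that %XX escapes carry UTF-8 bytes. UrlDecodeSketch and decode are made-up names for illustration:

```java
import java.io.ByteArrayOutputStream;
import java.net.URLEncoder;
import java.nio.charset.StandardCharsets;

// Hypothetical helper showing a hand-written counterpart to java.net.URLEncoder.
class UrlDecodeSketch {
    static String decode(String s) {
        ByteArrayOutputStream out = new ByteArrayOutputStream();
        for (int i = 0; i < s.length(); i++) {
            char c = s.charAt(i);
            if (c == '+') {
                out.write(' ');                        // form encoding: '+' stands for a space
            } else if (c == '%') {
                if (i + 2 >= s.length()) {
                    throw new IllegalArgumentException("truncated escape at index " + i);
                }
                int hi = Character.digit(s.charAt(i + 1), 16);
                int lo = Character.digit(s.charAt(i + 2), 16);
                if (hi < 0 || lo < 0) {
                    throw new IllegalArgumentException("bad hex escape at index " + i);
                }
                out.write((hi << 4) | lo);             // one decoded byte
                i += 2;                                // skip the two hex digits
            } else {
                out.write(c);                          // assumes encoded input is plain ASCII
            }
        }
        return new String(out.toByteArray(), StandardCharsets.UTF_8);
    }

    public static void main(String[] args) throws Exception {
        // Round trip through the library encoder and the sketch decoder.
        String encoded = URLEncoder.encode("robots & spiders", "UTF-8");
        System.out.println(encoded + " -> " + decode(encoded));
    }
}
```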

> Java is also slow and, IMHO, not well integrated with databases etc.
> If one wanted to compare it with Perl -- it also lacks text
> parsing.

java.util.StringTokenizer
java.io.StreamTokenizer
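
a small sketch of both tokenizers, to show roughly how far they get you for text parsing. TokenizeSketch, words and sumNumbers are hypothetical names, and the delimiter set is just an assumption for the example:

```java
import java.io.IOException;
import java.io.StreamTokenizer;
import java.io.StringReader;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.List;
import java.util.StringTokenizer;

// Hypothetical demo class; only the two tokenizer classes come from the JDK.
class TokenizeSketch {
    // Split on whitespace, commas and semicolons with java.util.StringTokenizer.
    static List<String> words(String text) {
        List<String> out = new ArrayList<>();
        StringTokenizer st = new StringTokenizer(text, " \t\r\n,;");
        while (st.hasMoreTokens()) {
            out.add(st.nextToken());
        }
        return out;
    }

    // Add up the numeric tokens with java.io.StreamTokenizer, ignoring words.
    static double sumNumbers(String text) {
        StreamTokenizer st = new StreamTokenizer(new StringReader(text));
        double sum = 0;
        try {
            while (st.nextToken() != StreamTokenizer.TT_EOF) {
                if (st.ttype == StreamTokenizer.TT_NUMBER) {
                    sum += st.nval;
                }
            }
        } catch (IOException e) {
            throw new UncheckedIOException(e);  // not expected when reading from a String
        }
        return sum;
    }

    public static void main(String[] args) {
        System.out.println(words("User-agent: *, Disallow: /tmp"));
        System.out.println(sumNumbers("hits 3 misses 4.5"));
    }
}
```

StringTokenizer only splits on delimiter characters, while StreamTokenizer also classifies tokens as words or numbers, so neither is a pattern matcher in the Perl sense.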

-john