Re: Keyword indexing

Brian Ulicny (ulicny@limbex.com)
Wed, 19 Jun 96 01:16:06 -0700 (PDT)


>>Maintaining a list of important keywords by hand is probably not a good
>>idea. A better idea would be to use machine learning techniques to
>>automatically classify pages as relevant to computer science (or whatever
>>topic). Look at Gerard Salton's work for starters.
>
>I've never heard of machine learning techniques for classification by subject.
>Is the work of the person you mentioned, Gerard Salton, available on the Web?
>Or is he the author of a book? If you happen to know the name of the book,
>or some way of finding it, I'd be very appreciative.

The late Gerard Salton was a professor at Cornell. There is a very thorough
bibliography at:
http://www.informatik.uni-trier.de/%7Eley/db/indices/a-tree/s/Salton:Gerard.html
A lot of his papers are available online.

You might also try looking at recent volumes of the SIGIR proceedings under
the rubric "automatic classification", "document clustering", etc.
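
To give a rough flavor of what "automatic classification" can mean in
practice, here is a minimal sketch in the vector-space style associated with
Salton's work: documents become term-weight vectors, and a new page gets the
label of the most similar class. Everything specific here is an assumption
made for illustration (the function names, the tf-idf weighting variant, the
per-class centroid, the toy training data); it is not a method taken from any
particular paper.

```python
import math
import re
from collections import Counter


def tokenize(text):
    # Lowercase and keep alphabetic runs only; a real system would
    # also remove stop words and apply stemming.
    return re.findall(r"[a-z]+", text.lower())


def cosine(a, b):
    # Cosine similarity between two sparse term-weight vectors.
    dot = sum(w * b.get(t, 0.0) for t, w in a.items())
    na = math.sqrt(sum(w * w for w in a.values()))
    nb = math.sqrt(sum(w * w for w in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def train(labeled_docs):
    # labeled_docs: list of (label, text) pairs (hypothetical format).
    # Returns idf weights and one summed tf-idf vector per label.
    n = len(labeled_docs)
    df = Counter()
    for _, text in labeled_docs:
        df.update(set(tokenize(text)))
    idf = {t: math.log(n / df[t]) + 1.0 for t in df}
    centroids = {}
    for label, text in labeled_docs:
        vec = centroids.setdefault(label, Counter())
        for t, c in Counter(tokenize(text)).items():
            vec[t] += c * idf[t]
    return idf, centroids


def classify(text, idf, centroids):
    # Weight the known terms of the new text, then pick the label whose
    # class vector is closest by cosine similarity.
    counts = Counter(tokenize(text))
    vec = {t: c * idf[t] for t, c in counts.items() if t in idf}
    return max(centroids, key=lambda label: cosine(vec, centroids[label]))


# Toy training data, invented purely for this example.
TRAINING = [
    ("cs", "compiler algorithm data structure programming language"),
    ("cs", "operating system kernel algorithm memory"),
    ("cooking", "recipe oven bake flour sugar"),
    ("cooking", "sauce recipe simmer garlic onion"),
]

IDF, CENTROIDS = train(TRAINING)
```

With this toy data, calling classify("a new sorting algorithm for the
compiler", IDF, CENTROIDS) picks the "cs" class, because only that class
shares any weighted terms with the text. The point is that the "keyword list"
is learned from labeled example pages rather than maintained by hand.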

Most of this stuff is easily found on the Web using search engines like Alta
Vista.

Best,

Brian Ulicny

Limbex Corporation                 V/mail: (310) 309-4281 x4505
13160 Mindanao Way, Suite 234      Fax:    (310) 309 4282
Marina Del Rey, CA 90292 USA       URL:    http://www.limbex.com/