Indexing two-byte text

Harry Munir Behrens (behrens@mtl.t.u-tokyo.ac.jp)
Wed, 06 Dec 1995 15:43:23 +0900


Hello there,

here at the Univ. of Tokyo we are currently installing Harvest and were
wondering if anybody has experience with the problems encountered
when indexing Japanese text. (no word boundaries, two-byte code etc.)
I would be very grateful for any help pointing me to an international
version of agrep/glimpse or something similar.

Cheers,

Harry Behrens
PhD. candidate
Dept. of Electrical Engineering
Univ. of Tokyo
behrens@mtl.t.u-tokyo.ac.jp