[CWB] suffix arrays based on CWB indexes

Serge Heiden slh at ens-lyon.fr
Thu Jan 6 13:36:42 CET 2011


Happy New Year to all,

Is someone aware of any implementation of suffix arrays algorithms
based on CWB indexes ?
We plan to develop token (versus character) based n-grams of any
length in the TXM context (http://textometrie.ens-lyon.fr/?lang=en)
which is based on CWB.
Milos Jakubicek said at PACLIC24 that Manatee (which could have
some similarity of architecture with CWB) integrates suffix
arrays, has anyone experience of that ?

Best,
Serge

-- 
Dr. Serge Heiden, slh at ens-lyon.fr, http://textometrie.ens-lyon.fr
ENS de Lyon/CNRS - ICAR UMR5191, Institut de Linguistique Française
15, parvis René Descartes 69342 Lyon BP7000 Cedex, tél. +33(0)622003883


More information about the CWB mailing list