[CWB] CQP bug report?

Eros Zanchetta eros at sslmit.unibo.it
Thu Feb 26 15:45:54 CET 2009


Hi Stefan!

> Is there any way for me to get access to the corpus (or, even better,
> the server on which it is installed), so I can test it with a debug
> build of CQP? 

Sure, I'll send you a private e-mail with the details of your new
account on the server.

> It's quite likely that the error happens while scanning the lexicon
> file, i.e. before the query is actually executed on the corpus, and
> that it has to do with the implementation of %c/%d flags (which store
> the normalised string in a fixed-size buffer).  Could you quickly
> check the following two things on ITWAC or the problematic subset
> ITWAC_20, please?
>
> cwb-lexdecode -p cane -c -d ITWAC_20

Segmentation fault

> cwb-lexdecode ITWAC_20 | perl -nle '$l = length($_); $max = $l if $l > $max; END{print $max}'

I'm not exactly sure what this command does, but the result I get is
7000. If that is the length of the longest token, then maybe this is the
problem.
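
For reference, the Perl one-liner quoted above just reports the length of
the longest line that cwb-lexdecode prints. A rough awk equivalent is
sketched below; the helper name longest_line is made up for illustration,
and it assumes one lexicon entry per output line. Depending on the locale,
awk may count characters rather than bytes, so the figure can differ from
Perl's for non-ASCII tokens.

```shell
# Hypothetical helper (not part of CWB): read lines from stdin and
# print the length of the longest one.
longest_line() {
    # "max + 0" forces numeric output, so empty input prints 0.
    awk '{ n = length($0); if (n > max) max = n } END { print max + 0 }'
}
```

It would be used the same way as the original pipeline, e.g.
cwb-lexdecode ITWAC_20 | longest_line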

E

