[CWB] Encoding problem in CWB version 3.2.4

Tomaž Erjavec tomaz.erjavec at ijs.si
Mon Sep 27 12:35:50 CEST 2010


Hi,
this is probably some silly error, but I can't figure out where I made 
it. I tried using cwb-3.2.4 and made the corpus ok. The corpus is in 
utf-8 and I used cwb-encode -c utf8
But when I try using it:

[tomaz at mantra ~]$ cqp
[no corpus]> JOS100K-EN;
JOS100K-EN> "kaj";
CQP Error:
         Query includes a character or character sequence that is invalid
in the encoding specified for this corpus
JOS100K-EN>

which is strange, and the query uses only ASCII. Any hints?

Also, I noticed that if no corpus is selected, cqp dies:

[tomaz at mantra ~]$ cqp
[no corpus]> "kdo";
CQP Error:
         No corpus activated
Segmentation fault
[tomaz at mantra ~]$

Best,
Tomaž


More information about the CWB mailing list