[CWB] Encoding problem in CWB version 3.2.4
Tomaž Erjavec
tomaz.erjavec at ijs.si
Mon Sep 27 12:35:50 CEST 2010
Hi,
this is probably some silly error, but I can't figure out where I made
it. I tried using cwb-3.2.4 and made the corpus ok. The corpus is in
utf-8 and I used cwb-encode -c utf8
But when I try using it:
[tomaz at mantra ~]$ cqp
[no corpus]> JOS100K-EN;
JOS100K-EN> "kaj";
CQP Error:
Query includes a character or character sequence that is invalid
in the encoding specified for this corpus
JOS100K-EN>
which is strange, and the query uses only ASCII. Any hints?
Also, I noticed that if no corpus is selected, cqp dies:
[tomaz at mantra ~]$ cqp
[no corpus]> "kdo";
CQP Error:
No corpus activated
Segmentation fault
[tomaz at mantra ~]$
Best,
Tomaž
More information about the CWB
mailing list