[CWB] Encoding problem in CWB version 3.2.4
Hardie, Andrew
a.hardie at lancaster.ac.uk
Tue Sep 28 01:29:48 CEST 2010
Hi Tomaž,
This looks like two separate bugs. The first I have a reasonable suspicion as to the cause - the second is a bit more of a mystery! Can you tell me, what OS are you on?
Thanks
Andrew.
> -----Original Message-----
> From: cwb-bounces at sslmit.unibo.it
> [mailto:cwb-bounces at sslmit.unibo.it] On Behalf Of Tomaž Erjavec
> Sent: 27 September 2010 11:36
> To: cwb at sslmit.unibo.it
> Subject: [CWB] Encoding problem in CWB version 3.2.4
>
> Hi,
> this is probably some silly error, but I can't figure out
> where I made it. I tried using cwb-3.2.4 and made the corpus
> ok. The corpus is in
> utf-8 and I used cwb-encode -c utf8
> But when I try using it:
>
> [tomaz at mantra ~]$ cqp
> [no corpus]> JOS100K-EN;
> JOS100K-EN> "kaj";
> CQP Error:
> Query includes a character or character sequence
> that is invalid in the encoding specified for this corpus
> JOS100K-EN>
>
> which is strange, and the query uses only ASCII. Any hints?
>
> Also, I noticed that if no corpus is selected, cqp dies:
>
> [tomaz at mantra ~]$ cqp
> [no corpus]> "kdo";
> CQP Error:
> No corpus activated
> Segmentation fault
> [tomaz at mantra ~]$
>
> Best,
> Tomaž
> _______________________________________________
> CWB mailing list
> CWB at sslmit.unibo.it
> http://devel.sslmit.unibo.it/mailman/listinfo/cwb
>
More information about the CWB
mailing list