[CWB] Encoding problem in CWB version 3.2.4

Hardie, Andrew a.hardie at lancaster.ac.uk
Tue Sep 28 01:29:48 CEST 2010


Hi Tomaž,

This looks like two separate bugs. The first I have a reasonable suspicion as to the cause - the second is a bit more of a mystery! Can you tell me, what OS are you on?

Thanks

Andrew.



> -----Original Message-----
> From: cwb-bounces at sslmit.unibo.it 
> [mailto:cwb-bounces at sslmit.unibo.it] On Behalf Of Tomaž Erjavec
> Sent: 27 September 2010 11:36
> To: cwb at sslmit.unibo.it
> Subject: [CWB] Encoding problem in CWB version 3.2.4
> 
> Hi,
> this is probably some silly error, but I can't figure out 
> where I made it. I tried using cwb-3.2.4 and made the corpus 
> ok. The corpus is in
> utf-8 and I used cwb-encode -c utf8
> But when I try using it:
> 
> [tomaz at mantra ~]$ cqp
> [no corpus]> JOS100K-EN;
> JOS100K-EN> "kaj";
> CQP Error:
>          Query includes a character or character sequence 
> that is invalid in the encoding specified for this corpus
> JOS100K-EN>
> 
> which is strange, and the query uses only ASCII. Any hints?
> 
> Also, I noticed that if no corpus is selected, cqp dies:
> 
> [tomaz at mantra ~]$ cqp
> [no corpus]> "kdo";
> CQP Error:
>          No corpus activated
> Segmentation fault
> [tomaz at mantra ~]$
> 
> Best,
> Tomaž
> _______________________________________________
> CWB mailing list
> CWB at sslmit.unibo.it
> http://devel.sslmit.unibo.it/mailman/listinfo/cwb
> 


More information about the CWB mailing list