[CWB] Encoding error in Windows
Hardie, Andrew
a.hardie at lancaster.ac.uk
Fri Apr 8 23:15:42 CEST 2011
It means the encoding hasn't been set to utf8. This is most likely because
you haven't specified the encoding using -c utf8 (cwb-encode defaults to
Latin-1 unless told explicitly which encoding to use).
On the other hand, if you have specified that it is utf-8, then it may
be a bug. If this is the case, could you specify precisely what command
line you've been using? Thanks.
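
As a rough illustration of the two checks above, here is a minimal Python sketch. It assumes the input can be read as raw bytes, and it only assembles a cwb-encode command line rather than running it. The helper names and the paths (data, corpus.vrt, registry/mycorpus) are hypothetical placeholders; the only option taken from this thread is -c utf8, while -d, -f and -R are the usual cwb-encode options for the data directory, input file and registry file.

```python
import shlex


def is_valid_utf8(data: bytes) -> bool:
    """Return True if the bytes decode cleanly as UTF-8 (hypothetical helper)."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


def build_encode_command(data_dir, input_file, registry_file, charset="utf8"):
    """Assemble a cwb-encode argument list that sets the charset explicitly.

    All paths are placeholders chosen for illustration. Leaving out the
    "-c" pair is the situation described above, where cwb-encode falls
    back to Latin-1.
    """
    return [
        "cwb-encode",
        "-d", data_dir,        # directory for the encoded corpus data
        "-f", input_file,      # input file to encode
        "-R", registry_file,   # registry file to create
        "-c", charset,         # corpus character encoding
    ]


if __name__ == "__main__":
    sample = "Македонски јазик".encode("utf-8")
    print("input is valid UTF-8:", is_valid_utf8(sample))
    command = build_encode_command("data", "corpus.vrt", "registry/mycorpus")
    print(shlex.join(command))
```

If the input passes the UTF-8 check and the command really does contain -c utf8, the error is more likely to be the bug mentioned above.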
best
Andrew.
________________________________
From: cwb-bounces at sslmit.unibo.it
[mailto:cwb-bounces at sslmit.unibo.it] On Behalf Of George Goce Mitrevski
Sent: 08 April 2011 22:09
To: Open source development of the Corpus WorkBench
Subject: [CWB] Encoding error in Windows
Can someone please explain what's causing this encoding error
when I try to encode a corpus in Windows in utf8?
"Encoding error: an invalid byte or byte sequence for charset
"latin1" was encountered."
Thanks much.