[CWB] Encoding error in Windows

Hardie, Andrew a.hardie at lancaster.ac.uk
Fri Apr 8 23:15:42 CEST 2011


The error means cwb-encode is treating your input as Latin-1 rather than
utf8. This is probably because you haven't specified the encoding using
-c utf8 (cwb-encode defaults to Latin-1 unless told which encoding to use).
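
A minimal sketch in Python of what that looks like in practice, assuming
hypothetical file names and a hypothetical helper called
build_encode_command. It confirms that the input really is valid UTF-8 and
then assembles a cwb-encode argument list with the charset given
explicitly. Only the -c, -d, -f and -R options are used; any -P or -S
attribute declarations your corpus needs would have to be appended.

```python
def is_valid_utf8(data: bytes) -> bool:
    """Return True if the bytes decode cleanly as UTF-8."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


def build_encode_command(input_file, data_dir, registry_file, charset="utf8"):
    """Assemble a cwb-encode argument list (hypothetical helper).

    The charset is always passed explicitly, because cwb-encode
    falls back to Latin-1 when -c is omitted.
    """
    return [
        "cwb-encode",
        "-c", charset,             # character set of the input text
        "-d", str(data_dir),       # directory for the encoded data files
        "-f", str(input_file),     # input file to encode
        "-R", str(registry_file),  # registry entry to create
    ]


if __name__ == "__main__":
    # Hypothetical paths, for illustration only.
    cmd = build_encode_command("corpus.vrt", "data", "registry/mycorpus")
    print(" ".join(cmd))
```

The resulting list can be handed to subprocess.run(cmd, check=True) once
the paths point at real files. If is_valid_utf8 returns False for the
input, the file itself is not UTF-8 and needs converting first.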
 
On the other hand, if you have specified that it is utf-8, then it may
be a bug. If this is the case, could you specify precisely what command
line you've been using? Thanks.
 
best
 
Andrew.


________________________________

	From: cwb-bounces at sslmit.unibo.it
[mailto:cwb-bounces at sslmit.unibo.it] On Behalf Of George Goce Mitrevski
	Sent: 08 April 2011 22:09
	To: Open source development of the Corpus WorkBench
	Subject: [CWB] Encoding error in Windows
	
	
	Can someone please explain what's causing this encoding error
when I try to encode a corpus in Windows in utf8?

	"Encoding error: an invalid byte or byte sequence for charset
"latin1" was encountered."
	
	
	
	Thanks much.
