<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META http-equiv=Content-Type content="text/html; charset=us-ascii">
<META content="MSHTML 6.00.6000.17095" name=GENERATOR></HEAD>
<BODY>
<DIV dir=ltr align=left><SPAN class=390151121-08042011><FONT face=Verdana
color=#000080 size=2>The error means the encoding has not been set to utf8, so
cwb-encode is validating your data as Latin-1. The most likely cause is that you
have not specified the encoding with <STRONG>-c utf8</STRONG> (cwb-encode
defaults to Latin-1 unless told which encoding to use).</FONT></SPAN></DIV>
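<DIV dir=ltr align=left><SPAN class=390151121-08042011><FONT face=Verdana
color=#000080 size=2>Before re-running cwb-encode with -c utf8, it may also be
worth confirming that the input file really is valid UTF-8. The following is a
minimal Python sketch, not part of CWB; the function names is_valid_utf8 and
check_file, and the file name corpus.vrt in the comment, are hypothetical and
only serve as illustration.</FONT></SPAN></DIV>
<PRE>
```python
def is_valid_utf8(data):
    """Return True if the byte string decodes cleanly as UTF-8."""
    try:
        data.decode("utf-8")
    except UnicodeDecodeError:
        return False
    return True


def check_file(path):
    """Read a file as raw bytes and report whether it is valid UTF-8."""
    # Binary mode matters here: text mode would apply the platform's
    # default encoding before the check could run.
    with open(path, "rb") as handle:
        return is_valid_utf8(handle.read())


# Hypothetical usage, assuming the corpus input is in corpus.vrt:
#     print(check_file("corpus.vrt"))
```
</PRE>
<DIV dir=ltr align=left><SPAN class=390151121-08042011><FONT face=Verdana
color=#000080 size=2>If a check like this passes and the latin1 error still
appears with -c utf8 given, the input file itself is unlikely to be the
problem.</FONT></SPAN></DIV>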
<DIV dir=ltr align=left><SPAN class=390151121-08042011></SPAN><SPAN
class=390151121-08042011><FONT face=Verdana color=#000080
size=2></FONT></SPAN> </DIV>
<DIV dir=ltr align=left><SPAN class=390151121-08042011><FONT face=Verdana
color=#000080 size=2>On the other hand, if you <STRONG><EM>have</EM></STRONG>
specified utf8, then it may be a bug. In that case, could you post the exact
command line you used?
Thanks.</FONT></SPAN></DIV>
<DIV dir=ltr align=left><SPAN class=390151121-08042011><FONT face=Verdana
color=#000080 size=2></FONT></SPAN> </DIV>
<DIV dir=ltr align=left><SPAN class=390151121-08042011><FONT face=Verdana
color=#000080 size=2>Best,</FONT></SPAN></DIV>
<DIV dir=ltr align=left><SPAN class=390151121-08042011><FONT face=Verdana
color=#000080 size=2></FONT></SPAN> </DIV>
<DIV dir=ltr align=left><SPAN class=390151121-08042011><FONT face=Verdana
color=#000080 size=2>Andrew.</FONT></SPAN></DIV><BR>
<BLOCKQUOTE dir=ltr
style="PADDING-LEFT: 5px; MARGIN-LEFT: 5px; BORDER-LEFT: #000080 2px solid; MARGIN-RIGHT: 0px">
<DIV class=OutlookMessageHeader lang=en-us dir=ltr align=left>
<HR tabIndex=-1>
<FONT face=Tahoma size=2><B>From:</B> cwb-bounces@sslmit.unibo.it
[mailto:cwb-bounces@sslmit.unibo.it] <B>On Behalf Of </B>George Goce
Mitrevski<BR><B>Sent:</B> 08 April 2011 22:09<BR><B>To:</B> Open source
development of the Corpus WorkBench<BR><B>Subject:</B> [CWB] Encoding error in
Windows<BR></FONT><BR></DIV>
<DIV></DIV>
<DIV
style="FONT-SIZE: 12pt; COLOR: #000; FONT-FAMILY: times new roman, new york, times, serif; BACKGROUND-COLOR: #fff">
<DIV
style="FONT-SIZE: 12pt; FONT-FAMILY: 'times new roman', 'new york', times, serif">Can
someone please explain what's causing this encoding error when I try to encode a
corpus on Windows in utf8?</DIV>
<DIV
style="FONT-SIZE: 12pt; FONT-FAMILY: 'times new roman', 'new york', times, serif"><BR></DIV>
<DIV style="FONT-FAMILY: 'times new roman', 'new york', times, serif">
<DIV style="FONT-FAMILY: 'times new roman', 'new york', times, serif">
<DIV id=yiv154454282>
<DIV class=yiv154454282Section1 dir=rtl>
<DIV class=yiv154454282MsoNormal dir=ltr
style="DIRECTION: ltr; unicode-bidi: embed; TEXT-ALIGN: left"><FONT
class=Apple-style-span face=Arial><FONT class=Apple-style-span
size=2>"Encoding error: an invalid byte or byte sequence for charset "latin1"
was encountered."</FONT><BR></FONT></DIV>
<DIV class=yiv154454282MsoNormal dir=ltr
style="DIRECTION: ltr; unicode-bidi: embed; TEXT-ALIGN: left"><FONT
class=Apple-style-span face=Arial><FONT class=Apple-style-span
size=2><BR></FONT></FONT></DIV>
<DIV class=yiv154454282MsoNormal dir=ltr
style="DIRECTION: ltr; unicode-bidi: embed; TEXT-ALIGN: left"><FONT
class=Apple-style-span face=Arial><FONT class=Apple-style-span size=2>Thanks
much.</FONT></FONT></DIV></DIV></DIV></DIV></DIV></DIV></BLOCKQUOTE></BODY></HTML>