[CWB] Character encoding revisited

Teresa Molés Cases teresamoles at gmail.com
Wed Jun 25 18:56:53 CEST 2014


Hi, Josep M.,

I’m not sure if this will help you, but I had a similar problem. When opening PuTTY, before logging in, I manually changed the character encoding in the ‘Translation’ panel (under ‘Window’), and everything worked perfectly after that. Good luck!
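
To illustrate why the terminal's character set can matter here, a minimal sketch in Python 3 follows. The sample word and the helper name looks_like_utf8 are made up for illustration and have nothing to do with CWB or PuTTY themselves. It only shows the general effect: UTF-8 bytes read as ISO-8859-1 produce the typical odd characters, and text containing only ASCII characters is already valid UTF-8, which may be why 'file' sometimes reports plain ASCII.

```python
# Illustrative sample string (an assumption, not taken from the corpus).
text = "Mol\u00e9s"  # "Molés"

# The corpus side produces UTF-8 bytes: "é" becomes the two bytes C3 A9.
utf8_bytes = text.encode("utf-8")

# A terminal set to ISO-8859-1 treats every byte as one character,
# so the two bytes of "é" show up as two odd characters.
garbled = utf8_bytes.decode("iso-8859-1")

# Decoding with the matching character set recovers the original text.
restored = utf8_bytes.decode("utf-8")


def looks_like_utf8(data: bytes) -> bool:
    """Hypothetical helper: True if the bytes decode cleanly as UTF-8."""
    try:
        data.decode("utf-8")
        return True
    except UnicodeDecodeError:
        return False


if __name__ == "__main__":
    print(garbled)
    print(restored)
    # Pure ASCII is a subset of UTF-8, so this check passes.
    print(looks_like_utf8(b"plain ascii"))
    # Bytes written as ISO-8859-1 are usually not valid UTF-8.
    print(looks_like_utf8(text.encode("iso-8859-1")))
```

If the saved search output passes a check like this but still looks wrong on screen, the file itself is probably fine and only the terminal's display setting needs changing.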

Best,

Teresa

On 25/06/2014, at 18:41, Josep M. Fontana <josepm.fontana at upf.edu> wrote:

> Hi,
> 
> Our corpus is encoded in UTF-8, but when I create a text file with the output of some search, I get the typical odd characters one gets when the conversion has gone wrong. I used the 'file' command and saw that the text files are sometimes encoded as ISO-8859 and at other times as ASCII. Is there any way to configure things so that the UTF-8 character set is maintained? Thanks.
> 
> 
> Josep M.
> _______________________________________________
> CWB mailing list
> CWB at sslmit.unibo.it
> http://devel.sslmit.unibo.it/mailman/listinfo/cwb





