[CWB] Character encoding revisited
Teresa Molés Cases
teresamoles at gmail.com
Wed Jun 25 18:56:53 CEST 2014
Hi, Josep M.,
I’m not sure if this will help you, but I had a similar problem. In PuTTY, before validating, I manually changed the character encoding in the ‘Translation’ panel (inside ‘Window’), and everything worked perfectly after that. Good luck!
On 25/06/2014, at 18:41, Josep M. Fontana <josepm.fontana at upf.edu> wrote:
> Our corpus is encoded in UTF-8, but when I create a text file with the output of a search I get the typical odd characters one gets when the conversion has gone wrong. I used the 'file' command and saw that the text files are sometimes encoded as ISO-8859 and other times as ASCII. Is there any way to configure things so that the UTF-8 character set is maintained? Thanks.
> Josep M.
> CWB mailing list
> CWB at sslmit.unibo.it
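
A file reported as ASCII by 'file' may not be a problem at all: pure ASCII is also valid UTF-8, so that result may only mean the search output contained no accented characters. A file reported as ISO-8859 does contain bytes that are not valid UTF-8. As a rough sketch of that distinction (the function name classify_encoding is made up for illustration and is not part of CWB or of 'file'), something like this could be used to check an output file:

```python
def classify_encoding(data: bytes) -> str:
    """Roughly separate pure ASCII, valid UTF-8, and other 8-bit text."""
    try:
        # Pure ASCII is a subset of UTF-8, so such a file is already fine.
        data.decode("ascii")
        return "ascii"
    except UnicodeDecodeError:
        pass
    try:
        # Non-ASCII bytes that form valid UTF-8 sequences.
        data.decode("utf-8")
        return "utf-8"
    except UnicodeDecodeError:
        # Some other 8-bit encoding (for example ISO-8859-1), or binary data.
        return "not utf-8"


# Hypothetical usage on a saved search result:
# with open("search_output.txt", "rb") as fh:
#     print(classify_encoding(fh.read()))
```

If the result is "not utf-8", the conversion happened somewhere between the corpus and the file; if it is "utf-8" but the characters still look wrong on screen, the terminal setting described above is the more likely culprit.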