[CWB] Different character encoding problems

Josep M. Fontana josepm.fontana at upf.edu
Sat Aug 18 18:15:19 CEST 2012


Hi, after managing to install the latest version of CWB in order to 
solve the visualization problems I had with UTF-8 encoded corpora I just 
stumbled with another encoding "problem".

I installed an old corpus that was encoded in Latin-1 and when I do a 
search I see the typical symbols that appear for the characters with 
tilde when the parameters for character encoding are not set properly. I 
have tried to find information on this in the CQP manual but I have not 
been able to find any relevant information. The question is: is there 
any command or parameter in CQP that will allow me to visualize the 
characters properly?

JM


More information about the CWB mailing list