[CWB] UTF-8 encoding problem
Hardie, Andrew
a.hardie at lancaster.ac.uk
Tue Apr 23 11:36:01 CEST 2013
Hi Jie Jiang,
If full UTF-8 support is important to you, then v3.0 is not going to be good enough for the job. You need the most recent version (as of now, that's 3.4.6).
If you're a Linux user, the only way to get v3.4.* is to build it from source, following the instructions here:
http://cwb.sourceforge.net/developers.php
Basically, check out the trunk:
svn co http://svn.code.sf.net/p/cwb/code/cwb/trunk cwb
And then the file "INSTALL" has instructions on making build-plus-install easy.
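A minimal sketch of those steps as a shell function, assuming the svn command-line client is installed. The names fetch_cwb and CWB_SVN_URL are made up for illustration and are not part of CWB; the build itself is left to whatever INSTALL says.

```shell
# Sketch only: fetch_cwb and CWB_SVN_URL are hypothetical names, not part of CWB.
CWB_SVN_URL="http://svn.code.sf.net/p/cwb/code/cwb/trunk"

fetch_cwb() {
    # Target directory for the checkout; defaults to "cwb" as in the command above.
    dest="${1:-cwb}"

    # Stop early if the Subversion client is missing.
    if ! command -v svn >/dev/null 2>&1; then
        echo "svn not found; install Subversion first" >&2
        return 1
    fi

    # Check out the trunk of the CWB source tree.
    svn co "$CWB_SVN_URL" "$dest" || return 1

    # The build and install steps are documented in the INSTALL file.
    echo "Checkout done. Now read $dest/INSTALL for the build instructions."
}
```

Defining the function does nothing on its own; run `fetch_cwb` (or `fetch_cwb some-other-dir`) to do the checkout, then follow INSTALL from there.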
best
Andrew.
From: cwb-bounces at sslmit.unibo.it [mailto:cwb-bounces at sslmit.unibo.it] On Behalf Of Jie Jiang
Sent: 23 April 2013 10:30
To: cwb at sslmit.unibo.it
Subject: [CWB] UTF-8 encoding problem
Hi all:
I'm a newbie in CWB. It is a great tool, and I do wish to use it for corpus management.
However, I'm quite concerned that UTF-8 is not well supported, as reported in the documentation for version 3.0, so I'm wondering how to work around this issue, since UTF-8 is very important for me.
By the way, I noticed that versions 3.2 and 3.4 are only available as Windows installers on SF. Are they available for Linux users as well, please?
Thank you in advance!
Best regards!
Jie Jiang