[CWB] UTF-8 encoding problem

Hardie, Andrew a.hardie at lancaster.ac.uk
Tue Apr 23 11:36:01 CEST 2013


Hi Jie Jiang,

If full UTF8 support is important to you then v3.0 is not going to be good enough for the job. You need the most recent version (as of now, that's 3.4.6).

If you're a Linux user, the only way to get v 3.4.* is to build it from source following the instructions here:

http://cwb.sourceforge.net/developers.php

basically,

svn co http://svn.code.sf.net/p/cwb/code/cwb/trunk cwb

And then the file "INSTALL" has instructions on making build-plus-install easy.

best

Andrew.

From: cwb-bounces at sslmit.unibo.it [mailto:cwb-bounces at sslmit.unibo.it] On Behalf Of Jie Jiang
Sent: 23 April 2013 10:30
To: cwb at sslmit.unibo.it
Subject: [CWB] UTF-8 encoding problem

Hi all:
I'm a newbie in CWB. It is a great tool, and I do wish to use it for corpus management.
However, I'm quite concerned that UTF-8 is not well supported as reported in the documentation for version 3.0, so I'm wondering how to word around this issue since UTF-8 is very important for me.
By the way, I noticed version 3.2 and 3.4 are only available as windows installers on SF, but are they available for Linux users as well please?
Thank you in advance!

Best regards!

Jie Jiang
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20130423/ac732929/attachment-0001.html>


More information about the CWB mailing list