[CWB] invalid UTF8 string passed to cl_string_canonical...

Hardie, Andrew a.hardie at lancaster.ac.uk
Thu May 12 03:56:51 CEST 2016


@ Andrés - while considering your original message again, I noticed your error message....

" CL: major error, invalid UTF8 string passed to cl_string_canonical... "

... is actually out of date. I changed it in March last year, to be more specific. Your version lacks this and a bag of other changes I made at that time.

Can you recompile with an up-to-date copy of the code (and also make sure your copy of GLib is as up to date as possible), then recheck to see if you still get the error messages? It's just possible the error will go away on its own.
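
If the error persists after recompiling, it may also be worth checking whether the input text itself contains bytes that are not valid UTF-8. The following is only a minimal sketch in Python and is not part of CWB: the function name find_invalid_utf8 is hypothetical, and it assumes the corpus input can be read as raw bytes.

```python
def find_invalid_utf8(data: bytes):
    """Return a list of (byte_offset, offending_bytes) for each invalid
    UTF-8 sequence in data. Hypothetical helper, not a CWB function."""
    problems = []
    pos = 0
    while pos < len(data):
        try:
            # Strict decoding raises on the first invalid sequence.
            data[pos:].decode("utf-8")
            break  # the remainder is valid
        except UnicodeDecodeError as err:
            # err.start / err.end are relative to the slice being decoded.
            start = pos + err.start
            end = pos + err.end
            problems.append((start, data[start:end]))
            pos = end  # resume scanning after the bad bytes
    return problems


if __name__ == "__main__":
    sample = "καί ".encode("utf-8") + b"\xff" + " слово".encode("utf-8")
    for offset, bad in find_invalid_utf8(sample):
        print(f"invalid byte(s) {bad!r} at offset {offset}")
```

Re-slicing on every error keeps the sketch short but is slow for input with many bad sequences; for a large file it would be better to scan line by line.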

Incidentally, the previous report of this error, from Ruprecht, *also* went away upon recompilation (because of newer Unicode tables, or so we thought at the time).

best

Andrew.

PS - @ Stefan, here's what I mean : https://sourceforge.net/p/cwb/code/624/

And here's the last time we ran into this: http://devel.sslmit.unibo.it/pipermail/cwb/2015-March/thread.html - the thread called "[CWB] unicode problems with Greek and OCS", from Ruprecht von Waldenfels. Start at the top and go down.

If our memories were only a bit better....

