[CWB] invalid UTF8 string passed to cl_string_canonical...

"Andrés Chandía" andres at chandia.net
Mon May 9 15:31:19 CEST 2016



I'm geting this error message when aligning but I don't know how to deal with it, I just found
one comment about it, it didn't help me though, thanks.

OPENING btcataladeutsch_ca
[205899 tokens, 7733 <s_id> regions]
OPENING btcataladeutsch_de [112264 tokens,
4951 <s_id> regions]
LEXICON SIZE: 24709 / 19889
FEATURE: character count,
weight=1 ... [1]
FEATURE: Shared words, threshold=40.0%, weight=50 ... [6]
FEATURE:
3-grams, weight=3 ... CL: major error, invalid UTF8 string passed to cl_string_canonical...
CL: major error, invalid UTF8 string passed to cl_string_canonical...
CL: major error,
invalid UTF8 string passed to cl_string_canonical...
CL: major error, invalid UTF8 string
passed to cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
[21952]
FEATURE: 4-grams, weight=4 ... CL: major error,
invalid UTF8 string passed to cl_string_canonical...
CL: major error, invalid UTF8 string
passed to cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
[614656]
[636615 features allocated]
[290636 entries in
source text feature map]
[296034 entries in target text feature map]
PASS 2: Setting
character count weight.
PASS 2: Processing shared words (th=40.0%).
PASS 2:
Processing 3-grams.
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
PASS 2: Processing 4-grams.
CL: major error, invalid UTF8
string passed to cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
CL: major error, invalid UTF8 string passed to
cl_string_canonical...
PASS 2: Creating character counts.

_______________________
            andrés
chandía

administrador de:
parles.upf | delingua | amind
terapia | mapuche koyaktu | mail ong mapuche koyaktu | mail psicoaching |
P No imprima innecesariamente. ¡Cuide el medio ambiente!
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20160509/c06e7d0f/attachment.html>


More information about the CWB mailing list