[CWB] Indexing problems

Stefan Evert stefanML at collocations.de
Thu Jul 22 15:46:24 CEST 2010


> Removing them solved most of my problems, I still get the syntax error
> messages but they are probably caused by stray double quotes in the
> attributes (line numbers are still not very helpful in identifying the
> problem though...)

They should be, though. :-/

If you can spare a little time, could you test with cwb-encode from the 3.0.0 release, please?  The only explanation I can think of is that very long lines overflow the line input buffer and throw off cwb-encode's line counting.

I seem to remember that the bug fix back then was triggered by spurious error messages like the one you got: if the <text ...> line is very long, it doesn't fit into the line input buffer, and the incomplete tag is reported as a syntax error by the attribute parser.  If there's a real bug hiding behind your problems, though, I'd like to investigate and fix it.
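
To illustrate the suspected mechanism, here is a minimal Python sketch. It is not cwb-encode's actual code (which is written in C): `readline(size)` merely stands in for a fixed-size line input buffer, and the buffer size, function names, and sample input are all made up for the example.

```python
import io

# Hypothetical buffer size, tiny on purpose; not cwb-encode's real value.
BUFFER_SIZE = 16


def count_chunks_naive(stream, size=BUFFER_SIZE):
    """Count every buffer fill as one input line.

    A line longer than the buffer is returned in several pieces, so it is
    counted several times and all later line numbers are shifted.
    """
    count = 0
    while stream.readline(size):
        count += 1
    return count


def count_lines_checked(stream, size=BUFFER_SIZE):
    """Count a line only once its terminating newline has been read."""
    count = 0
    pending = False  # True while we are in the middle of an overlong line
    while True:
        chunk = stream.readline(size)
        if not chunk:
            break
        pending = not chunk.endswith("\n")
        if not pending:
            count += 1
    if pending:
        count += 1  # last line of the input had no trailing newline
    return count


# Made-up input: one overlong <text ...> tag followed by two short lines.
sample = '<text id="a" title="a very long attribute value">\nword\n</text>\n'

# The first buffer fill holds only the start of the tag, which is what an
# attribute parser would then see as an incomplete (syntactically broken) tag.
first_chunk = io.StringIO(sample).readline(BUFFER_SIZE)

print(repr(first_chunk))
print(count_chunks_naive(io.StringIO(sample)))
print(count_lines_checked(io.StringIO(sample)))
```

With this input the naive counter reports more lines than the file contains, while the checked counter reports the true three lines.  That would explain both symptoms at once: a spurious syntax error on the truncated tag and line numbers that no longer match the file.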

Cheers,
Stefan


