[CWB] Maximum corpus size exceeded

Scott Sadowsky ssadowsky at gmail.com
Thu Mar 30 10:21:26 CEST 2017


Hi all,

I just got this warning for the first time:

WARNING: Maximal corpus size has been exceeded.
         Input truncated to the first 2147483647 tokens (file
/home/homebox/Corpora/source-files//input.vrt, line #3161375683).
Warning: missing </s> tag inserted at end of input.

Is there any way around this, by chance? That's 2^31, just a bit shy of 32
bits, but I'm on a 64 bit system with ext4 filesystems, so I assume the
issue is CQB related.

Cheers!
Scott
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20170330/685e34b1/attachment.html>


More information about the CWB mailing list