[CWB] Setting entire document as context
Scott Sadowsky
ssadowsky at gmail.com
Sun Mar 13 21:00:43 CET 2011
On 03/13/2011 11:08 AM, Stefan Evert wrote:
>> If your documents are delimited with "text" tags, you'd use
>> set context 1 text ;
Unfortunately, they're not -- I'd just assumed that CWB's
corpus-building routine would store the source file names as a matter of
course.
Looks like I'm going have to reencode!
> 1) If there are large(ish) documents in your corpus, the next "cat" command with this context setting will probably crash CQP because of a buffer overflow (known problem, we plan to address this in v3.2, but will require fundamental changes in the way kwic lines are formatted). If you're happy with just the text, you can get around this in the following way. [...]
Thanks for the tip!
Cheers,
Scott
More information about the CWB
mailing list