[CWB] Appending text to an existing corpus

Hardie, Andrew a.hardie at lancaster.ac.uk
Thu Nov 8 10:53:27 CET 2012


Unfortunately not, you need to re-index from scratch. A p-attribute includes frequency data and a reverse index as well as the actual corpus data, not to mention that the latter is compressed using codes dependent on  frequency: so adding anything means the whole p-att must change.

Best

Andrew.



Nik <cqplist at nikvdp.com> wrote:


Hi all,
I have a pretty simple question: is there any way to append text to an existing corpus?

We're working on a corpus based on data collected from a webcrawler and would like to periodically  update the corpus with new data from the crawler. 


More information about the CWB mailing list