[CWB] CQPweb: number of texts not updated in view corpus metadata

José Manuel Martínez Martínez chozelinek at gmail.com
Fri May 25 13:00:04 CEST 2018


I'm answering myself.

I deleted cached queries, cached databases and cached frequency lists. And
then recreated all frequency lists. After that the info about the documents
is updated.

If you have any comments on this workflow, please let me know. By now, this
will be my routine to update corpora.

Cheers,


--
José Manuel Martínez Martínez
https://chozelinek.github.io

On Fri, May 25, 2018 at 12:52 PM, José Manuel Martínez Martínez <
chozelinek at gmail.com> wrote:

> The reason of doing the update of the corpus like this, instead of
> creating a new one, is that I want users to keep their query history
> associated to this corpus, and avoid most of the configuration steps. But
> if in the end this is going to introduce inconsistencies or problems, I
> will try to find a different way of updating corpora.
>
> Cheers,
>
> --
> José Manuel Martínez Martínez
> https://chozelinek.github.io
>
> On Fri, May 25, 2018 at 12:50 PM, José Manuel Martínez Martínez <
> chozelinek at gmail.com> wrote:
>
>> Hi there,
>>
>> I'm reindexing a corpus several times. Instead of going through the
>> process of installing a new corpus for each new version, what I do is to
>> remove the files in the data folder for that corpus and add the new
>> indices. Next, I go to manage frequency lists and recreate all frequency
>> lists.
>>
>> Then I see that the total number of tokens is updated, but not the number
>> of texts.
>>
>> Question 1: is what I do a bad practice?
>> Question 2: is the problem with the number of texts a bug?
>> Question 3: if the answer to 2 is no, how can I fix it?
>> Question 4: if the answer to 2 is yes, how would you avoid it?
>> Question 5: has the number of texts an impact somewhere else
>> (distributions, other measures?)
>>
>> Thanks in advance!
>>
>> Cheers
>>
>>
>> José Manuel Martínez Martínez
>> https://chozelinek.github.io
>>
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20180525/21e2e8d0/attachment.html>


More information about the CWB mailing list