[CWB] CWB Digest, Vol 84, Issue 20

Hardie, Andrew a.hardie at lancaster.ac.uk
Thu Jan 30 10:58:07 CET 2014


>> Can I populate this table some other way?

No. You need to find out what is going wrong when you do it this way; otherwise, any other way would fail as well.

Can you check that the necessary text ids are properly encoded, i.e. do a CQP-syntax query for

<text_id "10038"> []

and see whether it returns the first word of text 10038, as expected.
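
If you want to run the same check for several of the ids in the table quoted below, here is a minimal Python sketch that pulls out the all-zero rows and prints one query per id. The helper names (first_token_query, zero_offset_ids) are made up for illustration and are not part of CWB or CQPweb; the table is assumed to be pasted in as plain text in the layout shown in the quoted message. The printed queries still have to be run in CQP by hand.

```python
# Hypothetical helpers for illustration only; not part of CWB or CQPweb.

def first_token_query(text_id: str) -> str:
    """Build the CQP query that should match the first token of a text."""
    return f'<text_id "{text_id}"> []'


def zero_offset_ids(table: str) -> list[str]:
    """Return the text ids whose words, cqp_begin and cqp_end are all 0."""
    ids = []
    for line in table.splitlines():
        # Split a row such as "| 10038 | 0 | 0 | 0 |" into its four cells.
        cells = [c.strip() for c in line.strip().strip("|").split("|")]
        if len(cells) != 4 or cells[0] == "text_id":
            continue  # skip blank lines, malformed rows and the header row
        if all(c == "0" for c in cells[1:]):
            ids.append(cells[0])
    return ids


# Assumed input: rows copied from the metadata table as plain text.
TABLE = """
| text_id | words | cqp_begin | cqp_end |
| 10038   |     0 |         0 |       0 |
| 6570    |     0 |         0 |       0 |
"""

for tid in zero_offset_ids(TABLE):
    print(first_token_query(tid))
```

If none of these queries returns a hit, that would suggest the text_id attribute was not encoded as expected, rather than a problem in the table itself.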

best

Andrew.

From: cwb-bounces at sslmit.unibo.it [mailto:cwb-bounces at sslmit.unibo.it] On Behalf Of Andres Chandia
Sent: 29 January 2014 17:22
To: Open source development of the Corpus WorkBench
Subject: Re: [CWB] CWB Digest, Vol 84, Issue 20

Hi there,

I am reopening this issue because something similar is happening to me now, although the corpus is bigger: 18 GB.
When I try to populate the text metadata table with begin/end offset positions, I get no results. That is, the table is created, but all the values look like this:
| text_id | words | cqp_begin | cqp_end |
| 10038   |     0 |         0 |       0 |
| 6570    |     0 |         0 |       0 |
| 4099    |     0 |         0 |       0 |
| 9887    |     0 |         0 |       0 |
| 819     |     0 |         0 |       0 |
| 4910    |     0 |         0 |       0 |
| 7669    |     0 |         0 |       0 |
| 2889    |     0 |         0 |       0 |
| 9627    |     0 |         0 |       0 |
| 5265    |     0 |         0 |       0 |
| 1076    |     0 |         0 |       0 |
| 6196    |     0 |         0 |       0 |
| 4213    |     0 |         0 |       0 |
| 1212    |     0 |         0 |       0 |
| 4688    |     0 |         0 |       0 |

So the interface for this corpus always says: "The text metadata table has not yet been populated with begin/end offset positions."
I have put the interface in debug mode, and this is what I get: sdewac-debug.7z

Can I populate this table some other way?
