[CWB] cwb-makeall error during indexing

Giorgina Cerutti Benitez Giorgina.Cerutti at unige.ch
Mon Feb 29 16:27:01 CET 2016


Its recurrence is strange, indeed. I specified the XML structure directly on the “install corpus” form.

De : cwb-bounces at sslmit.unibo.it [mailto:cwb-bounces at sslmit.unibo.it] De la part de Hardie, Andrew
Envoyé : lundi 29 février 2016 16:20
À : Open source development of the Corpus WorkBench <cwb at sslmit.unibo.it>
Objet : Re: [CWB] cwb-makeall error during indexing

Yes, that is *definitely* the bug I had thought I fixed! it’s here in the cwb-encode arguments:

-S +id -S +id+num+title+lang -S +id+num+title+lang

The bug was in the assembly of the S-attribute string – the name of the attribute is missing. But, as I said, I’ve indexed corpora on my own server on the present version of the code without being hit by this bug. So its recurrence is puzzling.

Can you tell me – did you use an XML template, or did you specify the XML structure directly on the “install corpus” form?

best

Andrew.

From: cwb-bounces at sslmit.unibo.it<mailto:cwb-bounces at sslmit.unibo.it> [mailto:cwb-bounces at sslmit.unibo.it] On Behalf Of Giorgina Cerutti Benitez
Sent: 29 February 2016 14:15
To: Open source development of the Corpus WorkBench
Subject: Re: [CWB] cwb-makeall error during indexing

Hi Andrew,

Thank you very much for your quick reply.

This is the precise text of the error:

“**** CQP ERROR **** REGISTRY ERROR (/export/data/CQPweb_data/registry/t20b): syntax error REGISTRY ERROR (/export/data/CQPweb_data/registry/t20b): Error parsing the main Registry structure. REGISTRY ERROR (/export/data/CQPweb_data/registry/t22): syntax error REGISTRY ERROR (/export/data/CQPweb_data/registry/t22): Error parsing the main Registry structure.
CQPweb encountered an error and could not continue.


cwb-makeall reported an error! Corpus indexing aborted. <pre>cwb-encode -xsB -c utf8 -d /export/data/CQPweb_data/corpus/t22b -f /export/data/CQPweb_data/upload/test20.vrt -R "/export/data/CQPweb_data/registry/t22b" -S +id -S +id+num+title+lang -S +id+num+title+lang 2>&1 s-attribute <text> not declared, inserted literally (file /export/data/CQPweb_data/upload/test20.vrt, line #1, warning issued only once). s-attribute <story> not declared, inserted literally (file /export/data/CQPweb_data/upload/test20.vrt, line #2, warning issued only once). s-attribute <s> not declared, inserted literally (file /export/data/CQPweb_data/upload/test20.vrt, line #3, warning issued only once). cwb-makeall -r "/export/data/CQPweb_data/registry" -V T22B 2>&1 REGISTRY ERROR (/export/data/CQPweb_data/registry/t22b): syntax error REGISTRY ERROR (/export/data/CQPweb_data/registry/t22b): Error parsing the main Registry structure. Corpus T22B not found in registry /export/data/CQPweb_data/registry . Aborted.</pre>


Let me know if you need further details.

Thank you very much for your time and for your help.

Regards,

Giorgina

De : cwb-bounces at sslmit.unibo.it<mailto:cwb-bounces at sslmit.unibo.it> [mailto:cwb-bounces at sslmit.unibo.it] De la part de Hardie, Andrew
Envoyé : lundi 29 février 2016 15:03
À : Open source development of the Corpus WorkBench <cwb at sslmit.unibo.it<mailto:cwb at sslmit.unibo.it>>
Objet : Re: [CWB] cwb-makeall error during indexing

Hi Giorgina,

I did actually write a reply to your similar email of 4 days ago (@ 25 Feb 11.18) but it looks like it didn’t post to the list. Probably a fault at my end – I was travelling and my net connection was unreliable.

Here’s what I said:

Hi Giorgina,

This is pretty peculiar – I was reasonably sure I’d fixed this one. You don’t say whether you’re checked out from the trunk or from the 3.2.6 branch, but either way it should be fixed: I’ve used this version of the code myself to index corpora without errors.

Can you send the precise text of the error to the list?

thanks

Andrew.



From: cwb-bounces at sslmit.unibo.it<mailto:cwb-bounces at sslmit.unibo.it> [mailto:cwb-bounces at sslmit.unibo.it] On Behalf Of Giorgina Cerutti Benitez
Sent: 29 February 2016 13:57
To: Open source development of the Corpus WorkBench
Subject: [CWB] cwb-makeall error during indexing

Hi everyone,

We have a cwb-makeall error during when installing our corpora in version 3.2.6, which I believe has been dealt with in this thread: http://devel.sslmit.unibo.it/pipermail/cwb/2016-February/002227.html As it is explained there, we tried to fix this bug by updating CQPweb from SVN, but the it is still there. Does anyone know how to fix this?

Thank you very much for your help.

Best regards,

Giorgina Cerutti
Assistant
Department of Translation – Spanish Unit
Faculty of Translation and Interpreting
University of Geneva
Office 6242 – Uni Mail
40 bd du Pont d'Arve
CH-1211 Genève 4
[cid:image007.png at 01D1127F.0F2785D0]<https://www.linkedin.com/pub/giorgina-cerutti/20/337/7a0/en>[Facebook]<https://www.facebook.com/UNES.FTI.UNIGE>[Twitter]<https://twitter.com/giorginacerutti>[Transius_EN]<http://transius.unige.ch/en/>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20160229/ed25b33f/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image001.png
Type: image/png
Size: 967 bytes
Desc: image001.png
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20160229/ed25b33f/attachment-0003.png>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image002.png
Type: image/png
Size: 1016 bytes
Desc: image002.png
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20160229/ed25b33f/attachment-0004.png>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image003.jpg
Type: image/jpeg
Size: 813 bytes
Desc: image003.jpg
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20160229/ed25b33f/attachment-0001.jpg>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image004.png
Type: image/png
Size: 1007 bytes
Desc: image004.png
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20160229/ed25b33f/attachment-0005.png>


More information about the CWB mailing list