[CWB] CQPweb - managing metadata

Hardie, Andrew a.hardie at lancaster.ac.uk
Wed Jul 21 15:44:31 CEST 2010


Hi Claudia,

 

Could you try encoding without the explicit text_id and text_lang elements in your input file? CQPweb assumes that input files will be valid XML, and that s-attributes like text_id and text_lang are to be inferred from the attributes of text. So spelling them out may have caused the problem.The file  ___install_temp_metadata_illum01 should have been created by cwb-s-decode from the text_id s-attribute, so the fact that it was missing suggests that this s-attribute is not available.

 

On the more general point about metadata: in this case the “minimalist metadata” is probably what you want so you are going about it the right way. As the manual explains “The metadata file should be a tab-delimited database. The first column should be the text id-codes, with a line for each text. You can then have as many columns of metadata as you need.” If you haven’t got a table of information like this, then the minimalist-metadata generates a dummy table for you. “Entering metadata fields” simply means specifying what the columns in your table of information contain, so is not relevant if you don’t have such a table.

 

best

 

Andrew.

 

From: cwb-bounces at sslmit.unibo.it [mailto:cwb-bounces at sslmit.unibo.it] On Behalf Of Claudia Borg
Sent: 20 July 2010 15:32
To: CWB mailing list
Subject: [CWB] CQPweb - managing metadata

 

Hi all,

I am trying to install my own corpus though cqpweb - I have a simple vertical text file in the following structure:

<text id="illum01" lang="Maltese">
<text_id "illum01">
<text_lang "Maltese">
<s>
word1
word2
...
</s>
</text_lang>
</text_id>
</text>

there is no annotation (pos, lemma, ect) so its basically like a word list. The corpus installation process goes well (I used default p-attributes, even if in reality I only have word attribute - in future I will add pos and lemma but for the time being I am just trying to get used to cqpweb), but then I need to install the metadata, and I cannot quite understand what is required here. 

If I try to create a minimalist metadata table without specifying anything in the manage metatdata page, then I get this error:

A mySQL query did not run successfully!

Error # 2: 
File '/home/mlrs/corpora/system/temp/___install_temp_metadata_illum01' not found (Errcode: 2) 



from mysql admin, I see that the table text_metadata_for_illum01 has been created but it is empty (no rows).

If I try to enter some metadata fields (which I cannot clearly understand what's meant to be here), then I still get the above error. 

I cannot seem to find anything specific to this problem in the documentation (i.e. explaining what metadata should look like, ect.).  I am mainly following:
http://cwb.svn.sourceforge.net/viewvc/cwb/gui/cqpweb/trunk/doc/CQPweb-installing-corpora.html

Any pointers would be appreciated.

Regards 
Claudia



-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20100721/1147f104/attachment.htm


More information about the CWB mailing list