[CWB] CQPweb - managing metadata
Hardie, Andrew
a.hardie at lancaster.ac.uk
Wed Jul 21 15:44:31 CEST 2010
Hi Claudia,
Could you try encoding without the explicit text_id and text_lang elements in your input file? CQPweb assumes that input files will be valid XML, and that s-attributes like text_id and text_lang are to be inferred from the attributes of text. So spelling them out may have caused the problem.The file ___install_temp_metadata_illum01 should have been created by cwb-s-decode from the text_id s-attribute, so the fact that it was missing suggests that this s-attribute is not available.
On the more general point about metadata: in this case the “minimalist metadata” is probably what you want so you are going about it the right way. As the manual explains “The metadata file should be a tab-delimited database. The first column should be the text id-codes, with a line for each text. You can then have as many columns of metadata as you need.” If you haven’t got a table of information like this, then the minimalist-metadata generates a dummy table for you. “Entering metadata fields” simply means specifying what the columns in your table of information contain, so is not relevant if you don’t have such a table.
best
Andrew.
From: cwb-bounces at sslmit.unibo.it [mailto:cwb-bounces at sslmit.unibo.it] On Behalf Of Claudia Borg
Sent: 20 July 2010 15:32
To: CWB mailing list
Subject: [CWB] CQPweb - managing metadata
Hi all,
I am trying to install my own corpus though cqpweb - I have a simple vertical text file in the following structure:
<text id="illum01" lang="Maltese">
<text_id "illum01">
<text_lang "Maltese">
<s>
word1
word2
...
</s>
</text_lang>
</text_id>
</text>
there is no annotation (pos, lemma, ect) so its basically like a word list. The corpus installation process goes well (I used default p-attributes, even if in reality I only have word attribute - in future I will add pos and lemma but for the time being I am just trying to get used to cqpweb), but then I need to install the metadata, and I cannot quite understand what is required here.
If I try to create a minimalist metadata table without specifying anything in the manage metatdata page, then I get this error:
A mySQL query did not run successfully!
Error # 2:
File '/home/mlrs/corpora/system/temp/___install_temp_metadata_illum01' not found (Errcode: 2)
from mysql admin, I see that the table text_metadata_for_illum01 has been created but it is empty (no rows).
If I try to enter some metadata fields (which I cannot clearly understand what's meant to be here), then I still get the above error.
I cannot seem to find anything specific to this problem in the documentation (i.e. explaining what metadata should look like, ect.). I am mainly following:
http://cwb.svn.sourceforge.net/viewvc/cwb/gui/cqpweb/trunk/doc/CQPweb-installing-corpora.html
Any pointers would be appreciated.
Regards
Claudia
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20100721/1147f104/attachment.htm
More information about the CWB
mailing list