[CWB] problem at managing corpus metadata

Hardie, Andrew a.hardie at lancaster.ac.uk
Sat Jan 4 16:46:22 CET 2014


I'm sorry I don't understand the questions. Can you rephrase/elaborate.

best

Andrew.

From: cwb-bounces at sslmit.unibo.it [mailto:cwb-bounces at sslmit.unibo.it] On Behalf Of Andres Chandia
Sent: 04 January 2014 15:31
To: Open source development of the Corpus WorkBench
Subject: Re: [CWB] problem at managing corpus metadata

Some questions then...

1. if you have to index the metadata as an s-attribute in only one field when you index a corpus, what are the rest of the fields for, how and what for can they be used to?

2. if you index corpus via command line with cwb would the s-attributes be availabe in the way I intended to index them, the way that you say is not available yet by cqp interface?

3. at the standard query there is the restriction option, but this one does not get activated when the corpora is indexed, how shoul I proceed at indexing process to activate this?

Thanks


El Mar, 31 de Diciembre de 2013, 20:50, Hardie, Andrew escribió:

1. Yes, of course, because metadata works at the text level. So if you index text metadata from something on the element, then naturally only 1 element per text will actually have any effect. There is a warning in the interface to this effect:
The following XML annotations are indexed in the corpus.
Select the ones which you wish to use as text-metadata fields.
Note: you must only select annotations that occur at or above
the level of in the XML hierarchy of your corpus
What you seem to actually want is to be able to restrict your queries to particular elements depending on their attributes. CQPweb can't do this. Queries can only be restricted to particular *texts*, not to sub-parts of texts. XML Restricted Queries is a much-requested feature and one I hope to be able to implement once the database reorganisation in v3.1 is done. But it can't be done now.

2. Either switch your checkout over from the trunk to the URL of the 3.0 branch, or just manually copy the code available in the download tarball.
best
Andrew.
From: Andres Chandia [mailto:andres at chandia.net]
Sent: 31
December 2013 19:32
To: Hardie, Andrew
Cc: Open source development of the Corpus WorkBench
Subject: RE: [CWB] problem at managing corpus metadata
Thanks, I turned back to the previous one that I had CQPweb v3.0.7 © 2008-2012 and all went well
except for this:

1. I have introduced at the second line s-attributes this way: s:0+id+type, then at the restricted query for the s_id it only appears the data for S1 and S3, but not for S2 and S4, if you see the corpus text_1 owns S1 and S2, text_2 owns S3 and S4, so only appears the first S of each text


And I take the opportunity to ask you how do I upgrade with svn to the version you are recommending


Thanks, and If you don't answer right now I would understand it, Have a Great New Year's Eve!!!



_______________________
            andrés chandía
[Image removed by sender. chandia.net]<http://www.chandia.net>[Image removed by sender.]<https://twitter.com/andreschandia>
administrador de
parles.upf.edu<http://parles.upf.edu>
psicoaching.net<http://psicoaching.net>
mapuche koyaktu<http://koyaktumapuche.net>
ong mapuche koyaktu<http://corporacionkoyaktu.net>
P No imprima innecesariamente. ¡Cuide el medio ambiente!
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20140104/c35bcab3/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: ~WRD351.jpg
Type: image/jpeg
Size: 823 bytes
Desc: ~WRD351.jpg
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20140104/c35bcab3/attachment-0001.jpg>


More information about the CWB mailing list