[CWB] only first tag in text category

Hardie, Andrew a.hardie at lancaster.ac.uk
Tue Feb 18 18:49:28 CET 2014


You can use them in syntax queries just as you could in command-line CQP. It's also possible to designate some s-level value as the "line number" for the concordance.
(Manage visualisation > (4)Position labels > choose s_id > Update setting). You can also visualise them in concordances, in a limited way (the tags appear but are not rendered according to the rules you specify, cos that bit's not finished.)

best

Andrew.

From: cwb-bounces at sslmit.unibo.it [mailto:cwb-bounces at sslmit.unibo.it] On Behalf Of Andres Chandia
Sent: 18 February 2014 17:44
To: Open source development of the Corpus WorkBench
Subject: Re: [CWB] only first tag in text category

Ok, sorry I have forgot that, so that means that tags at s  level are useless?


El Mar, 18 de Febrero de 2014, 18:25, Hardie, Andrew escribió:
Metadata is at the level of the text. If you extract metadata from the XML associated with a sentence, you will only get one value per text.
I'm pretty sure we covered this issue just before the new year, Andrés, as it happens.
At that time, I pointed out this warning in the interface:
The following XML annotations are indexed in the corpus.
Select the ones which you wish to use as text-metadata fields.
Note: you must only select annotations that occur at or above
the level of [text] in the XML hierarchy of your corpus
Cf. http://devel.sslmit.unibo.it/pipermail/cwb/2013-December/001473.html
Hope that helps!
best
Andrew.
From: cwb-bounces at sslmit.unibo.it [mailto:cwb-bounces at sslmit.unibo.it] On Behalf Of Andres Chandia
Sent: 18 February 2014 16:42
To: cwb at sslmit.unibo.it
Subject: [CWB] only first tag in text category
I'm doing some tests with next files
this corpus (utf8): http://parles.upf.edu/llocs/proves/en-es.txt
this encoding script: http://parles.upf.edu/llocs/proves/encodecorpus_en-es.sh.txt

then I follow these steps:
Install new corpus > Click here to install a corpus you have already indexed in CWB.
Specify a MySQL name for this corpus: tr_en_es
Enter the full name of the corpus: tr_en_es
Specify the CWB name (lowercase format): tr_en_es

Install corpus with setting above >
Design and insert a text-metadata table for the corpus > Click here to install metadata from within-corpus XML annotation. > use:
text_name "class"
text_author "class"
s_id "class" - "primary"
s_lang "class"

Yes please, run this automatically > Create metadata table from XML using the settings above

Corpus settings > The primary text categorisation scheme is currently: s_id: "update"

and then at "Manage text categories"

for "Categories in classification scheme s_id" it only appears : s_id = s1 (no other ids)

and for "Categories in classification scheme s_lang" it only appears: s_lang = es (and not "en")

what I'm doing wrong?



_______________________
andrés chandía
[IMAGE REMOVED]<http://www.chandia.net>
administrador de
parles.upf.edu<http://parles.upf.edu>
psicoaching.net<http://psicoaching.net>
mapuche koyaktu<http://koyaktumapuche.net>
ong mapuche koyaktu<http://corporacionkoyaktu.net>
P No imprima innecesariamente. ¡Cuide el medio ambiente!



_______________________
            andrés chandía
[Image removed by sender. chandia.net]<http://www.chandia.net>[Image removed by sender.]<https://twitter.com/andreschandia>
administrador de
parles.upf.edu<http://parles.upf.edu>
psicoaching.net<http://psicoaching.net>
mapuche koyaktu<http://koyaktumapuche.net>
ong mapuche koyaktu<http://corporacionkoyaktu.net>
P No imprima innecesariamente. ¡Cuide el medio ambiente!
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20140218/537695d6/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: ~WRD000.jpg
Type: image/jpeg
Size: 823 bytes
Desc: ~WRD000.jpg
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20140218/537695d6/attachment-0001.jpg>


More information about the CWB mailing list