[CWB] only first tag in text category

Andres Chandia andres at chandia.net
Tue Feb 18 18:43:39 CET 2014



Ok, sorry I have forgot that, so that means that tags at s  level are useless?


El Mar, 18 de Febrero de 2014, 18:25, Hardie, Andrew escribió:
 <style type="text/css">-></style>


Metadata
is at the level of the text. If you extract metadata from the XML associated with a sentence,
you will only get one value per text. 


I’m
pretty sure we covered this issue just before the new year, Andrés, as it happens.

At
that time, I pointed out this warning in the interface:


The following XML
annotations are indexed in the corpus.
Select the ones which
you wish to use as text-metadata fields.
Note: you must only
select annotations that occur at or above
the level of [text]
in the XML hierarchy of your corpus


Cf.
http://devel.sslmit.unibo.it/pipermail/cwb/2013-December/001473.html



Hope
that helps!


best


Andrew.




From:
cwb-bounces at sslmit.unibo.it [mailto:cwb-bounces at sslmit.unibo.it] On Behalf Of
Andres Chandia
 Sent: 18 February 2014 16:42

To: cwb at sslmit.unibo.it
 Subject: [CWB] only first tag
in text category
 
I'm doing some tests with next files
 this corpus (utf8):
http://parles.upf.edu/llocs/proves/en-es.txt
 this encoding script:
http://parles.upf.edu/llocs/proves/encodecorpus_en-es.sh.txt
 
 then I follow these
steps:
 Install new corpus > Click here to install a corpus you have already indexed
in CWB.
 Specify a MySQL name for this corpus: tr_en_es
 Enter the full name of the
corpus: tr_en_es
 Specify the CWB name (lowercase format): tr_en_es
 
 Install
corpus with setting above >
Design and insert a text-metadata table for the corpus
> Click here to install metadata from within-corpus XML annotation. > use:

text_name "class"
 text_author "class"
 s_id "class" -
"primary"
 s_lang "class"
 
 Yes please,
run this automatically > Create metadata table from XML using the settings above 
 
 Corpus settings > The primary text categorisation scheme is currently: s_id:
"update"
 
 and then at "Manage text categories"
 

for "Categories in classification scheme s_id" it only appears : s_id = s1
(no other ids)
 
 and for "Categories in classification scheme s_lang"
it only appears: s_lang = es  (and not "en")
 
 what I'm doing
wrong?
 
 
 
 _______________________
 andrés
chandía
 
 administrador de
 parles.upf.edu
 psicoaching.net
 mapuche koyaktu
 ong
mapuche koyaktu
 P
No imprima innecesariamente. ¡Cuide
el medio ambiente! 


 


_______________________
            andrés
chandía

administrador de
parles.upf.edu
psicoaching.net
mapuche koyaktu
ong mapuche koyaktu
P No imprima innecesariamente. ¡Cuide el medio ambiente!
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20140218/78f4e0a5/attachment.html>


More information about the CWB mailing list