[CWB] Issues when installing metadata restrictions

Giorgina Cerutti Benitez Giorgina.Cerutti at unige.ch
Tue Apr 5 10:52:52 CEST 2016


Hello everyone,

I am writing to you because we are having issues when installing our metadata classifications. We are currently testing metadata installation with corpus T39 (figure 1). Even though we manage to specify our s-attributes (see figure 2), only three of them are recognized as classifications when installing the metadata from the embedded XML (see figure 3); and in other tests none of them is recognized at all (see figure 4).

[cid:image010.jpg at 01D18E88.2B7242E0]
Figure 1:

<text id="test13" period="1" organization="un" category="Monitoring and application" genre="legislative" lang="French"
this
is
a
test
</text>
<text id="test25" period="2" organization="eu" category="Lawmaking" genre="monitoring" lang="Spanish">
this
is
also
a
test
.
thas
worked
</text>
<text id="test26" period="3" organization="wto" category="Adjudication" genre="adjudication" lang="English">
thas
thus
shalala
muajajaja
</text>

[cid:image012.jpg at 01D18E88.2B7242E0]
Figure 2

[cid:image013.jpg at 01D18E88.2B7242E0]
Figure 3

[cid:image015.jpg at 01D18E88.2B7242E0]
Figure 4

We have then tried to install metadata by specifying the desired settings by hand (see figure 5), but we encounter an error (see figure 6).

[cid:image016.jpg at 01D18E88.2B7242E0]
Figure 5

[cid:image020.jpg at 01D18E88.2B7242E0]
Figure 6

The data source you specified for the text metadata contains badly-formatted text ID codes, as follows: <strong> '.'; '</text>'; '<text id="test13" period="1" organization="un" category="Monitoring and application" genre="legislative" lang="French"'; '<text id="test25" period="2" organization="eu" category="Lawmaking" genre="monitoring" lang="Spanish">'; '<text id="test26" period="3" organization="wto" category="Adjudication" genre="adjudication" lang="English">';</strong> (text ids can only contain unaccented letters, numbers, and underscore).

Since we cannot identify the error, we were wondering if any of you has had the same problem (I couldn't find any thread or information in the manual about this). I would also be grateful if you could tell us if this is a bug or if the system only accepts three classifications.

Thank you very much.

Regards,


Giorgina Cerutti
Assistant
Department of Translation - Spanish Unit
Faculty of Translation and Interpreting
University of Geneva
Office 6242 - Uni Mail
40 bd du Pont d'Arve
CH-1211 Genève 4
[cid:image007.png at 01D1127F.0F2785D0]<https://www.linkedin.com/pub/giorgina-cerutti/20/337/7a0/en>[Facebook]<https://www.facebook.com/UNES.FTI.UNIGE>[Twitter]<https://twitter.com/giorginacerutti>[Transius_EN]<http://transius.unige.ch/en/>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20160405/10472528/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image010.jpg
Type: image/jpeg
Size: 25061 bytes
Desc: image010.jpg
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20160405/10472528/attachment-0007.jpg>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image012.jpg
Type: image/jpeg
Size: 45182 bytes
Desc: image012.jpg
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20160405/10472528/attachment-0008.jpg>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image013.jpg
Type: image/jpeg
Size: 65351 bytes
Desc: image013.jpg
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20160405/10472528/attachment-0009.jpg>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image015.jpg
Type: image/jpeg
Size: 56732 bytes
Desc: image015.jpg
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20160405/10472528/attachment-0010.jpg>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image016.jpg
Type: image/jpeg
Size: 52193 bytes
Desc: image016.jpg
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20160405/10472528/attachment-0011.jpg>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image020.jpg
Type: image/jpeg
Size: 36569 bytes
Desc: image020.jpg
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20160405/10472528/attachment-0012.jpg>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image021.png
Type: image/png
Size: 1425 bytes
Desc: image021.png
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20160405/10472528/attachment-0003.png>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image022.png
Type: image/png
Size: 1483 bytes
Desc: image022.png
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20160405/10472528/attachment-0004.png>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image027.jpg
Type: image/jpeg
Size: 802 bytes
Desc: image027.jpg
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20160405/10472528/attachment-0013.jpg>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image028.png
Type: image/png
Size: 1306 bytes
Desc: image028.png
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20160405/10472528/attachment-0005.png>


More information about the CWB mailing list