[CWB] Issues when installing metadata restrictions

Timperley, Matt m.timperley at lancaster.ac.uk
Tue Apr 5 11:14:03 CEST 2016


Hi Giorgina,

Sorry if I'm mistaken about your issue but it looks to me like there is an angle bracket missing from the end of the first line. Just after lang="French". I think it should be: lang="French">.

I hope this helps,
Matt
________________________________________
From: cwb-bounces at sslmit.unibo.it [cwb-bounces at sslmit.unibo.it] on behalf of Giorgina Cerutti Benitez [Giorgina.Cerutti at unige.ch]
Sent: 05 April 2016 09:49
To: cwb at sslmit.unibo.it
Subject: [CWB] Issues when installing metadata restrictions

Hello everyone,

I am writing to you because we are having issues when installing our metadata classifications. We are currently testing metadata installation with corpus T39 (figure 1). Even though we manage to specify our s-attributes (see figure 2), only three of them are recognized as classifications when installing the metadata from the embedded XML (see figure 3); and in other tests none of them is recognized at all (see figure 4).

[cid:image010.jpg at 01D18E88.2B7242E0]
Figure 1:

<text id="test13" period="1" organization="un" category="Monitoring and application" genre="legislative" lang="French"
this
is
a
test
</text>
<text id="test25" period="2" organization="eu" category="Lawmaking" genre="monitoring" lang="Spanish">
this
is
also
a
test
.
thas
worked
</text>
<text id="test26" period="3" organization="wto" category="Adjudication" genre="adjudication" lang="English">
thas
thus
shalala
muajajaja
</text>

[cid:image012.jpg at 01D18E88.2B7242E0]
Figure 2

[cid:image013.jpg at 01D18E88.2B7242E0]
Figure 3

[cid:image015.jpg at 01D18E88.2B7242E0]
Figure 4

We have then tried to install metadata by specifying the desired settings by hand (see figure 5), but we encounter an error (see figure 6).

[cid:image016.jpg at 01D18E88.2B7242E0]
Figure 5

[cid:image020.jpg at 01D18E88.2B7242E0]
Figure 6

The data source you specified for the text metadata contains badly-formatted text ID codes, as follows: <strong> '.'; '</text>'; '<text id="test13" period="1" organization="un" category="Monitoring and application" genre="legislative" lang="French"'; '<text id="test25" period="2" organization="eu" category="Lawmaking" genre="monitoring" lang="Spanish">'; '<text id="test26" period="3" organization="wto" category="Adjudication" genre="adjudication" lang="English">';</strong> (text ids can only contain unaccented letters, numbers, and underscore).

Since we cannot identify the error, we were wondering if any of you has had the same problem (I couldn’t find any thread or information in the manual about this). I would also be grateful if you could tell us if this is a bug or if the system only accepts three classifications.

Thank you very much.

Regards,


Giorgina Cerutti
Assistant
Department of Translation – Spanish Unit
Faculty of Translation and Interpreting
University of Geneva
Office 6242 – Uni Mail
40 bd du Pont d'Arve
CH-1211 Genève 4
[cid:image007.png at 01D1127F.0F2785D0]<https://www.linkedin.com/pub/giorgina-cerutti/20/337/7a0/en>[Facebook]<https://www.facebook.com/UNES.FTI.UNIGE>[Twitter]<https://twitter.com/giorginacerutti>[Transius_EN]<http://transius.unige.ch/en/>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image010.jpg
Type: image/jpeg
Size: 25061 bytes
Desc: image010.jpg
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20160405/2d1b0933/attachment-0007.jpg>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image012.jpg
Type: image/jpeg
Size: 45182 bytes
Desc: image012.jpg
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20160405/2d1b0933/attachment-0008.jpg>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image013.jpg
Type: image/jpeg
Size: 65351 bytes
Desc: image013.jpg
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20160405/2d1b0933/attachment-0009.jpg>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image015.jpg
Type: image/jpeg
Size: 56732 bytes
Desc: image015.jpg
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20160405/2d1b0933/attachment-0010.jpg>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image016.jpg
Type: image/jpeg
Size: 52193 bytes
Desc: image016.jpg
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20160405/2d1b0933/attachment-0011.jpg>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image020.jpg
Type: image/jpeg
Size: 36569 bytes
Desc: image020.jpg
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20160405/2d1b0933/attachment-0012.jpg>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image021.png
Type: image/png
Size: 1425 bytes
Desc: image021.png
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20160405/2d1b0933/attachment-0003.png>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image022.png
Type: image/png
Size: 1483 bytes
Desc: image022.png
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20160405/2d1b0933/attachment-0004.png>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image027.jpg
Type: image/jpeg
Size: 802 bytes
Desc: image027.jpg
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20160405/2d1b0933/attachment-0013.jpg>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image028.png
Type: image/png
Size: 1306 bytes
Desc: image028.png
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20160405/2d1b0933/attachment-0005.png>


More information about the CWB mailing list