<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
</head>
<body text="#000000" bgcolor="#FFFFFF">
Should you need to list them based on the vertical file, you can use
grep:<br>
<br>
grep -oP '(?<=title=").*?(?=")' vss.vrt<br>
<br>
David<br>
<br>
<div class="moz-cite-prefix">On 11/30/2017 01:49 PM, Martin
Hammarstedt wrote:<br>
</div>
<blockquote type="cite"
cite="mid:fd22e43b-1a32-7081-8c00-cdcf5409426f@gu.se">
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<p>Hi,</p>
<p>You can use the cwb-scan-corpus tool, like this:</p>
<p> cwb-scan-corpus CORPUS story_title</p>
<p>Best regards,<br>
Martin<br>
</p>
<br>
<div class="moz-cite-prefix">On 2017-11-30 13:38, Hugo SG wrote:<br>
</div>
<blockquote type="cite"
cite="mid:CAMSkHSXUTeTDJUhbfZ3Y4rqqH4qviCnZQXNOCcHG72=3w__fSg@mail.gmail.com">
<div dir="ltr">
<div>Dear all,<br>
<br>
</div>
I would like to know if there is a way to list all the values
of a specific attribute. <br>
<br>
I mean, using one example of the Corpus Encoding Tutorial
(Version 3.4) <a
href="http://cwb.sourceforge.net/files/CWB_Encoding_Tutorial.pdf"
moz-do-not-send="true">[0]</a>, is there any way to list all
the values of the <b>title </b>attribute of the corpus
showed in vss.vrt file (page 6) ?<br>
<br>
<i><br>
</i>
<div style="margin-left:40px"><i><?xml version="1.0"
encoding="ISO-8859-1" standalone="yes" ?><br>
</i></div>
<div style="margin-left:40px"><i><br>
</i></div>
<div style="margin-left:40px"><i><!-- A Thrilling
Experience --></i></div>
<div style="margin-left:40px"><i><br>
</i></div>
<div style="margin-left:40px"><i><story num="4"<b> title="A
Thrilling Experience"</b>><br>
<p><br>
<s><br>
Tick NN tick<br>
. SENT .<br>
</s><br>
<s><br>
A DT a<br>
clock NN clock<br>
. SENT .<br>
</s><br>
<s><br>
Tick VB tick<br>
, , ,<br>
tick VB tick<br>
. SENT .<br>
</s><br>
</p><br>
...<br>
</story><br>
</i></div>
<br>
<br>
<div>I would like to know it because I need to check which
documents are already present in my corpus. Identifiers of
the documents are encoded as an attrinute.</div>
<div><br>
</div>
<div> I understand that maybe it is not possible because I
should know that before, but just in case.<br>
</div>
<div><br>
<div>
<div>Thank you in advance.<br>
<br>
Best regards,<br>
</div>
Hugo</div>
<div><br>
</div>
<div>[0] : <a
href="http://cwb.sourceforge.net/files/CWB_Encoding_Tutorial.pdf"
moz-do-not-send="true">http://cwb.sourceforge.net/files/CWB_Encoding_Tutorial.pdf</a>
<br>
</div>
<div><br>
</div>
<div><br>
</div>
<div>-- <br>
<div class="gmail_signature">
<div>
<div>
<div>
<div>
<div><span style="color:rgb(153,153,153)">Hugo
Sanjurjo González<br>
Personal Investigador en Formación<br>
Área de Ingeniería de Sistemas y Automática<br>
Dep. Ingeniería Eléctrica y de Sistemas y
Automática<br>
<br>
</span></div>
<div><span style="color:rgb(153,153,153)">Facultad
de Filosofía y Letras </span><span><span
style="color:rgb(153,153,153)">- Dep.
Filología Moderna - Despacho 320<br>
Campus de Vegazana s/n 24071</span></span></div>
<div><span style="color:rgb(153,153,153)">Universidad
de León<br>
León<br>
<br>
Tel.<span style="color:rgb(0,0,255)"><u>+34
987 291088</u></span></span></div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
</div>
<br>
<fieldset class="mimeAttachmentHeader"></fieldset>
<br>
<pre wrap="">_______________________________________________
CWB mailing list
<a class="moz-txt-link-abbreviated" href="mailto:CWB@sslmit.unibo.it" moz-do-not-send="true">CWB@sslmit.unibo.it</a>
<a class="moz-txt-link-freetext" href="http://liste.sslmit.unibo.it/mailman/listinfo/cwb" moz-do-not-send="true">http://liste.sslmit.unibo.it/mailman/listinfo/cwb</a>
</pre>
</blockquote>
<br>
<br>
<fieldset class="mimeAttachmentHeader"></fieldset>
<br>
<pre wrap="">_______________________________________________
CWB mailing list
<a class="moz-txt-link-abbreviated" href="mailto:CWB@sslmit.unibo.it">CWB@sslmit.unibo.it</a>
<a class="moz-txt-link-freetext" href="http://liste.sslmit.unibo.it/mailman/listinfo/cwb">http://liste.sslmit.unibo.it/mailman/listinfo/cwb</a>
</pre>
</blockquote>
<br>
</body>
</html>