<html>
  <head>
    <meta http-equiv="Content-Type" content="text/html; charset=utf-8">
  </head>
  <body text="#000000" bgcolor="#FFFFFF">
    If you need to list the attribute values directly from the vertical
    file, you can use grep:<br>
    <br>
        grep -oP '(?&lt;=title=").*?(?=")' vss.vrt<br>
    <br>
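    Where grep's -P option is not available, the same extraction can be
    sketched in Python. This is only an illustration and not part of
    CWB: the function name list_attribute_values is made up here, and it
    assumes double-quoted attribute values and structural tags that
    start at the beginning of a line, as in vss.vrt. Unlike the grep
    command, it only inspects tag lines and drops repeated values:<br>
    <pre>
```python
import re

def list_attribute_values(lines, attr="title"):
    """Return the distinct values of `attr` found in the XML tags of a
    vertical (.vrt) file, in order of first appearance.

    Hypothetical helper: assumes double-quoted attribute values and
    tags that start at the beginning of a line.
    """
    # Require whitespace before the name so "title" does not match "subtitle".
    pattern = re.compile(r'\s' + re.escape(attr) + r'="([^"]*)"')
    seen = {}
    for line in lines:
        # Skip token lines; only structural tags carry attributes.
        if not line.startswith("<"):
            continue
        for value in pattern.findall(line):
            seen.setdefault(value, None)  # dict preserves insertion order
    return list(seen)
```
    </pre>
    It accepts any iterable of lines, for example an open file handle
    for vss.vrt (the tutorial's sample declares ISO-8859-1 encoding).<br>
    <br>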
    David<br>
    <br>
    <div class="moz-cite-prefix">On 11/30/2017 01:49 PM, Martin
      Hammarstedt wrote:<br>
    </div>
    <blockquote type="cite"
      cite="mid:fd22e43b-1a32-7081-8c00-cdcf5409426f@gu.se">
      <meta http-equiv="Content-Type" content="text/html; charset=utf-8">
      <p>Hi,</p>
      <p>You can use the cwb-scan-corpus tool, like this:</p>
      <p>    cwb-scan-corpus CORPUS story_title</p>
      <p>Best regards,<br>
        Martin<br>
      </p>
      <br>
      <div class="moz-cite-prefix">On 2017-11-30 13:38, Hugo SG wrote:<br>
      </div>
      <blockquote type="cite"
cite="mid:CAMSkHSXUTeTDJUhbfZ3Y4rqqH4qviCnZQXNOCcHG72=3w__fSg@mail.gmail.com">
        <div dir="ltr">
          <div>Dear all,<br>
            <br>
          </div>
          I would like to know if there is a way to list all the values
          of a specific attribute. <br>
          <br>
          I mean, using one example of the Corpus Encoding Tutorial
          (Version 3.4) <a
            href="http://cwb.sourceforge.net/files/CWB_Encoding_Tutorial.pdf"
            moz-do-not-send="true">[0]</a>, is there any way to list all
          the values of the <b>title </b>attribute of the corpus
          shown in the vss.vrt file (page 6)?<br>
          <br>
          <i><br>
          </i>
          <div style="margin-left:40px"><i>&lt;?xml version="1.0"
              encoding="ISO-8859-1" standalone="yes" ?&gt;<br>
            </i></div>
          <div style="margin-left:40px"><i><br>
            </i></div>
          <div style="margin-left:40px"><i>&lt;!-- A Thrilling
              Experience --&gt;</i></div>
          <div style="margin-left:40px"><i><br>
            </i></div>
          <div style="margin-left:40px"><i>&lt;story num="4"<b> title="A
                Thrilling Experience"</b>&gt;<br>
              &lt;p&gt;<br>
              &lt;s&gt;<br>
              Tick NN tick<br>
              . SENT .<br>
              &lt;/s&gt;<br>
              &lt;s&gt;<br>
              A DT a<br>
              clock NN clock<br>
              . SENT .<br>
              &lt;/s&gt;<br>
              &lt;s&gt;<br>
              Tick VB tick<br>
              , , ,<br>
              tick VB tick<br>
              . SENT .<br>
              &lt;/s&gt;<br>
              &lt;/p&gt;<br>
              ...<br>
              &lt;/story&gt;<br>
            </i></div>
          <br>
          <br>
          <div>I would like to know this because I need to check which
            documents are already present in my corpus. The document
            identifiers are encoded as an attribute.</div>
          <div><br>
          </div>
          <div> I understand that this may not be possible, since I
            should already know the values beforehand, but I am asking just in case.<br>
          </div>
          <div><br>
            <div>
              <div>Thank you in advance.<br>
                <br>
                Best regards,<br>
              </div>
              Hugo</div>
            <div><br>
            </div>
            <div>[0] : <a
                href="http://cwb.sourceforge.net/files/CWB_Encoding_Tutorial.pdf"
                moz-do-not-send="true">http://cwb.sourceforge.net/files/CWB_Encoding_Tutorial.pdf</a>
              <br>
            </div>
            <div><br>
            </div>
            <div><br>
            </div>
            <div>-- <br>
              <div class="gmail_signature">
                <div>
                  <div>
                    <div>
                      <div>
                        <div><span style="color:rgb(153,153,153)">Hugo
                            Sanjurjo González<br>
                            Personal Investigador en Formación<br>
                            Área de Ingeniería de Sistemas y Automática<br>
                            Dep. Ingeniería Eléctrica y de Sistemas y
                            Automática<br>
                            <br>
                          </span></div>
                        <div><span style="color:rgb(153,153,153)">Facultad
                            de Filosofía y Letras </span><span><span
                              style="color:rgb(153,153,153)">- Dep.
                              Filología Moderna - Despacho 320<br>
                              Campus de Vegazana s/n 24071</span></span></div>
                        <div><span style="color:rgb(153,153,153)">Universidad
                            de León<br>
                            León<br>
                            <br>
                            Tel.<span style="color:rgb(0,0,255)"><u>+34
                                987 291088</u></span></span></div>
                      </div>
                    </div>
                  </div>
                </div>
              </div>
            </div>
          </div>
        </div>
        <br>
        <fieldset class="mimeAttachmentHeader"></fieldset>
        <br>
        <pre wrap="">_______________________________________________
CWB mailing list
<a class="moz-txt-link-abbreviated" href="mailto:CWB@sslmit.unibo.it" moz-do-not-send="true">CWB@sslmit.unibo.it</a>
<a class="moz-txt-link-freetext" href="http://liste.sslmit.unibo.it/mailman/listinfo/cwb" moz-do-not-send="true">http://liste.sslmit.unibo.it/mailman/listinfo/cwb</a>
</pre>
      </blockquote>
      <br>
      <br>
    </blockquote>
    <br>
  </body>
</html>