<html>

  <head>

    <meta content="text/html; charset=KOI8-R" http-equiv="Content-Type">

  </head>

  <body text="#000000" bgcolor="#FFFFFF">

    Hi Igor,<br>

    <br>

    One way to do this in CWB would be to split your<br>

    pos and lemma values in several positionnal attributes.<br>

    For example, in this way :<br>

    form    lemma1    lemma2    pos1    pos2    agr_set1    agr_set2   

    sem_set1    sem_set2<br>

    <br>

    And force your queries to work coherently with<br>

    corresponding attribute sets.<br>

    Your example query would become :<br>

    <span style="white-space: pre;">[lemma1=".*valuelemma.*" &amp;

      pos1=".*valuepos.*"]</span><br>

    <br>

    What do you think ?<br>

    <br>

    Best,<br>

    Serge<br>

    <br>

    <br>

    le 10/07/2012 15:20 Selon Игорь Шалыминов:<br>

    <span style="white-space: pre;">&gt; Hello!<br>

      &gt; <br>

      &gt; My name is Igor, I'm a developer of Russian National Corpus

      search<br>

      &gt; engine, and I'm trying to get it working with CWB. The main

      problem I<br>

      &gt; have is the following: RNC texts are annotated ambiguously

      for the<br>

      &gt; most part, and each word has got sets of lemmas, grammar and

      semantic<br>

      &gt; features, just as the GERMAN-LAW example in the tutorial.

      Suppose we<br>

      &gt; have a word:<br>

      &gt; <br>

      &gt; word lemma pos agr<br>

      &gt; sem <br>

      &gt;

------------------------------------------------------------------------------------------------------------------------<br>

      &gt;<br>

      &gt; </span><br>

    form    |lemma1|lemma2|    |pos1|pos2|    |agr_set1|agr_set2|   

    |sem_set1|sem_set2|<br>

    <span style="white-space: pre;">&gt; <br>

      &gt; And, if I type the query:<br>

      &gt; <br>

      &gt; [(lemma contains "lemma1") and (pos contains "pos2")]<br>

      &gt; <br>

      &gt; I will get that very word matched, and this will be a mistake

      in my<br>

      &gt; case since there is only one strict correspondence: "lemma1

      -&gt; pos1<br>

      &gt; -&gt; arg_set1 -&gt; sem_set1", and the same for lemma2.<br>

      &gt; <br>

      &gt; So, my question, is there an out of the box possibility of

      performing<br>

      &gt; such queries (i.e., controlling positions of corresponding

      sets while<br>

      &gt; matching attribute sets with 'contains'), or it has to be<br>

      &gt; implemented?<br>

      &gt; <br>

      &gt; -- Best Regards, Igor Shalyminov <br>

      &gt; _______________________________________________ CWB mailing

      list <br>

      &gt; <a class="moz-txt-link-abbreviated" href="mailto:CWB@sslmit.unibo.it">CWB@sslmit.unibo.it</a> <br>

      &gt; <a class="moz-txt-link-freetext" href="http://devel.sslmit.unibo.it/mailman/listinfo/cwb">http://devel.sslmit.unibo.it/mailman/listinfo/cwb</a></span><br>

    <br>

    -- <br>

    Dr. Serge Heiden, <a class="moz-txt-link-abbreviated" href="mailto:slh@ens-lyon.fr">slh@ens-lyon.fr</a>, <a class="moz-txt-link-freetext" href="http://textometrie.ens-lyon.fr">http://textometrie.ens-lyon.fr</a><br>

    ENS de Lyon/CNRS - ICAR UMR5191, Institut de Linguistique Fran&ccedil;aise<br>

    15, parvis Ren&eacute; Descartes 69342 Lyon BP7000 Cedex, t&eacute;l.

    +33(0)622003883<br>

    <br>

    <br>

    <br>

    <br>

  </body>

</html>