<html>
<head>
<meta http-equiv="Content-Type" content="text/html; charset=UTF-8">
</head>
<body>
<p>OK. Thanks. That's what the problem seems to be. The ! operator
can have scope over "words" (i.e., anything enclosed within '[ ]'),
even if the | operator is used to establish possible alternatives
for the word form. It cannot be used, however, with "word" groups
enclosed within '( )'. <br>
<br>
This makes sense at a conceptual level. <br>
</p>
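<p>To make the single-token case concrete, here is a rough Python sketch (not CQP, and not how CWB implements it) of what the manual's pattern [(lemma="go") & !(word="went"%c | word="gone"%c)] does: the negation is just a Boolean test on the attributes of one token. The dictionary-based token representation and the function name are my own assumptions for illustration.</p>

```python
# Hypothetical token representation: each token is a dict of attributes.
def matches_manual_pattern(token):
    """Mimic [(lemma="go") & !(word="went"%c | word="gone"%c)] on ONE token."""
    word = token["word"].lower()  # %c asks for case-insensitive matching
    return token["lemma"] == "go" and not (word == "went" or word == "gone")


tokens = [
    {"word": "goes", "lemma": "go"},
    {"word": "Went", "lemma": "go"},
    {"word": "gone", "lemma": "go"},
    {"word": "going", "lemma": "go"},
    {"word": "walk", "lemma": "walk"},
]

# Keep only the tokens that satisfy the whole Boolean expression.
hits = [t["word"] for t in tokens if matches_manual_pattern(t)]
print(hits)
```

<p>Because everything is evaluated against a single corpus position, there is never any doubt about how much text the negated part covers.</p>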
<p>Once we have the option of using parentheses to create what are
in effect multi-word expressions in CQP searches, though, I think
it would make sense to also be able to use the ! operator to
handle those expressions as if they were words. I can see how this
could be enormously useful for people working with corpora. <br>
</p>
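<p>As a rough illustration of what negating a group would have to mean, here is a small Python sketch (again not CQP; the function names and the plain word-list corpus are assumptions of mine). It treats the negated group as a negative look-ahead: a hit is kept only if none of the excluded word sequences starts right after it.</p>

```python
import re


def is_excluded_at(tokens, i, excluded):
    """True if any excluded word sequence starts at position i."""
    return any(tokens[i:i + len(seq)] == seq for seq in excluded)


def find_hits(tokens, head_pattern, excluded):
    """Positions where head_pattern matches a whole token and
    no excluded sequence begins at the following position."""
    head = re.compile(head_pattern)
    return [
        i for i, tok in enumerate(tokens)
        if head.fullmatch(tok) and not is_excluded_at(tokens, i + 1, excluded)
    ]


# Excluded continuations may have different lengths (one or two words here).
excluded = [["en", "tierra"], ["entierra"]]
head_pattern = r"f[ei]rid.*|muert[ao].*"

print(find_hits("cayo muerto en tierra".split(), head_pattern, excluded))
print(find_hits("cayo muerto en batalla".split(), head_pattern, excluded))
```

<p>With these toy inputs, "cayo muerto en tierra" yields no hit, while "cayo muerto en batalla" keeps the hit at "muerto".</p>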
<p>JM<br>
</p>
<div class="moz-cite-prefix">On 29/07/2020 11:52, "Andrés Chandía"
wrote:<br>
</div>
<blockquote type="cite"
cite="mid:38f637f4358a8c12d605db1b7144704c.squirrel@mail.chandia.net">
<div>No, I didn't suggest what you say; I was just calling your
attention to the difference
between your RegEx and the one from the manual...</div>
<div><br>
</div>
<div>Manual<br>
</div>
<div>[(lemma="go") & !(word="went"%c | word="gone"%c)];</div>
<div> </div>
<div><br>
</div>
<div>Yours</div>
<div>([word="en"][word="tierra"])</div>
<div><br>
</div>
<div>To make yours match the one from the manual, the regex should be:
!(word="en"
word="tierra")</div>
<div><br>
</div>
<div><br>
</div>
<div><br>
</div>
On Tue, 28 July 2020, 23:42, Josep M. Fontana wrote:<br>
> Thanks, Maarten, and to everybody who responded.<br>
> <br>
> Yes. What you say makes total sense. I had assumed that, since in essence<br>
> the complex patterns involving grouped expressions are all within a<br>
> single pair of parentheses '( )', which allows one to treat them as if they were a<br>
> single word within '[ ]', the ! operator would work the same way it works<br>
> when it is associated with any expression inside square brackets.<br>
> <br>
> Andrés seems to suggest enclosing everything within square brackets, but<br>
> that doesn't work. In principle it shouldn't work, because the convention<br>
> is that square brackets enclose a word. So it makes sense that we can't<br>
> do that. Once we are allowed to use parentheses to form groups of sequences of<br>
> words that are treated essentially as a single unit, however, why can't<br>
> we use the same operators we use with single expressions enclosed within<br>
> '[ ]'? I don't see why it shouldn't be possible.<br>
> <br>
> If no one has asked this before, it must mean that there are not that<br>
> many people who would need to do this kind of search, and of course I<br>
> have no idea how hard this might be to implement. Having said this,<br>
> however, I certainly think that this would be very useful. I find the<br>
> idea of doing a diff, as Andrew suggests, a bit impractical.<br>
> <br>
> JM<br>
> <br>
> On 28/07/2020 22:41, Maarten Janssen wrote:<br>
>> I did not look in detail at the implementation in CWB -
but if these were normal regular<br>
>> expressions, your query<br>
>><br>
>> [(word="f[ei]rid.*")|(word="muert[ao].*")] !(([(pos="S.*") & (word="d.*")][word=".*el"][word="ca[buv]allo.*"])|[word="entierra"]|([word="en"][word="tierra"]))<br>
>><br>
>> should match<br>
>><br>
>> cayo *muerto* en tierra<br>
>><br>
>> Namely, "muerto" for the first part of the query, and nothing for the second. There is no<br>
>> indication of how long the second part should be; add a word requirement after it and it<br>
>> even becomes ill-defined what you would mean by it. It would be different if you were<br>
>> looking for a specific word after it that cannot be one of several, like<br>
>> [!(word="en ?tierra" | word="ca[buv]allo")], but your second part has a variable word length. What you<br>
>> are looking for is a negative look-ahead, which you cannot do by negating the parts of what<br>
>> you are looking for, and given how query matches work in CWB I would be very surprised if<br>
>> there is a negative look-ahead...<br>
>> _______________________________________________<br>
>> CWB mailing list<br>
>> <a class="moz-txt-link-abbreviated" href="mailto:CWB@sslmit.unibo.it">CWB@sslmit.unibo.it</a><br>
>>
<a class="moz-txt-link-freetext" href="http://liste.sslmit.unibo.it/mailman/listinfo/cwb">http://liste.sslmit.unibo.it/mailman/listinfo/cwb</a><br>
<br>
<br>
<br>
_______________________<br>
andrés
chandía<br>
</blockquote>
</body>
</html>