<div dir="ltr"><div>Thanks a lot, Stefan! I will upgrade CQP until the end of the year, and will let you know if I get any problems.</div><div>Diana<br></div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">Stefan Evert <<a href="mailto:stefanML@collocations.de">stefanML@collocations.de</a>> escreveu no dia terça, 8/12/2020 à(s) 15:48:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">In addition to what Andrew explained, you should also (when you can afford the time :) …<br>
<br>
> Thanks a lot. However (maybe this is because I am using a version of cqp which is too old? 3.0.0)<br>
<br>
1) Get a current version of CWB (3.4.27 at the moment). There are a lot of improvements and bug fixes that haven't been ported back to the old 3.0 branch.<br>
<br>
You'll need to check CWB out from the SVN repository and compile from source, but that's not too difficult (internal note: I guess we should provide some instructions on the Web site). Unless you have Ubuntu 20.04 because the install script is broken there.<br>
<br>
> The corpus is encoded with eg.<br>
> <mwe lema=one=example=of lema pos=N><br>
<br>
2) Encode your XML tags as proper XML, i.e. with attribute values quoted:<br>
<br>
<mwe lema="some noun" pos="N"><br>
…<br>
</mwe><br>
<br>
> and created with the flag -V mwe.<br>
<br>
3) Encode with -S mwe:0+lema+pos<br>
<br>
This will split out the annotations on <mwe> tags into separate attributes mwe_lema and mwe_pos; the ":0" checks that your open and close tags are properly balanced and will ignore any nested <mwe> regions (with warnings).<br>
<br>
> However, when I query<br>
> [ ] :: match.mwe="/.*/";<br>
<br>
Then you can directly match lemma and pos<br>
<br>
… :: match.mwe_lema=".+ness" & match.mwe_pos = "N";<br>
<br>
Best,<br>
Stefan<br>
<br>
_______________________________________________<br>
CWB mailing list<br>
<a href="mailto:CWB@sslmit.unibo.it" target="_blank">CWB@sslmit.unibo.it</a><br>
<a href="http://liste.sslmit.unibo.it/mailman/listinfo/cwb" rel="noreferrer" target="_blank">http://liste.sslmit.unibo.it/mailman/listinfo/cwb</a><br>
</blockquote></div>