[CWB] Greedy token match

Hardie, Andrew a.hardie at lancaster.ac.uk
Thu May 21 11:23:33 CEST 2015


“Greediness” is not a thing in CQP; instead, there is a matching strategy, as described here:

http://cwb.sourceforge.net/files/CQP_Tutorial/node30.html
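
To illustrate why the choice of strategy matters for a query like [pos=".*_PNM"]+, here is a toy Python model. It is only a sketch and not CQP's actual implementation: the function name, the strategy labels and the tag values are all made up for illustration. The "shortest" branch stops at the first token that satisfies the query, which reproduces the shape of the output in the quoted message below (tokens 73 and 74 as separate hits); the "longest" branch extends each hit over the whole run. In CQP itself the behaviour is selected with the MatchingStrategy option covered on the tutorial page above (e.g. "set MatchingStrategy longest;"); check the tutorial for the exact semantics of each mode.

```python
import re

def find_runs(tags, pattern, strategy):
    """Toy model: return inclusive (start, end) spans for `[pos=pattern]+`.

    `strategy` is a hypothetical label, either "shortest" or "longest".
    """
    rx = re.compile(pattern)
    # Assumption: the pattern must match the whole tag, as CQP regexes do.
    hits = [rx.fullmatch(tag) is not None for tag in tags]
    spans = []
    i = 0
    while i < len(tags):
        if not hits[i]:
            i += 1
            continue
        if strategy == "shortest":
            # Stop as soon as the query is satisfied: one token per match.
            spans.append((i, i))
            i += 1
        elif strategy == "longest":
            # Extend the match over the whole run of matching tokens.
            j = i
            while j + 1 < len(tags) and hits[j + 1]:
                j += 1
            spans.append((i, j))
            i = j + 1
        else:
            raise ValueError(f"unknown strategy: {strategy}")
    return spans

# Hypothetical tags for "Sr. Alberto Martins (" (real corpus tags may differ).
TAGS = ["N", "X_PNM", "X_PNM", "PU"]

print(find_runs(TAGS, r".*_PNM", "shortest"))
print(find_runs(TAGS, r".*_PNM", "longest"))
```

With the sample tags, the "shortest" call yields one span per name token, while the "longest" call yields a single span covering both.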

best

Andrew

From: cwb-bounces at sslmit.unibo.it [mailto:cwb-bounces at sslmit.unibo.it] On Behalf Of Maarten Janssen
Sent: 21 May 2015 10:21
To: cwb at sslmit.unibo.it
Subject: [CWB] Greedy token match

Dear all,

There seems to be a problem with greedy matches on tokens in CQP. When running a query for stretches of PNM (part of name) tokens in a corpus we have, you would expect greedy repetition on tokens to return the full matching sequences, but it does not seem to, at least not in the queries I run:

TT-CRPCGS> [pos=".*_PNM"]+;
...
       73: e querem defender . O Sr. <Alberto> Martins ( PS ) : - Muito
       74:  defender . O Sr. Alberto <Martins> ( PS ) : - Muito bem ! O
...

Since tokens 73 and 74 both match, they should never have been returned as individual matches by a greedy search. Does anybody have any idea why this happens, and how to solve it?

Maarten