[CWB] expanding positional attributes

Hardie, Andrew a.hardie at lancaster.ac.uk
Fri Mar 25 15:50:38 CET 2011


Hi Luigi,
 
Can you specify in a bit more detail what kind of queries you would like to be able to do but can't?
 
best
 
Andrew.

________________________________

From: cwb-bounces at sslmit.unibo.it on behalf of luigi.talamo at libero.it
Sent: Fri 25/03/2011 14:00
To: cwb at sslmit.unibo.it
Subject: [CWB] expanding positional attributes



Dear list,
I was wondering whether is possible to make a better use of the tagset
provided in an annotation.
For example, if I have two separate tags for a finite verb (VER:fin) and
infinitive verb (VER:infi), as in La Repubblica corpus, it is possible to set a
query as following:

[lemma="disegnare" & pos="VER.*"]

in order to catch both the finite and infinitive form of the verb disegnare.

We're making use of wildcards (and `colon-ed' values for representing
morphosemantic features, as introduced by Marco Baroni for this tagset) as a
workaround for a non-expandable, mono-dimensional value of the positional
attributes: it would be nice to have a nesting-capable value of these
attributes, as (AFAIK) in structural attributes. Here I'm proposing a sort of
hierarchical structure for a token-level annotation, which could turn useful to
people making, for instance, a morphological annotation, as in the example
above.
What do you think?

Best regards,

Luigi


_______________________________________________
CWB mailing list
CWB at sslmit.unibo.it
http://devel.sslmit.unibo.it/mailman/listinfo/cwb


-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/ms-tnef
Size: 4498 bytes
Desc: not available
Url : http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20110325/4ebcdaa6/attachment.bin


More information about the CWB mailing list