[CWB] Generating list of POS tags used in a corpus

Hardie, Andrew a.hardie at lancaster.ac.uk
Sun Jul 22 12:19:10 CEST 2012


cwb-lexdecode -sf -r /your/registry -P name-of-attribute YOUR-CORPUS 
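
In that command, -P selects the positional attribute to list, -r gives the registry directory, -f prints each tag's frequency and -s sorts the list alphabetically. If you would rather see the most frequent tags first, a rough sketch is to drop -s and sort the output numerically instead. The helper name rank_tags is made up for illustration, and it assumes each output line starts with the frequency followed by the tag:

```shell
# Hypothetical helper: reads "frequency<whitespace>tag" lines on stdin
# and prints them with the highest frequency first.
rank_tags() {
    sort -nr
}

# Intended use (needs CWB installed and a real registry/corpus):
#   cwb-lexdecode -f -r /your/registry -P name-of-attribute YOUR-CORPUS | rank_tags
```

Piping the result through "head -n 20" would limit the list to the twenty most frequent tags.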

best

Andrew.

-----Original Message-----
From: cwb-bounces at sslmit.unibo.it [mailto:cwb-bounces at sslmit.unibo.it] On Behalf Of Josep M. Fontana
Sent: 22 July 2012 10:41
To: cwb at sslmit.unibo.it
Subject: [CWB] Generating list of POS tags used in a corpus

Hi,

I was wondering whether there is a CQP command (or some other way in
CWB) to generate a list of the POS tags that have been used in a corpus.
I am exploiting some corpora for which I don't have any documentation, and I would like to have a clear idea of which tagset has been used and what has been encoded in the different tags.

I've searched through the latest versions of the CQP language tutorial and the Corpus encoding tutorial, but I haven't been able to find anything relevant. Any help will be greatly appreciated.

Josep M.
_______________________________________________
CWB mailing list
CWB at sslmit.unibo.it
http://devel.sslmit.unibo.it/mailman/listinfo/cwb

