[CWB] Word list

Teresa Molés Cases teresamoles at gmail.com
Tue Jul 25 20:08:09 CEST 2017


Thanks a lot for all your help!

Best,

Teresa

> El 25 jul 2017, a las 16:11, Stefan Evert <stefanML at collocations.de> escribió:
> 
> 
>> On 25 Jul 2017, at 09:57, Andrés Chandía <andres at chandia.net> wrote:
>> 
>> No sorry, I didn't mean at the cqp interface, but going by terminal to a linux shell..
> 
> With a CWB-encoded corpus, it's fastest to do 
> 
> 	cwb-lexdecode -f -s CORPUS
> 
> or e.g. for a lemma attribute
> 
> 	cwb-lexdecode -P lemma -f -s CORPUS
> 
> If you need more complex frequency counts, e.g. word/pos combinations, you should take a look at the cwb-scan-corpus tool described in the CWB Corpus Encoding Tutorial.
> 
> 
> In principle, it's possible to obtain a full frequency count in CQP with a dummy query that matches every single token:
> 
> 	> AllWords = [];
> 	> group AllWords match word;
> 
> but this is extremely inefficient and uses huge amounts of memory, so don't do that. :-)
> 
> Best,
> Stefan





More information about the CWB mailing list