[CWB] Difference in token number between CQP and CQPweb

Hannah Kermes h.kermes at mx.uni-saarland.de
Fri Feb 14 12:07:57 CET 2014


Hi,

I just realized a difference in the token numbers between CQP and CQPweb.
The encoded corpus in CQPweb is a copy of the CQP corpus. The encoding 
has been performed with CQP on the command line and has been installed 
in CQPweb as an encoded corpus.

Token numbers: 1,961,752 (CQPweb); 2,076,963 (CQP)

The difference is also present if you look at subcorpora.

Does someone have an explanation for this?

Best
Hannah

-- 
Dr. Hannah Kermes
Dept. of Applied Linguistics, Interpreting and Translation (FR 4.6)
Universität des Saarlandes
Campus, Building A2.2, Room 1.07
D-66123 Saarbrücken
phone: +49-(0)681-302-70077



More information about the CWB mailing list