[CWB] Difference in token number between CQP and CQPweb
Hannah Kermes
h.kermes at mx.uni-saarland.de
Fri Feb 14 12:07:57 CET 2014
Hi,
I just noticed a difference in the token counts between CQP and CQPweb.
The corpus in CQPweb is a copy of the CQP corpus: it was encoded with
CQP on the command line and then installed in CQPweb as an
already-encoded corpus.
Token counts: 1,961,752 (CQPweb) vs. 2,076,963 (CQP), a difference of
115,211 tokens.
The discrepancy also shows up when looking at subcorpora.
Does anyone have an explanation for this?
Best
Hannah
--
Dr. Hannah Kermes
Dept. of Applied Linguistics, Interpreting and Translation (FR 4.6)
Universität des Saarlandes
Campus, Building A2.2, Room 1.07
D-66123 Saarbrücken
phone: +49-(0)681-302-70077