<div dir="ltr">Hello everyone,<div><br></div><div>I am using CQPwebinabox and I have indexed a Traditonal Chinese corpus called "canton1" by using two commands:</div><div><br></div><div><div>sudo cwb-encode -d /usr/local/corpora/data/canton1 -f /home/user/Desktop/corpora/canton1/canton1.vrt -R /usr/local/share/cwb/registry/canton1 -c utf8 -xsB -P pos -P lemma -S s:0 -S text:0+id</div><div><br></div><div><div>sudo cwb-make -V CANTON1</div></div><div><br></div><div>After that, I install the corpus onto CQPweb. Most of the thing are correct. However, the total number of corpus texts is as same as the total words in all corpus texts.</div><div><br></div><div>Excerpt of the vrt file of "canton1"(those spaces are tab):</div><div><br></div><div><div><text id="T01"></div><div><s></div><div>中環<span style="white-space:pre">        </span>N<span style="white-space:pre">        </span>中環</div><div>保育<span style="white-space:pre">        </span>V<span style="white-space:pre">        </span>保育</div><div>奇觀<span style="white-space:pre">        </span>N<span style="white-space:pre">        </span>奇觀</div><div></s></div><div></text></div></div><div><br></div><div>Excerpt of metadata file of "canton1":</div><div><br></div><div><div>T01<span style="white-space:pre">        </span>beginning</div><div>T02<span style="white-space:pre">        </span>ending</div></div><div><br></div><div>How to fix this problem? Thank you</div><div><br></div><div>Best regards from Hong Kong,</div><div>Lai</div><br>
</div></div>