<html><head><meta http-equiv="Content-Type" content="text/html; charset=UTF-8" /></head><body style='font-size: 10pt; font-family: Verdana,Geneva,sans-serif'>
<p>There are two separate tasks here: format conversion and the addition of annotations (in the example, part of speech and lemma). Tools like TreeTagger do both in one run.</p>
<p>Adding the &lt;s&gt; structure is another, independent step; we use a simple script we wrote ourselves to add it.</p>
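<p>To give a rough idea of the format conversion and the sentence markup, here is a minimal Python sketch. It is not our in-house script. The function name <code>to_vertical</code>, the regular expressions, and the naive sentence splitting on ".", "!" and "?" are assumptions made for illustration only. It produces just the word column; part of speech and lemma would be further tab-separated columns on each token line, added by a tagger.</p>

```python
import re

# Naive tokenizer (assumption): a run of word characters, or one punctuation mark.
TOKEN_RE = re.compile(r"\w+|[^\w\s]")
# Naive sentence boundary (assumption): whitespace following '.', '!' or '?'.
SENT_SPLIT_RE = re.compile(r"(?<=[.!?])\s+")


def to_vertical(text, tag="s"):
    """Return text as one token per line, each sentence wrapped in <tag>...</tag>."""
    lines = []
    for sentence in SENT_SPLIT_RE.split(text.strip()):
        tokens = TOKEN_RE.findall(sentence)
        if not tokens:
            continue  # skip empty chunks, e.g. for empty input
        lines.append(f"<{tag}>")
        lines.extend(tokens)
        lines.append(f"</{tag}>")
    # One line per entry, each terminated by a newline.
    return "".join(line + "\n" for line in lines)


if __name__ == "__main__":
    print(to_vertical("This is a test. Is it working?"), end="")
```

<p>A real corpus needs a proper tokenizer and sentence splitter for its language: this sketch breaks on abbreviations such as "e.g." and splits "don't" into three tokens.</p>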
<p><br /></p>
<p>--Jörg Knappen</p>
<p><br /></p>
<p id="reply-intro">On 2020-11-05 10:43, YANG CHRICS wrote:</p>
<blockquote type="cite" style="padding: 0 0.4em; border-left: #1010ff 2px solid; margin: 0">
<div id="replybody1">
<div dir="ltr">Hi, I am here to ask for help again. I couldn't figure out how to install a plain-text corpus through CQPweb. Today I read the encoding manual again, and it seems that I have to convert the text to the CWB input format (one-word-per-line text, just like the picture below). I would be grateful if you could tell me how to convert plain text into the following format. Thank you.
<div><br />
<div><img src="cid:16045698345fa3caea3c4e2688259192@mx.uni-saarland.de" alt="img3.png" width="224" height="141" /></div>
</div>
</div>
</div>
<br />
<div class="pre" style="margin: 0; padding: 0; font-family: monospace">_______________________________________________<br />CWB mailing list<br /><a href="mailto:CWB@sslmit.unibo.it">CWB@sslmit.unibo.it</a><br /><a href="http://liste.sslmit.unibo.it/mailman/listinfo/cwb" target="_blank" rel="noopener noreferrer">http://liste.sslmit.unibo.it/mailman/listinfo/cwb</a></div>
</blockquote>
<p><br /></p>
</body></html>