<div class="markdown_content"><ul>
<li><strong>labels</strong>: --> CQPweb</li>
<li><strong>assigned_to</strong>: Andrew Hardie</li>
</ul>
<hr/>
<p><strong> <a class="alink" href="https://sourceforge.net/p/cwb/feature-requests/57/">[feature-requests:#57]</a> CQPweb: Apply annotation and XML templates to pre-indexed CWB corpus</strong></p>
<p><strong>Status:</strong> open<br/>
<strong>Group:</strong> TODO-3.5<br/>
<strong>Labels:</strong> CQPweb <br/>
<strong>Created:</strong> Tue Jul 30, 2019 10:45 AM UTC by Stefan Evert<br/>
<strong>Last Updated:</strong> Tue Jul 30, 2019 10:45 AM UTC<br/>
<strong>Owner:</strong> Andrew Hardie</p>
<p>When Installating a corpus that's already indexed in CWB, corpus annotation (p-attributes) and XML structure (s-attributes) have to be set up manually. This process would be a lot less cumbersome if annotation and XML templates could be used to assign names and data types.</p>
<p>The implementation should be relatively easy: it would look up relevant information in the template by attribute name and either ignore everything else or complain in the case of a mismatch.</p>
<p>Motivation: Sometimes indexing a corpus via the Web admin interface is undesirable, in particular (i) for very large corpora (>> 100 M words), where uploading the .vrt file may not be possible and indexing will take a very long time; and (ii) if the corpus does not exist as a .vrt file in the first place, i.e. if annotation (both p-attributes and s-attributes) has been added to the CWB-indexed version (and CWB cannot export the .vrt format expected by CQPweb).</p>
<hr/>
<p>Sent from sourceforge.net because cwb@sslmit.unibo.it is subscribed to <a href="https://sourceforge.net/p/cwb/feature-requests/">https://sourceforge.net/p/cwb/feature-requests/</a></p>
<p>To unsubscribe from further messages, a project admin can change settings at <a href="https://sourceforge.net/p/cwb/admin/feature-requests/options.">https://sourceforge.net/p/cwb/admin/feature-requests/options.</a> Or, if this is a mailing list, you can unsubscribe from the mailing list.</p></div>