<html><head><meta http-equiv="Content-Type" content="text/html; charset=UTF-8" /></head><body style='font-size: 10pt; font-family: Verdana,Geneva,sans-serif'>
<div class="pre" style="margin: 0; padding: 0; font-family: monospace">Dear all,</div>
<div class="pre" style="margin: 0; padding: 0; font-family: monospace"> </div>
<div class="pre" style="margin: 0; padding: 0; font-family: monospace">I've re-encoded BNC with 'EncodeBNC.perl' and 'cqp' now returns zero matches. It seems that both Positional and Structural Attributes have been properly encoded (see below) but it seems that the language variable was not properly assigned. This is what 'info' shows:</div>
<div class="pre" style="margin: 0; padding: 0; font-family: monospace"> </div>
<div class="pre" style="margin: 0; padding: 0; font-family: monospace"><strong>BNC> info</strong><br />Warning:<br /> Can't open info file /home/corp/tma/.info for reading<br />Size: 26142145<br />Charset: latin1<br />Properties:<br /> language = '??'<br /> charset = 'latin1'</div>
<div class="pre" style="margin: 0; padding: 0; font-family: monospace"> </div>
<div class="pre" style="margin: 0; padding: 0; font-family: monospace"> </div>
<div class="pre" style="margin: 0; padding: 0; font-family: monospace"><strong>BNC> "the"</strong><br />0 matches.</div>
<div class="pre" style="margin: 0; padding: 0; font-family: monospace"> </div>
<div class="pre" style="margin: 0; padding: 0; font-family: monospace"> </div>
<div class="pre" style="margin: 0; padding: 0; font-family: monospace"><strong>BNC> show cd</strong></div>
<div class="pre" style="margin: 0; padding: 0; font-family: monospace">===Context Descriptor=======================================<br /><br />left context: 25 characters<br />right context: 25 characters<br />corpus position: shown<br />target anchors: not shown<br /><br />Positional Attributes: * word<br /> pos<br /> lemma<br /> hw<br /> class<br /> type<br /> flags_before<br /> space_after<br /> offset<br /><br />Structural Attributes: text<br /> text_id [A]<br /> text_title [A]<br /> text_n_words [A]<br /> text_n_tokens [A]<br /> text_n_w [A]<br /> text_n_c [A]<br /> text_n_s [A]<br /> text_publication_date [A]<br /> text_text_type [A]<br /> text_context [A]<br /> text_respondent_age [A]<br /> text_respondent_class [A]<br /> text_respondent_sex [A]<br /> text_interaction_type [A]<br /> text_region [A]<br /> text_author_age [A]<br /> text_author_domicile [A]<br /> text_author_sex [A]<br /> text_author_type [A]<br /> text_audience_age [A]<br /> text_domain [A]<br /> text_difficulty [A]<br /> text_medium [A]<br /> ...<br /> Any suggestions? Thank you.</div>
<div class="pre" style="margin: 0; padding: 0; font-family: monospace"> </div>
<div class="pre" style="margin: 0; padding: 0; font-family: monospace">Best</div>
<div class="pre" style="margin: 0; padding: 0; font-family: monospace">Aleks<br />
<div>-- <br />
<div class="pre" style="margin: 0; padding: 0; font-family: monospace"><em>Dr Aleksandar Trklja</em><br /><em>Senior Lecturer</em><br /><em>Department of Translation Studies</em></div>
<div class="pre" style="margin: 0; padding: 0; font-family: monospace"><em>University of Vienna</em></div>
</div>
</div>
</body></html>