[CWB] Zero matches in BNC

Aleksandar Trklja aleksandar.trklja at univie.ac.at
Fri May 3 16:20:45 CEST 2019


Dear all, 

I've re-encoded BNC with 'EncodeBNC.perl' and 'cqp' now returns zero
matches. It seems that both Positional and Structural Attributes have
been properly encoded (see below) but it seems that the language
variable was not properly assigned. This is what 'info' shows: 

BNC> INFO
Warning:
    Can't open info file /home/corp/tma/.info for reading
Size:    26142145
Charset: latin1
Properties:
        language = '??'
        charset = 'latin1' 

BNC> "THE"
0 matches. 

BNC> SHOW CD 
===Context Descriptor=======================================

left context:     25 characters
right context:    25 characters
corpus position:  shown
target anchors:   not shown

Positional Attributes:  * word
                          pos
                          lemma
                          hw
                          class
                          type
                          flags_before
                          space_after
                          offset

Structural Attributes:    text
                          text_id              [A]
                          text_title           [A]
                          text_n_words         [A]
                          text_n_tokens        [A]
                          text_n_w             [A]
                          text_n_c             [A]
                          text_n_s             [A]
                          text_publication_date [A]
                          text_text_type       [A]
                          text_context         [A]
                          text_respondent_age  [A]
                          text_respondent_class [A]
                          text_respondent_sex  [A]
                          text_interaction_type [A]
                          text_region          [A]
                          text_author_age      [A]
                          text_author_domicile [A]
                          text_author_sex      [A]
                          text_author_type     [A]
                          text_audience_age    [A]
                          text_domain          [A]
                          text_difficulty      [A]
                          text_medium          [A]
                        ...
Any suggestions? Thank you. 

Best 
Aleks
-- 
_Dr Aleksandar Trklja_
_Senior Lecturer_
_Department of Translation Studies_ 
_University of Vienna_
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20190503/bdeb4e67/attachment.html>


More information about the CWB mailing list