[CWB] Zero matches in BNC: Resolved

Aleksandar Trklja aleksandar.trklja at univie.ac.at
Fri May 17 08:52:29 CEST 2019


Dear Andrew,

There were some admin-related issues because it is a university served. 
Thank you for your help.

Best wishes
A

Am 09.05.2019 11:58, schrieb Hardie, Andrew:
> This is the key error:
> 
> /usr/local/share/cwb/registry/bnc: Permission denied
> 
> The username you are running under doesn’t have permission to create
> a file in that folder. Perhaps you need to change the permissions, or
> else specify a different registry.
> 
> best
> 
> Andrew.
> 
> FROM: Aleksandar Trklja <aleksandar.trklja at univie.ac.at>
> SENT: 09 May 2019 09:29
> TO: Open source development of the Corpus WorkBench
> <cwb at sslmit.unibo.it>
> CC: Hardie, Andrew <a.hardie at lancaster.ac.uk>
> SUBJECT: Re: [CWB] Zero matches in BNC
> 
> Hi Andrew,
> 
> Thank you so much for your clarification.
> 
> "/home/corp/tma/" is actually a directory. It seems this problem arose
> because I tried to encode an already existing directory. After I've
> created a new directory that error message didn't appear.
> 
> But, now I get the following error message at the end of the encoding
> process:
> 
> ...
> Building indices and compressing data ...
>       3 <list> regions dropped because of deep nesting.
>      14 <item> regions dropped because of deep nesting.
>       8 <hi> regions dropped because of deep nesting.
>       3 <p> regions dropped because of deep nesting.
> /usr/local/share/cwb/registry/bnc: Permission denied
> Can't create registry entry in file /usr/local/share/cwb/registry/bnc!
> [location of error: input line #81145224]
> CWB::Encoder: Error in cwb-encode pipe ().
> 
> A
> 
> Am 03.05.2019 20:21, schrieb Hardie, Andrew:
> 
>> Hi Aleks,
>> 
>> The language variable is not really relevant. It not being set means
>> nothing. The size would seem t be wrong, though, as 26 million is
>> nowhere near enough. Something may have gone wrong in the encoding
>> process at that point that has left the lexicon and/or the index
>> unfinished (thus the search failure).
>> 
>> Also, is your BNC data directory actually /home/corp/tma/ or is it a
>> subdirectory of that? The latter would indicate something amiss if
>> CQP
>> is looking for the .info file (which usually doesn’t exist) in the
>> parent directory. You might check what paths are given in the
>> registry
>> file, perhaps.
>> 
>> Hope that helps
>> 
>> best
>> 
>> Andrew.
>> 
>> FROM: cwb-bounces at sslmit.unibo.it <cwb-bounces at sslmit.unibo.it> ON
>> BEHALF OF Aleksandar Trklja
>> SENT: 03 May 2019 15:21
>> TO: cwb at sslmit.unibo.it
>> SUBJECT: [CWB] Zero matches in BNC
>> IMPORTANCE: High
>> 
>> Dear all,
>> 
>> I've re-encoded BNC with 'EncodeBNC.perl' and 'cqp' now returns zero
>> matches. It seems that both Positional and Structural Attributes
>> have
>> been properly encoded (see below) but it seems that the language
>> variable was not properly assigned. This is what 'info' shows:
>> 
>> BNC> INFO
>> Warning:
>> Can't open info file /home/corp/tma/.info for reading
>> Size:    26142145
>> Charset: latin1
>> Properties:
>> language = '??'
>> charset = 'latin1'
>> 
>> BNC> "THE"
>> 0 matches.
>> 
>> BNC> SHOW CD
>> 
>> ===Context Descriptor=======================================
>> 
>> left context:     25 characters
>> right context:    25 characters
>> corpus position:  shown
>> target anchors:   not shown
>> 
>> Positional Attributes:  * word
>> pos
>> lemma
>> hw
>> class
>> type
>> flags_before
>> space_after
>> offset
>> 
>> Structural Attributes:    text
>> text_id              [A]
>> text_title           [A]
>> text_n_words         [A]
>> text_n_tokens        [A]
>> text_n_w             [A]
>> text_n_c             [A]
>> text_n_s             [A]
>> text_publication_date [A]
>> text_text_type       [A]
>> text_context         [A]
>> text_respondent_age  [A]
>> text_respondent_class [A]
>> text_respondent_sex  [A]
>> text_interaction_type [A]
>> text_region          [A]
>> text_author_age      [A]
>> text_author_domicile [A]
>> text_author_sex      [A]
>> text_author_type     [A]
>> text_audience_age    [A]
>> text_domain          [A]
>> text_difficulty      [A]
>> text_medium          [A]
>> ...
>> Any suggestions? Thank you.
>> 
>> Best
>> 
>> Aleks
>> 
>> --
>> 
>> _Dr Aleksandar Trklja_
>> _Senior Lecturer_
>> _Department of Translation Studies_
>> 
>> _University of Vienna_
>> _______________________________________________
>> CWB mailing list
>> CWB at sslmit.unibo.it
>> http://liste.sslmit.unibo.it/mailman/listinfo/cwb
> 
> --
> _______________________________________________
> CWB mailing list
> CWB at sslmit.unibo.it
> http://liste.sslmit.unibo.it/mailman/listinfo/cwb

-- 
_Dr Aleksandar Trklja_
_Senior Lecturer_
_Department of Translation Studies_
_University of Vienna_


More information about the CWB mailing list