[CWB] Problem with -R function (Cygwin)

Aleksandar Trklja A.Trklja at bham.ac.uk
Tue Jul 4 13:22:17 CEST 2017


Dear all,

I've recently installed the Windows version of 'cwb', but I can't encode any corpus because of a problem with the -R option. I use Cygwin.

When I specify the name of a new directory, I receive an error message saying that the corpus ID is not valid and that there is an error on the last line of my file. This happens regardless of what my source text is. Interestingly, -R does create a registry file, but it is empty.

$ cwb-encode.exe -d corpus/data/opn -f text.txt -R corpus/reg/opn -P pos -P lemma -S s:0 -c utf8
corpus/reg/opn is not a valid corpus ID! Can't create registry entry.
[location of error: file text.txt, line #27]


If I define the standard registry directory with 'export CORPUS_REGISTRY=', I get a message that the registry directory doesn't exist, although it does.


$ cwb-encode.exe -d corpus/data/opn -f text.txt -R /cygdrive/c/cwb/registry/opn -P pos -P lemma -S s:0 -c utf8
Error: registry directory '/cygdrive/c/cwb/registry' does not exist.
Please create this directory first.
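
One way to double-check what the shell itself sees at this point is sketched below. This is only a minimal sketch and not part of CWB: 'check_dir' is a hypothetical helper name, and the 'cygpath -w' step rests on the assumption that comparing the Cygwin path with its Windows form is informative, which would only matter if the binary does not interpret /cygdrive paths the way the shell does.

```shell
# Hypothetical helper (not part of CWB): report whether the shell sees a directory.
check_dir() {
    if [ -d "$1" ]; then
        echo "exists: $1"
    else
        echo "missing: $1"
        return 1
    fi
}

# Path taken from the command above; adjust as needed.
check_dir /cygdrive/c/cwb/registry || true

# Only on Cygwin: print the Windows form of the same path for comparison.
if command -v cygpath >/dev/null 2>&1; then
    cygpath -w /cygdrive/c/cwb/registry
fi
```

If the shell reports the directory as existing while cwb-encode still says it does not, the two are probably resolving the path differently.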

I should add that encoding the same texts on another machine with a Cygwin-based 'cwb' works just fine.

This is an example of a text I've tried to encode:
<s>
The DT the
nineteenth JJ nineteenth
century NN century
was VBD be
, , ,
until IN until
recently RB recently
, , ,
predominantly RB predominantly
seen VVN see
as IN as
a DT a
century NN century
of IN of
rapid JJ rapid
industrialisation NN industrialisation
which WDT which
set VVD set
the DT the
stage NN stage
for IN for
profound JJ profound
social JJ social
change NN change
. SENT .
</s>
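
To rule out the input file itself, a check along the following lines could be used. It is a rough sketch under stated assumptions: 'check_columns' is a hypothetical helper name, and it assumes the file is tab-separated with exactly three columns per token line (word form plus the 'pos' and 'lemma' attributes declared with -P above), while lines starting with '<' are treated as XML tags.

```shell
# Hypothetical helper (not part of CWB): flag token lines that do not have
# exactly three tab-separated fields (word, pos, lemma).
check_columns() {
    awk -F '\t' '
        BEGIN { bad = 0 }
        # Skip XML tag lines such as <s> and </s>.
        !/^</ && NF != 3 {
            printf "line %d: %d field(s)\n", NR, NF
            bad = 1
        }
        END { exit bad }
    ' "$1"
}

# Usage (file name taken from the commands above):
#   check_columns text.txt && echo "all token lines have 3 columns"
```

The function prints each offending line number and returns a non-zero status if any token line has a different number of fields.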



Many thanks.

Best wishes
Aleksandar
