[CWB] Indexing recursively
Thomas Zastrow
thomas.zastrow at rzg.mpg.de
Fri Aug 25 14:47:46 CEST 2017
Thanks, but this gives me also 0 results:
cqp -r /data/wp/2017/data/cqp/registry
[no corpus]> WIKIPEDIA;
WIKIPEDIA> "der.*"%c;
0 matches.
Am 25.08.2017 um 14:36 schrieb Hardie, Andrew:
> What happens if you search for
> "der.*"%c
> ?
>
> best
>
> Andrew.
>
>
>
> On Fri, Aug 25, 2017 at 10:03 AM +0100, "Thomas Zastrow"
> <thomas.zastrow at rzg.mpg.de <mailto:thomas.zastrow at rzg.mpg.de>> wrote:
>
> Dear all,
>
> I have a problem with a CQP indexed corpus I created from German
> Wikipedia. Everything looks fine, the "data" folder is about 10 GB and
> the indexing process showed no errors. When I go into the CQP cmd, I can
> activate the corpus:
>
> ------------------------------------------------------------------
> cqp -r /data/wp/2017/data/cqp/registry
> [no corpus]> WIKIPEDIA;
> ------------------------------------------------------------------
>
> The prompt shows now "WIKIPEDIA". Also showing infos works - partially?
> - fine:
>
> ------------------------------------------------------------------
> WIKIPEDIA> info WIKIPEDIA;
> Warning:
> Can't open info file /data/wp/2017/data/cqp/data/.info for reading
> Size: 782308286
> Charset: latin1
> Properties:
> language = '??'
> charset = 'latin1'
> ------------------------------------------------------------------
>
> Also context description looks good:
>
> ------------------------------------------------------------------
> show cd;
> ===Context Descriptor=======================================
>
> left context: 25 characters
> right context: 25 characters
> corpus position: shown
> target anchors: not shown
>
> Positional Attributes: * word
> pos
> lemma
>
> Structural Attributes: s
>
> Aligned Corpora:============================================================
> ------------------------------------------------------------------
> But unfortunately, searching for anything don't work at all:
> ------------------------------------------------------------------
> WIKIPEDIA> "der"; 0 matches.
> ------------------------------------------------------------------
> I'm glad for any help ;-) Thanks, Tom -- Dr. Thomas Zastrow Max
> Planck Computing and Data Facility (MPCDF) Gießenbachstr. 2,
> D-85748 Garching bei München, Germany Tel +49-89-3299-1457
> http://www.mpcdf.de
> _______________________________________________ CWB mailing list
> CWB at sslmit.unibo.it http://liste.sslmit.unibo.it/mailman/listinfo/cwb
>
>
>
> _______________________________________________
> CWB mailing list
> CWB at sslmit.unibo.it
> http://liste.sslmit.unibo.it/mailman/listinfo/cwb
--
Dr. Thomas Zastrow
Max Planck Computing and Data Facility (MPCDF)
Gießenbachstr. 2, D-85748 Garching bei München, Germany
Tel +49-89-3299-1457
http://www.mpcdf.de
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20170825/89954f9c/attachment.html>
More information about the CWB
mailing list