[CWB] Can't create metadata

Hardie, Andrew a.hardie at lancaster.ac.uk
Mon Nov 14 10:38:10 CET 2016


Well it looks rather as if you don't have any text tags at all there... which would be part of the problem. Try again with <text id="...">...</text> tags added to the file, as required.

As for why indexing is taking so long, it's very difficult for me to diagnose at a distance. You should keep an eye on your process list (e.g. via top) to see if anything is actually happening. As long as a cwb-*** process is running, something productive is happening, and you shouldn't abort.

best

Andrew.

-----Original Message-----
From: cwb-bounces at sslmit.unibo.it [mailto:cwb-bounces at sslmit.unibo.it] On Behalf Of Jiayue Wang
Sent: 13 November 2016 11:06
To: Open source development of the Corpus WorkBench
Subject: Re: [CWB] Can't create metadata

Hi Andrew,

Thanks a lot.
I deleted the us_rhodeisland corpus and tried again to install it. The 
corpus file looks like this:

If      IN      if
you     PP      you
have    VBP     have
any     DT      any
questions       NNS     question
or      CC      or
suggestions     NNS     suggestion
how     WRB     how
this    DT      this
website NN      website
might   MD      might
be      VB      be
improved        VBN     improve
,       ,       ,
please  VB      please
feel    VB      feel
free    JJ      free
to      TO      to
contact VB      contact
us      PP      us
.       SENT    .

The corpus contains only this file (44.0 MB). For P-attribute I selected 
the POS and lemma (TreeTagger format) option. Then I clicked Install, 31 
files were created in the index/us_rhodeisland folder, but the process 
goes on endlessly. I interrupted this process and tried again but the 
same happened. I'm wondering how long time does this approximately take 
on my laptop, which has 8 GB of ram, and a, Intel i5 quadcore CPU?

Best
Jiayue

On 13/11/16 06:19, Hardie, Andrew wrote:
> This error message suggests that your <text> elements lack valid ID
> codes.
>
> The most likely reason for [UNREADABLE] is that you have declared a
> primary annotation, e.g. a part of speech tag, but the annotation in
> question does not exist. This can happen if you use a template that
> your data does not match, for instance.
>
> best
>
> Andrew.
>
> -----Original Message----- From: cwb-bounces at sslmit.unibo.it
> [mailto:cwb-bounces at sslmit.unibo.it] On Behalf Of Jiayue Wang Sent:
> 11 November 2016 20:17 To: Open source development of the Corpus
> WorkBench Subject: [CWB] Can't create metadata
>
> Hi,
>
> After a full installation of CQBweb I installed a corpus called
> "us_rhodeisland" (including 2 files, a raw text, and a TreeTagger
> tagged text) without metadata. Since I have no idea what a metadata
> file looks like, I selected "No thanks, I'll run this myself (safer
> for very large corpora)" and clicked "Create minimalist metadata
> table" and saw the following error message:
>
>
> A MySQL query did not run successfully!
>
>
> Original query: insert into
> ___temp_cqp_text_positions_for_us_rhodeisland (text_id, cqp_begin,
> cqp_end) VALUES ('', 0, 55858),('', 55859, 3058358) /* from User:
> admin | Function: do_append_mysql_comment() | 2016-Nov-11 20:04:20
> */
>
>
> Error # 1062: Duplicate entry '' for key 'PRIMARY'
>
>
> BTW, when I try a standard query, each concordance line begins with
> "[UNREADABLE] [UNREADABLE] [UNREADABLE]". What is the most likely
> reason?
>
> Any help is appreciated, thanks!
>
> Jiayue Wang _______________________________________________ CWB
> mailing list CWB at sslmit.unibo.it
> http://liste.sslmit.unibo.it/mailman/listinfo/cwb
> _______________________________________________ CWB mailing list
> CWB at sslmit.unibo.it
> http://liste.sslmit.unibo.it/mailman/listinfo/cwb
>
_______________________________________________
CWB mailing list
CWB at sslmit.unibo.it
http://liste.sslmit.unibo.it/mailman/listinfo/cwb


More information about the CWB mailing list