[CWB] Can't create metadata
Hardie, Andrew
a.hardie at lancaster.ac.uk
Mon Nov 14 10:38:10 CET 2016
Well it looks rather as if you don't have any text tags at all there... which would be part of the problem. Try again with <text id="...">...</text> tags added to the file, as required.
As for why indexing is taking so long, it's very difficult for me to diagnose at a distance. You should keep an eye on your process list (e.g. via top) to see if anything is actually happening. As long as a cwb-*** process is running, something productive is happening, and you shouldn't abort.
best
Andrew.
-----Original Message-----
From: cwb-bounces at sslmit.unibo.it [mailto:cwb-bounces at sslmit.unibo.it] On Behalf Of Jiayue Wang
Sent: 13 November 2016 11:06
To: Open source development of the Corpus WorkBench
Subject: Re: [CWB] Can't create metadata
Hi Andrew,
Thanks a lot.
I deleted the us_rhodeisland corpus and tried again to install it. The
corpus file looks like this:
If IN if
you PP you
have VBP have
any DT any
questions NNS question
or CC or
suggestions NNS suggestion
how WRB how
this DT this
website NN website
might MD might
be VB be
improved VBN improve
, , ,
please VB please
feel VB feel
free JJ free
to TO to
contact VB contact
us PP us
. SENT .
The corpus contains only this file (44.0 MB). For P-attribute I selected
the POS and lemma (TreeTagger format) option. Then I clicked Install, 31
files were created in the index/us_rhodeisland folder, but the process
goes on endlessly. I interrupted this process and tried again but the
same happened. I'm wondering how long time does this approximately take
on my laptop, which has 8 GB of ram, and a, Intel i5 quadcore CPU?
Best
Jiayue
On 13/11/16 06:19, Hardie, Andrew wrote:
> This error message suggests that your <text> elements lack valid ID
> codes.
>
> The most likely reason for [UNREADABLE] is that you have declared a
> primary annotation, e.g. a part of speech tag, but the annotation in
> question does not exist. This can happen if you use a template that
> your data does not match, for instance.
>
> best
>
> Andrew.
>
> -----Original Message----- From: cwb-bounces at sslmit.unibo.it
> [mailto:cwb-bounces at sslmit.unibo.it] On Behalf Of Jiayue Wang Sent:
> 11 November 2016 20:17 To: Open source development of the Corpus
> WorkBench Subject: [CWB] Can't create metadata
>
> Hi,
>
> After a full installation of CQBweb I installed a corpus called
> "us_rhodeisland" (including 2 files, a raw text, and a TreeTagger
> tagged text) without metadata. Since I have no idea what a metadata
> file looks like, I selected "No thanks, I'll run this myself (safer
> for very large corpora)" and clicked "Create minimalist metadata
> table" and saw the following error message:
>
>
> A MySQL query did not run successfully!
>
>
> Original query: insert into
> ___temp_cqp_text_positions_for_us_rhodeisland (text_id, cqp_begin,
> cqp_end) VALUES ('', 0, 55858),('', 55859, 3058358) /* from User:
> admin | Function: do_append_mysql_comment() | 2016-Nov-11 20:04:20
> */
>
>
> Error # 1062: Duplicate entry '' for key 'PRIMARY'
>
>
> BTW, when I try a standard query, each concordance line begins with
> "[UNREADABLE] [UNREADABLE] [UNREADABLE]". What is the most likely
> reason?
>
> Any help is appreciated, thanks!
>
> Jiayue Wang _______________________________________________ CWB
> mailing list CWB at sslmit.unibo.it
> http://liste.sslmit.unibo.it/mailman/listinfo/cwb
> _______________________________________________ CWB mailing list
> CWB at sslmit.unibo.it
> http://liste.sslmit.unibo.it/mailman/listinfo/cwb
>
_______________________________________________
CWB mailing list
CWB at sslmit.unibo.it
http://liste.sslmit.unibo.it/mailman/listinfo/cwb
More information about the CWB
mailing list