<p dir="ltr">Hi jiayue,</p>
<p dir="ltr">My team works with verticalized texts like this:</p>
<p dir="ltr"><text id="ST1" title="namewithoutspaces" author="name"><br>
<s><br>
word pos lemma<br>
word pos lemma<br>
word pos lemma<br>
word pos lemma<br>
</s><br>
</text></p>
<p dir="ltr"><text id="ST2" title="anothertextname" author="otherperson"><br>
<s><br>
word pos lemma<br>
word pos lemma<br>
word pos lemma<br>
</s><br>
</text></p>
<p dir="ltr">You can add more text tags as: author_sex, language, year, translator...<br>
</p>
<div class="gmail_extra"><br><div class="gmail_quote">El 14 nov. 2016 2:37 p. m., "Jiayue Wang" <<a href="mailto:arthur0421@gmail.com">arthur0421@gmail.com</a>> escribió:<br type="attribution"><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">Thanks Andrew. I still don't understand where the tags <text id=""> and </text> should be added. Should they enclose a corpus file? I notice that section 7.6 "Metadata template" of the CQPwebAdminManual is empty. Could you show me a template?<br>
<br>
Best,<br>
Jiayue<br>
<br>
On 14/11/16 09:38, Hardie, Andrew wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
Well it looks rather as if you don't have any text tags at all there... which would be part of the problem. Try again with <text id="...">...</text> tags added to the file, as required.<br>
<br>
As for why indexing is taking so long, it's very difficult for me to diagnose at a distance. You should keep an eye on your process list (e.g. via top) to see if anything is actually happening. As long as a cwb-*** process is running, something productive is happening, and you shouldn't abort.<br>
<br>
best<br>
<br>
Andrew.<br>
<br>
-----Original Message-----<br>
From: <a href="mailto:cwb-bounces@sslmit.unibo.it" target="_blank">cwb-bounces@sslmit.unibo.it</a> [mailto:<a href="mailto:cwb-bounces@sslmit.unibo.it" target="_blank">cwb-bounces@sslmit.uni<wbr>bo.it</a>] On Behalf Of Jiayue Wang<br>
Sent: 13 November 2016 11:06<br>
To: Open source development of the Corpus WorkBench<br>
Subject: Re: [CWB] Can't create metadata<br>
<br>
Hi Andrew,<br>
<br>
Thanks a lot.<br>
I deleted the us_rhodeisland corpus and tried again to install it. The<br>
corpus file looks like this:<br>
<br>
If IN if<br>
you PP you<br>
have VBP have<br>
any DT any<br>
questions NNS question<br>
or CC or<br>
suggestions NNS suggestion<br>
how WRB how<br>
this DT this<br>
website NN website<br>
might MD might<br>
be VB be<br>
improved VBN improve<br>
, , ,<br>
please VB please<br>
feel VB feel<br>
free JJ free<br>
to TO to<br>
contact VB contact<br>
us PP us<br>
. SENT .<br>
<br>
The corpus contains only this file (44.0 MB). For P-attribute I selected<br>
the POS and lemma (TreeTagger format) option. Then I clicked Install, 31<br>
files were created in the index/us_rhodeisland folder, but the process<br>
goes on endlessly. I interrupted this process and tried again but the<br>
same happened. I'm wondering how long time does this approximately take<br>
on my laptop, which has 8 GB of ram, and a, Intel i5 quadcore CPU?<br>
<br>
Best<br>
Jiayue<br>
<br>
On 13/11/16 06:19, Hardie, Andrew wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
This error message suggests that your <text> elements lack valid ID<br>
codes.<br>
<br>
The most likely reason for [UNREADABLE] is that you have declared a<br>
primary annotation, e.g. a part of speech tag, but the annotation in<br>
question does not exist. This can happen if you use a template that<br>
your data does not match, for instance.<br>
<br>
best<br>
<br>
Andrew.<br>
<br>
-----Original Message----- From: <a href="mailto:cwb-bounces@sslmit.unibo.it" target="_blank">cwb-bounces@sslmit.unibo.it</a><br>
[mailto:<a href="mailto:cwb-bounces@sslmit.unibo.it" target="_blank">cwb-bounces@sslmit.uni<wbr>bo.it</a>] On Behalf Of Jiayue Wang Sent:<br>
11 November 2016 20:17 To: Open source development of the Corpus<br>
WorkBench Subject: [CWB] Can't create metadata<br>
<br>
Hi,<br>
<br>
After a full installation of CQBweb I installed a corpus called<br>
"us_rhodeisland" (including 2 files, a raw text, and a TreeTagger<br>
tagged text) without metadata. Since I have no idea what a metadata<br>
file looks like, I selected "No thanks, I'll run this myself (safer<br>
for very large corpora)" and clicked "Create minimalist metadata<br>
table" and saw the following error message:<br>
<br>
<br>
A MySQL query did not run successfully!<br>
<br>
<br>
Original query: insert into<br>
___temp_cqp_text_positions_for<wbr>_us_rhodeisland (text_id, cqp_begin,<br>
cqp_end) VALUES ('', 0, 55858),('', 55859, 3058358) /* from User:<br>
admin | Function: do_append_mysql_comment() | 2016-Nov-11 20:04:20<br>
*/<br>
<br>
<br>
Error # 1062: Duplicate entry '' for key 'PRIMARY'<br>
<br>
<br>
BTW, when I try a standard query, each concordance line begins with<br>
"[UNREADABLE] [UNREADABLE] [UNREADABLE]". What is the most likely<br>
reason?<br>
<br>
Any help is appreciated, thanks!<br>
<br>
Jiayue Wang ______________________________<wbr>_________________ CWB<br>
mailing list <a href="mailto:CWB@sslmit.unibo.it" target="_blank">CWB@sslmit.unibo.it</a><br>
<a href="http://liste.sslmit.unibo.it/mailman/listinfo/cwb" rel="noreferrer" target="_blank">http://liste.sslmit.unibo.it/m<wbr>ailman/listinfo/cwb</a><br>
______________________________<wbr>_________________ CWB mailing list<br>
<a href="mailto:CWB@sslmit.unibo.it" target="_blank">CWB@sslmit.unibo.it</a><br>
<a href="http://liste.sslmit.unibo.it/mailman/listinfo/cwb" rel="noreferrer" target="_blank">http://liste.sslmit.unibo.it/m<wbr>ailman/listinfo/cwb</a><br>
<br>
</blockquote>
______________________________<wbr>_________________<br>
CWB mailing list<br>
<a href="mailto:CWB@sslmit.unibo.it" target="_blank">CWB@sslmit.unibo.it</a><br>
<a href="http://liste.sslmit.unibo.it/mailman/listinfo/cwb" rel="noreferrer" target="_blank">http://liste.sslmit.unibo.it/m<wbr>ailman/listinfo/cwb</a><br>
______________________________<wbr>_________________<br>
CWB mailing list<br>
<a href="mailto:CWB@sslmit.unibo.it" target="_blank">CWB@sslmit.unibo.it</a><br>
<a href="http://liste.sslmit.unibo.it/mailman/listinfo/cwb" rel="noreferrer" target="_blank">http://liste.sslmit.unibo.it/m<wbr>ailman/listinfo/cwb</a><br>
<br>
</blockquote>
______________________________<wbr>_________________<br>
CWB mailing list<br>
<a href="mailto:CWB@sslmit.unibo.it" target="_blank">CWB@sslmit.unibo.it</a><br>
<a href="http://liste.sslmit.unibo.it/mailman/listinfo/cwb" rel="noreferrer" target="_blank">http://liste.sslmit.unibo.it/m<wbr>ailman/listinfo/cwb</a><br>
</blockquote></div></div>