[CWB] Indexing of metadata problem + no display of query results
Hardie, Andrew
a.hardie at lancaster.ac.uk
Fri Nov 20 13:28:59 CET 2015
Re A -- this is exactly what it says, i.e. the CWB index of the original corpus contains words that aren't within a <text></text> span, which is not allowed in CQPweb. So either something has gone wrong in your input data, or else something went wrong with the creation of the original index. The problem is not in the frequency list creation script.
Re B -- the blank page should be accompanied by an entry in your web server's e3rror log which will tell you at what point things went wrong. In Apache that's usually /var/log/apache2/error.log or something similar.
best
Andrew.
-----Original Message-----
From: cwb-bounces at sslmit.unibo.it [mailto:cwb-bounces at sslmit.unibo.it] On Behalf Of Emmanuel CARTIER
Sent: 20 November 2015 12:11
To: cwb at sslmit.unibo.it
Subject: [CWB] Indexing of metadata problem + no display of query results
Hi,
I am currently working with the last version of CWB and with CQPWeb
version 3.0.16.
I managed to index big corpora (from 100 to 500 Go) on the command line
and install the corpora on CQPWeb.
I have two problems:
A.
When I launch the offline-freq-list.php (php
../bin/offline-freqlists.php <corpora name in lowercase) for generating
metadata indexes, it generates the following error :
</pre>
<p class="errormessage">CQPweb encountered an error and could not
continue.</p>
<p class="errormessage">Unexpected line outside <text> tags while
creating corpus
POLOGNE_2015__FREQ! -- creation aborted</p>
<p class="errormessage">... in file
<b>/var/www/CQPweb/lib/freqtable-cwb.inc.php</b> line <b>177</b>.</p>
Afterwards, it does not unable to query the corpus, but can you indicate
me some hints to debug it?
B. When querying my corpus (pos-tagged with treetagger, then post
processed to transform <unknown> lemma to "unknown") with the following
CQP query [lemma="unknown'], the web interface always ends with a blank
page. But when I use the commandline cqp utility, it is outputing the
results normally. can you give me some hints on that?
Thanks a lot for your work and help,
Emmanuel
--
Emmanuel Cartier
Enseignant-Chercheur en Linguistique Informatique
LIPN CNRS UMR 7030 - équipe RCLN
http://lipn.univ-paris13.fr/fr/rcln
Université Paris 13 Sorbonne Paris Cité
99 avenue Jean-Baptiste Clement
93430 Villetaneuse
tél. : (+33) 06 46 79 12 86
email : emmanuel.cartier at univ-paris13.fr
_______________________________________________
CWB mailing list
CWB at sslmit.unibo.it
http://devel.sslmit.unibo.it/mailman/listinfo/cwb
More information about the CWB
mailing list