[CWB] CQPweb

Hardie, Andrew a.hardie at lancaster.ac.uk
Thu Oct 11 21:36:42 CEST 2012


Hi Martí,

>> Do you think I should go back to the original configuration for the latter directory?

Yes. "AllowOverride" slows down your server (because every time it serves a page it needs to check for the presence of .htaccess files), so should usually be turned off if you don't need it.

>> Maybe a gotcha to be added is that in Mac OSX the php.ini file is initially called php.ini.default and it has to be copied as php.ini in the corresponding folder

Will do.

>> When I load the interface now I get this message

This error only appears until you carry out the final steps of setup, namely   "Click on “Reset MySQL Database” – and do it" - please do this, then see if you still get errors.

>> The manual recommends to change the initial passwd for the interface superuser

Yes, but you should do this after the "Resent database" step above. The order of the instructions in the doc is misleading due to a change in the software since I wrote it - resetting the database needs to be done *first*. I will fix this.

best

Andrew.

-----Original Message-----
From: cwb-bounces at sslmit.unibo.it [mailto:cwb-bounces at sslmit.unibo.it] On Behalf Of Martí Quixal
Sent: 11 October 2012 16:59
To: cwb at sslmit.unibo.it
Subject: Re: [CWB] CQPweb

Hi Andrew,

thanks for the hint! Now it works! I had changed a line saying AllowOverride None to AllowOverride All, but it was not for the path where I have the html root, but another path (/usr/share/web/). Do you think I should go back to the original configuration for the latter directory? (I know this is not a CQP question, but just want to know if it is safe, and maybe you can help me).

Then I had problems because php did not find the socket, but fortunately I could solve that thanks to the documentation you have in the installation instructions (the gotcha). Maybe a gotcha to be added is that in Mac OSX the php.ini file is initially called php.ini.default and it has to be copied as php.ini in the corresponding folder (I can help do these things if the page is editable somewhere).

I have two more questions:

1) When I load the interface now I get this message (depending on the page I load):

A mySQL query did not run successfully!
Error # 1146: Table 'cqpwebdb.corpus_metadata_fixed' doesn't exist

2) The manual recommends to change the initial passwd for the interface superuser. I did so using the create/reset user option, but I got this message:

A mySQL query did not run successfully!
Error # 1146: Table 'cqpwebdb.user_settings' doesn't exist

Do I need to create these two tables manually?

Thanks a lot for your help!
Best!
Martí

On Thu, Oct 11, 2012 at 3:59 AM,  <cwb-request at sslmit.unibo.it> wrote:
> Send CWB mailing list submissions to
>         cwb at sslmit.unibo.it
>
> To subscribe or unsubscribe via the World Wide Web, visit
>         http://devel.sslmit.unibo.it/mailman/listinfo/cwb
> or, via email, send a message with subject or body 'help' to
>         cwb-request at sslmit.unibo.it
>
> You can reach the person managing the list at
>         cwb-owner at sslmit.unibo.it
>
> When replying, please edit your Subject line so it is more specific 
> than "Re: Contents of CWB digest..."
>
>
> Today's Topics:
>
>    1. Re: CQPweb (Hardie, Andrew)
>    2. Re: Huffman code error (Stefan Evert)
>    3. Re: Huffman code error (BOF?AS)
>
>
> ----------------------------------------------------------------------
>
> Message: 1
> Date: Wed, 10 Oct 2012 13:27:52 +0000
> From: "Hardie, Andrew" <a.hardie at lancaster.ac.uk>
> To: Open source development of the Corpus WorkBench
>         <cwb at sslmit.unibo.it>
> Subject: Re: [CWB] CQPweb
> Message-ID:
>         <28078EC3FBF1B940A3EF3D0D19BE351D0D3A16 at EX-0-MB1.lancs.local>
> Content-Type: text/plain; charset="utf-8"
>
> OK, can you check that your Apache setup is configured to respect the contents of htaccess files? The directive ?AllowOVerride All? should be present for /Library/WebServer/Documents/spintx-web .
>
> best
>
> Andrew.
>
> From: cwb-bounces at sslmit.unibo.it [mailto:cwb-bounces at sslmit.unibo.it] 
> On Behalf Of Mart? Quixal
> Sent: 10 October 2012 14:16
> To: cwb at sslmit.unibo.it
> Subject: Re: [CWB] CQPweb
>
> Hi Andrew,
>
> the file is there, and the permissions seem to be ok too:
>
> guiscafred:~ mquixal$ ls -l 
> /Library/WebServer/Documents/spintx-web/adm/
> total 16
> -rw-rw-r--  1 spintxweb  cqpweb  51 Jun 11  2009 index.php
> -rw-rw-r--  1 spintxweb  cqpweb   9 Jun 11  2009 settings.inc.php
>
> guiscafred:~ mquixal$ cat 
> /Library/WebServer/Documents/spintx-web/adm/.htaccess
> AuthUserFile ../WebCorpora/AccessData/.htpasswd
>             AuthGroupFile ../WebCorpora/AccessData/.htgroup
>             AuthName CQPweb
>             AuthType Basic
>             deny from all
>             require group superusers
>             satisfy any
>
> guiscafred:~ mquixal$ ls -l 
> /Library/WebServer/Documents/spintx-web/adm/.htaccess
> -rw-rw-r--  1 root  cqpweb  181 Oct  6 08:29 
> /Library/WebServer/Documents/spintx-web/adm/.htaccess
>
> Any further suggestions?
> Best
> mq
>
>
> On Wed, Oct 10, 2012 at 5:00 AM, <cwb-request at sslmit.unibo.it<mailto:cwb-request at sslmit.unibo.it>> wrote:
> Send CWB mailing list submissions to
>         cwb at sslmit.unibo.it<mailto:cwb at sslmit.unibo.it>
>
> To subscribe or unsubscribe via the World Wide Web, visit
>         http://devel.sslmit.unibo.it/mailman/listinfo/cwb
> or, via email, send a message with subject or body 'help' to
>         
> cwb-request at sslmit.unibo.it<mailto:cwb-request at sslmit.unibo.it>
>
> You can reach the person managing the list at
>         cwb-owner at sslmit.unibo.it<mailto:cwb-owner at sslmit.unibo.it>
>
> When replying, please edit your Subject line so it is more specific 
> than "Re: Contents of CWB digest..."
>
>
> Today's Topics:
>
>    1. Re: CQPweb (Hardie, Andrew)
>
>
> ----------------------------------------------------------------------
>
> Message: 1
> Date: Tue, 9 Oct 2012 15:07:53 +0000
> From: "Hardie, Andrew" 
> <a.hardie at lancaster.ac.uk<mailto:a.hardie at lancaster.ac.uk>>
> To: Open source development of the Corpus WorkBench
>         <cwb at sslmit.unibo.it<mailto:cwb at sslmit.unibo.it>>
> Subject: Re: [CWB] CQPweb
> Message-ID:
>         <28078EC3FBF1B940A3EF3D0D19BE351D0D2D4C at EX-0-MB1.lancs.local>
> Content-Type: text/plain; charset="utf-8"
>
> Does the hidden file adm/.htaccess exist?
>
> If so, what does it contain?
>
> If not, create it with the following contents:
>
> AuthUserFile /path/to/your/.htpasswd
> AuthGroupFile /path/to/your/.htgroup
> AuthName CQPweb
> AuthType Basic
> deny from all
> require group superusers
> satisfy any
>
> and see if that fixes it. (That file is normally created by the auto config script, but if something went wrong as that script was running, it may not have been created).
>
> best
>
> Andrew.
>
> From: 
> cwb-bounces at sslmit.unibo.it<mailto:cwb-bounces at sslmit.unibo.it><mailto
> :cwb-bounces at sslmit.unibo.it<mailto:cwb-bounces at sslmit.unibo.it>> 
> [mailto:cwb-bounces at sslmit.unibo.it<mailto:cwb-bounces at sslmit.unibo.it
> >] On Behalf Of Mart? Quixal
> Sent: 09 October 2012 15:32
> To: 
> cwb at sslmit.unibo.it<mailto:cwb at sslmit.unibo.it><mailto:cwb at sslmit.unib
> o.it<mailto:cwb at sslmit.unibo.it>>
> Subject: Re: [CWB] CQPweb
>
> Hi Andrew,
>
> I checked that to, and it seems everything is ok there too:
>
>
> /* adminstrators' usernames, separated by | */ /*$superuser_username = 
> 'spintexweb|spintxweb';*/ $superuser_username = 'spintexweb';
>
> The second line is because at some point I tried to use to the other user.
>
> What shall I try no?
>
> a) reinstall CQPWeb from scratch?
> b) try to debug php code?
>
> Other alternatives?
>
> Thanks in advance!
> Mart?
>
> -------------- next part -------------- An HTML attachment was 
> scrubbed...
> URL: 
> <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20121009/0cff2
> e9c/attachment-0001.html>
>
> ------------------------------
>
> _______________________________________________
> CWB mailing list
> CWB at sslmit.unibo.it<mailto:CWB at sslmit.unibo.it>
> http://devel.sslmit.unibo.it/mailman/listinfo/cwb
>
>
> End of CWB Digest, Vol 70, Issue 12
> ***********************************
>
>
>
> -------------- next part -------------- An HTML attachment was 
> scrubbed...
> URL: 
> <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20121010/091b5
> 415/attachment-0001.html>
>
> ------------------------------
>
> Message: 2
> Date: Wed, 10 Oct 2012 15:35:56 +0200
> From: Stefan Evert <stefanML at collocations.de>
> To: Open source development of the Corpus WorkBench
>         <cwb at sslmit.unibo.it>
> Subject: Re: [CWB] Huffman code error
> Message-ID: <69BFE711-953A-43C1-9E93-ED075122EA3A at collocations.de>
> Content-Type: text/plain; charset=windows-1252
>
>
>> I have the feeling this bug has come up before
>
> It has, but AFAIR this was in the context of very large corpora (> 1.5 billion words) and has to do with a deficiency in the CWB binary file format, so it cannot be fixed in a backward-compatible way.
>
>> ? can you check your version? (cqp ?v)
>
> The path indicates CWB 3.4.1, which seems to be rather ancient and will contain a lot of bugs that have been fixed in the meantime.
>
> For what it's worth, I tried the sample input file included in the e-mail with CWB 3.4.3 and 3.4.5 on my Mac and wasn't able to reproduce the error.
>
> Two observations, though:
>
> 1) The sample file in the e-mail has only 35 tokens, not 40 tokens as claimed.  So perhaps this is a cut-down version that doesn't trigger the error?
>
> 2) When copying & pasting from the e-mail, I end up with 4 blanks as column separators rather than the required TABs, which I edited before encoding, of course.  If I use blanks instead of TABs, cwb-huffcode will fail, of course, because the attributes "lema" and "pos" are empty.  However, this produces a different error message from the one reported.
>
> Best
> Stefan
>
>
>
>
> ------------------------------
>
> Message: 3
> Date: Thu, 11 Oct 2012 09:59:24 +0100
> From: BOF?AS ALBERCH, EVA <eva.bofias at upf.edu>
> To: cwb at sslmit.unibo.it
> Subject: Re: [CWB] Huffman code error
> Message-ID:
>         
> <CANfLD1pVDB0jb7iAcXc000x2fji29VGnYJLtodQvRDOUGYUscA at mail.gmail.com>
> Content-Type: text/plain; charset="iso-8859-1"
>
> Sorry, I forgot to mentions my version.
>
> Version:   3.0.2 (i download it when 3.0 was not available)
>
> When I searched I had seen something similar, but it happens with very 
> large corpora, but I have this error even with  a really tiny one.
>
> Thanks
> Eva
>
> Message: 2
> Date: Wed, 10 Oct 2012 11:55:08 +0000
> From: "Hardie, Andrew" <a.hardie at lancaster.ac.uk>
> To: Open source development of the Corpus WorkBench
>         <cwb at sslmit.unibo.it>
> Subject: Re: [CWB] Huffman code error
> Message-ID:
>         <28078EC3FBF1B940A3EF3D0D19BE351D0D38F6 at EX-0-MB1.lancs.local>
> Content-Type: text/plain; charset="iso-8859-1"
>
> I have the feeling this bug has come up before - can you check your 
> version? (cqp -v)
>
> thanks
>
> Andrew.
>
> 2012/10/10 BOF?AS ALBERCH, EVA <eva.bofias at upf.edu>
>
>> Hi,
>>
>> I have an error, I am not able to solve. I'm trying to build a Latin 
>> corpora but I get this error:
>>
>> Error: Huffman codes too long (32 bits, current maximum is 31 bits).
>>        Please contact the CWB development team for assistance.
>>
>> I got this error when trying to build a  40 words corpora (I cut it 
>> to see if I could detect the error; with 39 words I do not get the 
>> error)
>>
>> -----------
>> <doc type="CHRISTIAN_LATIN" title="Abelard"> <s>
>> PETRUS    Petrus    N:nom
>> ABAELARDUS    UNKNOUN    ADJ
>> (    (    PUN
>> 1079-1142    card    ADJ:NUM
>> )    )    PUN
>> ABAELARDI    UNKNOUN   N:voc
>> AD    UNKNOUN    N:abl
>> AMICUM    amicus    ADJ
>> SUUM    sus    N:gen
>> CONSOLATORIA    consolatorius    ADJ
>> Sepe    sepes    N:dat
>> humanos    humanus    ADJ
>> affectus    affectus    N:nom
>> aut    aut    CC
>> provocant    provoco    V:IND
>> aut    aut    CC
>> mittigant   mi    V:IND
>> amplius    ample    ADV
>> exempla    exemplum    N:nom
>> quam    qui    REL
>> verba    verbum    N:nom
>> .    .    SENT
>> </s>
>> <s>
>> Unde    unde    ADV
>> post    post    PREP
>> nonnullam    nonnullus    ADJ
>> sermonis    sermo    N:gen
>> ad    ad    PREP
>> habiti  habeo    V:PTC
>> consolationem    consolatio    N:acc
>> ,    ,    PUN
>> de    de    PREP
>> ipsis    ipse    DET
>> calamitatum    calamitas    N:gen
>> mearum    meus    POSS
>> experimentis    experimentum    N:abl
>> </s>
>> </doc>
>>
>> -----------------
>> This are the attributes I use to describe the corpus:
>>
>> cat $SOURCEFILE | /usr/local/cwb-3.4.1/bin/cwb-encode -c utf8 -d 
>> $DATADIR -R $REGDIR/$CORPUSNAME -xsB -P lema -P pos -V s  -S 
>> doc:0+type+title -S not:0+text
>>
>> Thanks
>>
>> Eva Bofias
>>
> -------------- next part -------------- An HTML attachment was 
> scrubbed...
> URL: 
> <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20121011/e7f3f
> 4ba/attachment.html>
>
> ------------------------------
>
> _______________________________________________
> CWB mailing list
> CWB at sslmit.unibo.it
> http://devel.sslmit.unibo.it/mailman/listinfo/cwb
>
>
> End of CWB Digest, Vol 70, Issue 14
> ***********************************



--
Martí Quixal
Computational Linguist & Educational Technologist http://www.iqubo.org/quixal _______________________________________________
CWB mailing list
CWB at sslmit.unibo.it
http://devel.sslmit.unibo.it/mailman/listinfo/cwb


More information about the CWB mailing list