[CWB] problem at managing corpus metadata

Andres Chandia andres at chandia.net
Tue Dec 31 20:31:35 CET 2013



Thanks, I turned back to the previous one that I had CQPweb v3.0.7 © 2008-2012 and all
went well
except for this:

1. I have introduced at the second line
s-attributes this way:  s:0+id+type, then at the restricted query for the s_id it only appears
 the data for S1 and S3, but not for S2 and S4, if you see the corpus  text_1 owns S1 and S2,
text_2 owns S3 and S4, so only  appears the first  S of each text
 

And I take
the opportunity to ask you how do I upgrade with svn to the version you are recommending 


Thanks, and If you don't answer right now I would understand it, Have a Great
New Year's Eve!!!

On Tue, December 31, 2013 19:26, Hardie, Andrew wrote:
 <style type="text/css">-></style>


If
you downgrade to 3.0.16 this will go away.


To
repeat my earlier warning: 3.1.0 is currently unstable and no one but me should be using it
till further notice!


best


Andrew.


From:
cwb-bounces at sslmit.unibo.it [mailto:cwb-bounces at sslmit.unibo.it] On Behalf Of
Andres Chandia
 Sent: 31 December 2013 13:45

To: Open source development of the Corpus WorkBench

Subject: Re: [CWB] problem at managing corpus metadata
 
Ok, it works, thanks a lot... I have some
more issues though....
 
 1. I have introduced at the second line s-attributes this
way: s:0+id+type, then at the restricted query for the s_id it only appears the data for S1
and S3, but not for S2 and S4, if you see the corpus text_1 owns S1 and S2, text_2 owns S3 and
S4, so only  appears the first S of each text
 
 
 2. at the "Create
metadata table from corpus XML annotations" step is the option "Do you want to
automatically run frequency-list setup?" if I select "yes" everything goes
well, but if I select "No thanks, I'll run this myself (safer for very large
corpora)"  and I go to that step, on clicking the "The text metadata table has not
yet been populated with begin/end offset positions. Use the button below to generate this
data." button I get next error:
A mySQL query did not run successfully!
Error # 1146: Table 'cqp.text_metadata_for_' doesn't exist 
  
PHP debugging backtrace
array(6) {   [0]=>   array(4) {     ["file"]=>     string(40)
"/srv/web/llocs/cqp/lib/exiterror.inc.php"     ["line"]=>    
int(260) 
   ["function"]=>     string(17) "exiterror_endpage"    
["args"]=>     array(0) {     }   }   [1]=>   array(4) {    
["file"]=>     string(38) "/srv/web/llocs/cqp/lib/library.inc.php"
   
["line"]=>     int(227)     ["function"]=>     string(20)
"exiterror_mysqlquery"     ["args"]=>     array(2) {      
[0]=>    
  &int(1146)       [1]=>       &string(44) "Table
'cqp.text_metadata_for_'
doesn't exist"     }   }   [2]=>   array(4) {     ["file"]=>    
string(40) "/srv/web/llocs/cqp/lib/admin-lib.inc.php"    
["line"]=>   
 int(516)     ["function"]=>     string(14) "do_mysql_query"   

["args"]=>     array(1) {       [0]=>       &string(83)
"update
text_metadata_for_                    set cqp_begin = 0, cqp_end = 41                    
  where text_id = '1'"     }   }  
[3]=>   array(2) {     ["function"]=>     string(29)
"populate_corpus_cqp_positions"     ["args"]=>     array(0) {    
}   }
  [4]=>   array(4) {     ["file"]=>     string(38)
"/srv/web/llocs/cqp/lib/execute.inc.php"     ["line"]=>    
int(154)   
 ["function"]=>     string(20) "call_user_func_array"    
["args"]=>     array(2) {       [0]=>       &string(29)
"populate_corpus_cqp_positions"       [1]=>       &array(0) {       }   
 }  
}   [5]=>   array(4) {     ["file"]=>     string(38)
"/srv/web/llocs/cqp/testing/execute.php"     ["line"]=>     int(1)
   
["args"]=>     array(1) {       [0]=>       string(38)
"/srv/web/llocs/cqp/lib/execute.inc.php"     }     ["function"]=> 
  
string(7) "require"   } }
 Thanks a lot

 
 
 On Tue, December 31, 2013 08:37, Hardie, Andrew
wrote:


Ah.
That makes it clear what the problem is. You have specified name, lang and season as separate 
elements, not as attributes
on the text element.
Try
putting the following in a single box:
text:0+id+name+lang+season
best
Andrew.
PS
as you may have read in my email from last week, upgrading to 3.1.0 is not advised, as I will 
be breaking things in the new major version. I recommend sticking with 3.0.16 till further
notice. 
From: 
Andres Chandia [mailto:andres at chandia.net] 
 Sent: 30
December 2013 18:30
 To: Hardie,
Andrew
 Cc:
cwb at sslmit.unibo.it
 Subject: Re:
[CWB] problem at managing corpus metadata
No changes, I have
made a svn update and now I have next version: CQPweb v3.1.0 © 2008-2014 
 
 I
attach the corpus, an image of the settings I put and the registry file created which I named
.reg
 
 Thanks for your patience
 
 
 On Sun, December 29, 2013
18:23, Hardie, Andrew wrote:

Then please
upgrade to 3.0.16 and see 2ether the problem persists.

best


Andrew.


 
 Andres
Chandia wrote:

CQPweb v3.0.7 ©
2008-2012 
 
 On Sun, December 29, 2013 02:49, Hardie, Andrew wrote:


Can
you tell me what version of the code you are using?
Thanks
Andrew.
From: 
cwb-bounces at sslmit.unibo.it [mailto:cwb-bounces at sslmit.unibo.it] On Behalf Of
Andres Chandia
 Sent: 28
December 2013 17:51
 To: Open source
development of the Corpus WorkBench
 Subject: Re:
[CWB] problem at managing corpus metadata
yes, this is
what I see in the registry:
 ##
 ##
s-attributes (structural markup)
 ##
 
 # ... 
 STRUCTURE name
 
 # ... 
 STRUCTURE lang
 
 # ... 
 STRUCTURE season
 
 # ... 
 # (no recursive embedding
allowed)
 STRUCTURE text
 STRUCTURE text_id # [annotations]
 
 I didn't know if I should
ad the "id" attribute also....
 anyway, I still get:

 
 


 
 
 
 

    
        
            
             My metadata is embedded in the XML of my
corpus!
            
        
        
            
             
 No XML annotations found for this corpus. 
            
        
    

thanks


 On Sat, December 28, 2013 18:06, Hardie, Andrew wrote:

That function
reads the list of attributes straight from the registry file. Can you please check in the
registry file that the s-attributes really have been indexed as per your  expectations? 

Thanks,


Andrew.


 
 

Andres Chandia wrote:

Ok, I managed
using A, but for B this is what I get :

 
 


 
 
 
 

    
        
            
             My metadata is embedded in the XML of my
corpus!
            
        
        
            
             
 No XML annotations found for this corpus. 
            
        
    


 and I
have indexed all the text attibutes as if they were s-attributes
 
 
 On Sat,
December 28, 2013 03:40, Hardie, Andrew wrote:


Hi
Andres,
What
you are doing wrong, it would seem, is re-using the source data file as the input file for the
 metadata. This is not how it works.
There
are two ways to add metadata:
A. From
a tab-delimited table file, with one text per line, and fields in columns. This is the function you are
using, and it is resulting in an error message because you are feeding back in the original
vertical file, which is not in the expected format.
B. From
XML in the original data. This is what you want to
do, but in order to do it you need to (a) have indexed all the attributes on the
text element as s-attributes, (b) use the function labelled “Create metadata table from 
corpus XML annotations” – accessed via the button low down on the screen labelled
“My metadata is embedded in the XML of my corpus!” – instead of the standard
function. 
Hope
that clarifies.
best
Andrew.
From: 
Andres Chandia [mailto:andres at chandia.net] 
 Sent: 27
December 2013 14:45
 To: Hardie,
Andrew
 Cc: Open source
development of the Corpus WorkBench
 Subject: problem
at managing corpus metadata
Hi there, so
long...
 
 I have indexed a test corpus and now I'm trying to add some metadata to
it but I always got error messages:
 
 this is what you can find at corpus:


 
 Almosnino almosnino NCMS000
 , , Fc
 Moshe moshe NCFS000
 . . Fp
 . . Fp
 
 
 Regimiento regimiento NCMS000
 de de SPS00
 la el DA0FS0
 vida vida NCFS000
 . . Fp
 . . Fp
 
 
 Salónica salónica NCFS000
 1564 1564
Z
 Transcription transcription NCFS000
 . . Fp
 . . Fp
 
 
 
 I add an image of the settings that I use to install metadata
 
 and this is what
I always got:
 
 CQPweb encountered an error and could not
continue. The data source you specified for the text metadata contains badly-formatted text ID
codes, as follows: ','; '.'; ''; ''; ''; ''; ''; ' CQPweb v3.0.7 © 2008-2012 Corpus and
tagset help You  are logged in as user [admin] 
 
 but as you can see at the
corpus above none of the metadatas contains what the error message says....
 
 well,
I don't know what I'm doing wrong, thanks in advance for your help....
 

_______________________
 andrés chandía
 [IMAGE
REMOVED]
 administrador de
 parles.upf.edu
 psicoaching.net
 mapuche
koyaktu
 ong mapuche koyaktu
 P No imprima innecesariamente. ¡Cuide el medio
ambiente!



 


 _______________________
 andrés chandía
 [IMAGE
REMOVED]
 administrador de
 parles.upf.edu
 psicoaching.net
 mapuche
koyaktu
 ong mapuche koyaktu
 P No imprima innecesariamente. ¡Cuide el medio
ambiente!



 


 
 _______________________
 andrés chandía
 [IMAGE REMOVED]
 administrador de
 parles.upf.edu
 psicoaching.net
 mapuche koyaktu
 ong mapuche
koyaktu
 P
No imprima innecesariamente. ¡Cuide
el medio ambiente!



 
 
 _______________________
            
andrés chandía
 
 administrador de
 parles.upf.edu
 psicoaching.net
 mapuche koyaktu
 ong mapuche koyaktu
 P No imprima innecesariamente. ¡Cuide el medio
ambiente!






 


_______________________
            andrés
chandía

administrador de
parles.upf.edu
psicoaching.net
mapuche koyaktu
ong mapuche koyaktu
P No imprima innecesariamente. ¡Cuide el medio ambiente!
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20131231/37bee371/attachment-0001.html>


More information about the CWB mailing list