[CWB] Manage parallel corpora?

"Andrés Chandía" andres at chandia.net
Mon Jan 16 15:17:40 CET 2017



Hi there, in the
8.4 Creating alignment attributes . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . 56
section it says:

"There exist multiple
methods for indexing alignment attributes. They are discussed in the CWB Corpus Encoding
Tutorial."

but in the  CWB Corpus Encoding Tutorial it says:

8
Sentence alignment
Describe sentence aligner, encoding of alignment attributes, and
import of pre-existing alignment   "TODO"

where can I find the
actual information about "alignment attributes"?


El Lun, 17 de
Octubre de 2016, 18:51, Hardie, Andrew escribió:
 



You
are looking at an old version. Versions of the manual more recent than July this year have the
following updated contents list for chapter 8:


8 Parallel corpus data 55 
8.1 Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . . 55
 
8.2 Setting up parallel corpora . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . 55
 
8.3 Naming alignment attributes . . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . 55
 
8.4 Creating alignment attributes . . . . . . . . . . . . . . . . . . . .
. . . . . . . . . . . . 56
 
8.5 Registering alignment attributes with CQPweb . . . . . . . . . . . .
. . . . . . . . . . 56
 
8.6 How alignment attributes can be used . . . . . . . . . . . . . . . .
. . . . . . . . . . . 57
 
8.7 Parallel corpora and user privileges . . . . . . . . . . . . . . . .
. . . . . . . . . . . . . 58


If
your code is fully updated to the repo, a copy with this content should be found under

/doc .


My
own server’s copy is also up to date.
 https://cqpweb.lancs.ac.uk/doc/CQPwebAdminManual.pdf
 


best


Andrew


From:
cwb-bounces at sslmit.unibo.it [mailto:cwb-bounces at sslmit.unibo.it]
 On Behalf Of
"Andrés Chandía"
 Sent: 17
October 2016 18:28
 To: andres at chandia.net; Open source development of
the Corpus WorkBench
 Subject: [CWB] Manage parallel corpora?
 
Sorry , I send it again with a proper subject....
 
  Hi
Andrew, I asked you
 
 ... I have seen there is a "Manage parallel
alignment" it means there is a way now to install parallel corpora, how?
 
 And
you answer me: " Yes, parallel corpora are now supported. See manual chapter 8."
 
 But at the manual, the point dedicated to parallel corpora says:
 
 8.6
Field data mode as a workaround for parallel corpora: TODO
 
 
 At the online
version, there is this:
 
 8 Sentence alignment 

Describe sentence aligner, encoding of alignment attributes, and
import of pre-existing alignment 
 
...?
 
 Thanks..
 
 
 On Mon, October 17, 2016 04:57, Hardie, Andrew wrote:


 


The
code that generates this error is quite simple
–
it’s just a mkdir calll:
 
if
(! mkdir($datadir))
exiterror("CQPweb
could not create a directory for the frequency index. Check filesystem
permissions!");
There
must be something at the system level stopping the directory being created (that is, the
directory
  {$your_corpus_index_data_directory}/{$name_of_corpus}__freq
 )being
created. If the directory is owned by www-data, perhaps you should check whether user has
permission to write to this directory?
On
the basis of the other issue you report, the problem may be that an earlier run of the
frequency-table
  setup code has created a __freq directory that cannot
 be deleted.
Then, mkdir fails, because the directory already exists (because it could not be
deleted).
The
message â€Ã
“has already been
created”
  indicates that that __freq directory does exist (which should only be the case if
frequency table setup was successful).
My
advice: manually remove the __freq corpus (its data directory and registry file). Then check
all
  relevant filesystem permissions. Then, once
you’re sure
 they are correct run
frequency list setup from scratch.
If
in doubt use offline freqlists instead of the interface buttons; that may give more
informative
  error messages if there is still a problem.
best
Andrew.
From:
  "Andrés
Chandía" [mailto:andres at chandia.net]
 
 Sent: 13 October
2016 20:05
 To: Hardie,
Andrew
 Cc: Open source
development of the Corpus WorkBench
 Subject: CQPweb
could not create a directory for the frequency index
I get this
error when I try to "Cretate the CWB Frequency table"
 CQPweb could not create
a directory for the frequency index. Check filesystem permissions!
 
 But there is
no error for text position records or frequency tables,
 
 directories: registry,
data and cqpweb are all owned: www-data:www-data
 
 but then if I refresh the Corpus
frequency list controls page, I can see: The text-by-text list for this corpus

has already been created.
 
 
 Another problem that I still
have is the creation of new users:
 
 A MySQL query did not run successfully!


 Original query: INSERT INTO user_info ( username, realname, email, passhash,
acct_status, expiry_time, acct_create_time, conc_kwicview, conc_corpus_order, cqp_syntax,
context_with_tags, use_tooltips,
 
 thin_default_reproducible, coll_statistic,
coll_freqtogether, coll_freqalone, coll_from, coll_to, max_dbsize, linefeed ) VALUES (
'jhuser', 'unknown person', 'jhuser at upf.edu',

'$2a$11$......................yl0Kl8h3xhUATbrHP7sKOq.GXmLA0AS', 0, 0, CURRENT_TIMESTAMP, 1, 1,
0, 0, 1, 1, , 5, 5, -3, 3, 1000000, 'au' ) /* from User: admin | Function:
do_append_mysql_comment() | 2016-Oct-13 19:02:22 */
 
 Error # 1064: You have an
error in your SQL syntax; check the manual that corresponds to your MySQL server version for
the right syntax to use near ' 5, 5, -3, 3, 1000000, 'au' ) /* from User: admin |
 

Function: do' at line 38
 
 
 Thanks
 _______________________

andrés chandía
 
 NMT |
 

Dungupeyem | 
 
 Corlexim
 
 administrador
de:
 Parles.upf |
 
 
Delingua | 
 Amind terapia | 
 
 Mapuche
koyaktu | 
 Nocando |
 mail: 
 ONG Mapuche koyaktu | 
 
 Psicoaching |
 P No imprima innecesariamente. ¡Cuide el
medio ambiente!



 




 
 
 
 
 _______________________

andrés chandía
 
 NMT |
 

Dungupeyem | 
 Corlexim
 
 administrador de:
 Parles.upf |
 
Delingua | 
 Amind terapia | 
 Mapuche koyaktu | 
 Nocando |
 mail: 
 ONG Mapuche koyaktu | 
 Psicoaching |
 P No imprima innecesariamente. ¡Cuide el
medio ambiente!
 
 
 
 _______________________
            
andrés chandía
 
 NMT |
 

Dungupeyem | 
 Corlexim
 
 administrador de:
 Parles.upf | 
Delingua | Amind terapia |

 Mapuche koyaktu | Nocando |

mail: ONG Mapuche koyaktu | 
 Psicoaching |
 P No imprima innecesariamente. ¡Cuide el medio
ambiente!


 

 
   


_______________________

            andrés
chandía

NMT |
Dungupeyem | Corlexim

administrador de:
Parles.upf | Amind
terapia | Mapuche koyaktu | Nocando |
mail: ONG Mapuche koyaktu | Psicoaching |
P No imprima innecesariamente. ¡Cuide el medio ambiente!
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20170116/5db13aba/attachment.html>


More information about the CWB mailing list