[CWB] Manage parallel corpora?

Hardie, Andrew a.hardie at lancaster.ac.uk
Mon Oct 17 19:51:55 CEST 2016


You are looking at an old version. Versions of the manual more recent than July this year have the following updated contents list for chapter 8:

8 Parallel corpus data
8.1 Introduction
8.2 Setting up parallel corpora
8.3 Naming alignment attributes
8.4 Creating alignment attributes
8.5 Registering alignment attributes with CQPweb
8.6 How alignment attributes can be used
8.7 Parallel corpora and user privileges

If your code is fully up to date with the repo, a copy with this content should be found under /doc.

My own server’s copy is also up to date: https://cqpweb.lancs.ac.uk/doc/CQPwebAdminManual.pdf

best

Andrew

From: cwb-bounces at sslmit.unibo.it [mailto:cwb-bounces at sslmit.unibo.it] On Behalf Of "Andrés Chandía"
Sent: 17 October 2016 18:28
To: andres at chandia.net; Open source development of the Corpus WorkBench
Subject: [CWB] Manage parallel corpora?

Sorry, I am sending this again with a proper subject.

Hi Andrew, I asked you:

... I have seen there is a "Manage parallel alignment" option. Does that mean there is now a way to install parallel corpora? How?

And you answered: "Yes, parallel corpora are now supported. See manual chapter 8."

But in the manual, the section dedicated to parallel corpora says:

8.6 Field data mode as a workaround for parallel corpora: TODO


In the online version, there is this:

8 Sentence alignment

Describe sentence aligner, encoding of alignment attributes, and import of pre-existing alignment
...?

Thanks..


On Mon, October 17, 2016 04:57, Hardie, Andrew wrote:

The code that generates this error is quite simple; it’s just a mkdir call:
if (! mkdir($datadir))
    exiterror("CQPweb could not create a directory for the frequency index. Check filesystem permissions!");
There must be something at the system level stopping the directory from being created (that is, the directory {$your_corpus_index_data_directory}/{$name_of_corpus}__freq). If the directory is owned by www-data, perhaps you should check whether that user has permission to write to this directory?
On the basis of the other issue you report, the problem may be that an earlier run of the frequency-table setup code has created a __freq directory that cannot be deleted. Then mkdir fails, because the directory already exists (because it could not be deleted).
The message “has already been created” indicates that the __freq directory does exist (which should only be the case if frequency table setup was successful).
My advice: manually remove the __freq corpus (its data directory and registry file). Then check all relevant filesystem permissions. Then, once you’re sure they are correct, run frequency list setup from scratch.
If in doubt, use offline freqlists instead of the interface buttons; that may give more informative error messages if there is still a problem.
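The checks described above can be sketched as a small script. This is only an illustration in Python, not part of CQPweb: the function name diagnose_mkdir and the example path are hypothetical, and it only looks at the causes mentioned here (the directory already existing, or the parent directory being missing or not writable). Permissions are tested for whoever runs the script, so it would need to be run as the web server user (for example www-data) to be meaningful.

```python
import os


def diagnose_mkdir(target):
    """Return a short code describing whether creating `target` is likely to fail.

    "exists"              -> something is already at that path, so mkdir would fail
    "no-parent"           -> the parent directory does not exist
    "parent-not-writable" -> the current user cannot create entries in the parent
    "ok"                  -> none of the problems checked here was found
    """
    parent = os.path.dirname(os.path.abspath(target))
    if os.path.lexists(target):
        # A leftover __freq directory from an earlier run ends up here.
        return "exists"
    if not os.path.isdir(parent):
        return "no-parent"
    # Creating a directory needs write and search permission on the parent.
    if not os.access(parent, os.W_OK | os.X_OK):
        return "parent-not-writable"
    return "ok"
```

Calling diagnose_mkdir() on the path of the __freq data directory (the index data directory plus the corpus name followed by __freq) would return "exists" in the situation described above, where an earlier run left the directory behind.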
best
Andrew.
From: "Andrés Chandía" [mailto:andres at chandia.net]

Sent: 13 October 2016 20:05
To: Hardie, Andrew
Cc: Open source development of the Corpus WorkBench
Subject: CQPweb could not create a directory for the frequency index
I get this error when I try to "Create the CWB Frequency table":
CQPweb could not create a directory for the frequency index. Check filesystem permissions!

But there is no error for text position records or frequency tables.

The directories registry, data and cqpweb are all owned by www-data:www-data.

But then, if I refresh the Corpus frequency list controls page, I can see: The text-by-text list for this corpus has already been created.


Another problem that I still have is the creation of new users:

A MySQL query did not run successfully!

Original query: INSERT INTO user_info ( username, realname, email, passhash, acct_status, expiry_time, acct_create_time, conc_kwicview, conc_corpus_order, cqp_syntax, context_with_tags, use_tooltips,
thin_default_reproducible, coll_statistic, coll_freqtogether, coll_freqalone, coll_from, coll_to, max_dbsize, linefeed ) VALUES ( 'jhuser', 'unknown person', 'jhuser at upf.edu',
'$2a$11$......................yl0Kl8h3xhUATbrHP7sKOq.GXmLA0AS', 0, 0, CURRENT_TIMESTAMP, 1, 1, 0, 0, 1, 1, , 5, 5, -3, 3, 1000000, 'au' ) /* from User: admin | Function: do_append_mysql_comment() | 2016-Oct-13 19:02:22 */

Error # 1064: You have an error in your SQL syntax; check the manual that corresponds to your MySQL server version for the right syntax to use near ' 5, 5, -3, 3, 1000000, 'au' ) /* from User: admin | Function: do' at line 38
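
Lining up the column list of the failing query with its VALUES list shows where the syntax error comes from: one slot is empty. A rough sketch in Python (not CQPweb code; find_empty_slots is a hypothetical helper, the password hash is replaced by a placeholder, and the naive comma split assumes that no value contains a comma, which holds for this query):

```python
COLUMNS = (
    "username, realname, email, passhash, acct_status, expiry_time, "
    "acct_create_time, conc_kwicview, conc_corpus_order, cqp_syntax, "
    "context_with_tags, use_tooltips, thin_default_reproducible, "
    "coll_statistic, coll_freqtogether, coll_freqalone, coll_from, "
    "coll_to, max_dbsize, linefeed"
)

# Values copied from the error report; '<hash>' stands in for the password hash.
VALUES = (
    "'jhuser', 'unknown person', 'jhuser at upf.edu', '<hash>', 0, 0, "
    "CURRENT_TIMESTAMP, 1, 1, 0, 0, 1, 1, , 5, 5, -3, 3, 1000000, 'au'"
)


def find_empty_slots(columns, values):
    """Pair each column with its value and return the columns whose value is empty."""
    cols = [c.strip() for c in columns.split(",")]
    vals = [v.strip() for v in values.split(",")]
    if len(cols) != len(vals):
        raise ValueError(f"{len(cols)} columns but {len(vals)} values")
    return [col for col, val in zip(cols, vals) if val == ""]


print(find_empty_slots(COLUMNS, VALUES))  # → ['coll_statistic']
```

For this query the empty slot lines up with coll_statistic, which matches the position MySQL complains about (just before ' 5, 5, -3, 3, ...'). Why no value was supplied for that column is not something the error message itself reveals.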


Thanks
_______________________
andrés chandía
[Image removed by sender. chandia.net]<http://www.chandia.net>[Image removed by sender.]<https://twitter.com/andreschandia>
NMT<http://chandia.net/content/nmt> |

Dungupeyem<http://chandia.net/content/dungupeyem> |
Corlexim<http://chandia.net/corlexim>

administrador de:
Parles.upf<http://parles.upf.edu> |
Delingua<http://www.delingua.es> | Amind terapia<http://amindterapia.com> |
Mapuche koyaktu<http://koyaktumapuche.net> | Nocando<http://parles.upf.edu/llocs/nocando> |
mail: ONG Mapuche koyaktu<http://mail.corporacionkoyaktu.net> |
Psicoaching<http://mail.psicoaching.net> |
P No imprima innecesariamente. ¡Cuide el medio ambiente!





