[CWB] CQPweb now supports parallel corpora

Giorgina Cerutti Benitez Giorgina.Cerutti at unige.ch
Tue Aug 2 11:03:37 CEST 2016


Yes, sorry, my question was if it is currently possible to upload tri-/quadri-lingual corpora to CQPweb, as the manual makes mainly reference to corpora in language A and B.

Best,

Giorgina

De : cwb-bounces at liste.sslmit.unibo.it<mailto:cwb-bounces at liste.sslmit.unibo.it> [mailto:cwb-bounces at liste.sslmit.unibo.it] De la part de Hardie, Andrew
Envoyé : mardi 2 août 2016 11:00
À : Open source development of the Corpus WorkBench <cwb at sslmit.unibo.it<mailto:cwb at sslmit.unibo.it>>
Objet : Re: [CWB] CQPweb now supports parallel corpora

Can you expand on the question? The Europarl corpora are not only tri-lingual, they are six-lingual*, so they already demonstrate how the system can go beyond just 2 languages.

(*Hexilingual? I’m not sure that’s a word.)

best

Andrew.

From: Giorgina Cerutti Benitez [mailto:Giorgina.Cerutti at unige.ch]
Sent: 02 August 2016 09:57
To: Open source development of the Corpus WorkBench
Cc: Hardie, Andrew
Subject: TR: CQPweb now supports parallel corpora

Dear Andrew,

Thank you very much for this very good news. I’ve been going through the Europarl corpora as well as through the manual and I wonder if trilingual data can be supported or if it should be treated as you did by uploading it by language pair.

Best regards,

Giorgina

De : cwb-bounces at liste.sslmit.unibo.it<mailto:cwb-bounces at liste.sslmit.unibo.it> [mailto:cwb-bounces at liste.sslmit.unibo.it] De la part de Hardie, Andrew
Envoyé : lundi 1 août 2016 00:41
À : Open source development of the Corpus WorkBench <cwb at sslmit.unibo.it<mailto:cwb at sslmit.unibo.it>>
Objet : [CWB] CQPweb now supports parallel corpora

Hi everyone,

CQPweb v 3.2.22 is now in the SVN repo. It adds support for parallel corpora. Since this is a much requested feature that has been on-the-list for several years, I thought it was worth sending a note to the list to let everyone know it has appeared.

Documentation for setup is in chapter 8 of the manual (also in SVN, also here: https://cqpweb.lancs.ac.uk/doc/CQPwebAdminManual.pdf

I am working on adding the Europarl corpora on the Lancaster server, so people who don’t have their own server but are interested in parallel corpora can try it out. This should be done by Mon a.m.  UK time.

Bug reports are, as ever, most appreciated.

Known issue: display of parallel data works when in categorisation mode, but there is currently no widget in the interface to switch it on (it can be switched on by manually entering the right attribute handle in the URL, but that is not then preserved across subsequent sessions in the categorisation UI). This will be fixed in a subsequent version.

best

Andrew.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20160802/5aeb32ed/attachment.html>


More information about the CWB mailing list