[CWB] Manage parallel corpora?
"Andrés Chandía"
andres at chandia.net
Tue Jan 24 21:16:07 CET 2017
Ok, here it is what I have made available at our interface:
BancTrad Deutsch-CatalÃ
(de)
BancTrad
Castellano-Français (fr)
Â
BancTrad English-Castellano (en)
BancTrad Deutsch-Castellano
(de)
BancTrad
English-Català (en)
LTC English-Català o
(en)
LTC
English-Català d (en)
LTC English-Català v12 (en)
thanks...
El Mar, 17 de Enero de 2017, 15:45,
Hardie, Andrew escribió:
The
latex source of the manuals / tutorials is on a different main path in the repo:
http://svn.code.sf.net/p/cwb/code/doc/tutorials
but
in any case I have now updated the copies on the website.
best
Andrew.
From:
cwb-bounces at sslmit.unibo.it [mailto:cwb-bounces at sslmit.unibo.it]
On Behalf Of
"Andrés ChandÃÂa"
Sent: 16
January 2017 17:32
To: Open source development of the Corpus
WorkBench
Subject: Re: [CWB] Manage parallel corpora?
I tried:
svn export
http://svn.code.sf.net/p/cwb/code/cwb/branches/3.0 cwb-3.0
and
svn export
http://svn.code.sf.net/p/cwb/code/cwb/trunk
cwb
but I couldn't find it neither....
El Lun,
16 de Enero de 2017, 18:26, Hardie, Andrew escribió:
This
is just a matter of the version of the tutorial on the website not being up to date with what
is in the SVN repo âââ‰â¬Å apologies,
Iâââ‰â¢ll update it tomorrow
morning!
best
Andrew.
From:
cwb-bounces at sslmit.unibo.it [mailto:cwb-bounces at sslmit.unibo.it]
On Behalf Of
"Andr̮̩s
ChandÃÆÃÂa"
Sent: 16 January
2017 14:18
To: Open source
development of the Corpus WorkBench
Subject: Re:
[CWB] Manage parallel corpora?
Hi there, in the
8.4 Creating alignment attributes . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. . . 56
section it says:
"There exist multiple methods for indexing
alignment attributes. They are discussed in the CWB Corpus Encoding Tutorial."
but in the CWB Corpus Encoding Tutorial it says:
8 Sentence alignment
Describe sentence aligner, encoding of alignment attributes, and import of pre-existing
alignment "TODO"
where can I find the actual information about
"alignment attributes"?
El Lun, 17 de Octubre de 2016, 18:51,
Hardie, Andrew escribiÃÆó:
You
are looking at an old version. Versions of the manual more recent than July this year have
the
following updated contents list for chapter 8:
8 Parallel
corpus data 55
8.1
Introduction . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . .
. 55
8.2 Setting up
parallel corpora . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
8.3 Naming
alignment attributes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 55
8.4 Creating
alignment attributes . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 56
8.5
Registering alignment attributes with CQPweb . . . . . . . . . . . . . . . . . . . . . .
56
8.6 How
alignment attributes can be used . . . . . . . . . . . . . . . . . . . . . . . . . . . 57
8.7 Parallel
corpora and user privileges . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 58
If
your code is fully updated to the repo, a copy with this content should be found under
/doc
.
My
own
serverÃÆââââ¬Å¡Ã¬Ã¢ââ¬Ã¢s
copy is also up to date.
https://cqpweb.lancs.ac.uk/doc/CQPwebAdminManual.pdf
best
Andrew
From:
cwb-bounces at sslmit.unibo.it [mailto:cwb-bounces at sslmit.unibo.it]
On Behalf Of
"Andr̮̩̉̉s
ChandÃÆÃâÃâÃÂa"
Sent: 17 October
2016 18:28
To:
andres at chandia.net; Open source development of the Corpus
WorkBench
Subject: [CWB]
Manage parallel corpora?
Sorry , I send
it again with a proper subject....
Hi Andrew, I asked you
... I have
seen there is a "Manage parallel alignment" it means there is a way now to install
parallel corpora, how?
And you answer me: " Yes, parallel corpora are now
supported. See manual chapter 8."
But at the manual, the point dedicated to
parallel corpora says:
8.6 Field data mode as a workaround for parallel corpora:
TODO
At the online version, there is this:
8 Sentence alignment
Describe sentence aligner, encoding of alignment attributes, and
import of pre-existing alignment
...?
Thanks..
On Mon, October 17, 2016 04:57, Hardie, Andrew wrote:
The
code that generates this error is quite simple
ÃÆÃâÃââÃÆââââ¬Ã
¡ÃâìÃÆââââ¬Å¡Ã¬Ãâ¦Ã¢â¬Å
itÃÆÃâÃââÃÆââââ¬Ã
¡ÃâìÃÆââââ¬Ã
¾Ãââs
just
a mkdir calll:
if
(! mkdir($datadir))
exiterror("CQPweb
could not create a directory for the frequency index. Check filesystem
permissions!");
There
must be something at the system level stopping the directory being created (that is, the
directory
{$your_corpus_index_data_directory}/{$name_of_corpus}__freq
)being
created. If the directory is owned by www-data, perhaps you should check whether user has
permission to write to this directory?
On
the basis of the other issue you report, the problem may be that an earlier run of the
frequency-table
setup code has created a __freq directory that cannot
be deleted.
Then, mkdir fails, because the directory already exists (because it could not be
deleted).
The
message
ÃÆÃâÃââÃÆââââ¬Ã
¡ÃâìÃÆââ¬Â¦Ã¢ââ¬Ã
âhas
already been
createdÃÆÃâÃââÃÆââââ¬Ã
¡Ãâì
indicates that that __freq directory does exist (which should only be the case if frequency
table setup was successful).
My
advice: manually remove the __freq corpus (its data directory and registry file). Then check
all
relevant filesystem permissions. Then, once
youÃÆÃâÃââÃÆââââ¬Ã
¡ÃâìÃÆââââ¬Ã
¾Ãââre
sure
they are correct run frequency list setup from scratch.
If
in doubt use offline freqlists instead of the interface buttons; that may give more
informative
error messages if there is still a problem.
best
Andrew.
From:
"AndrÃÆÃâÃâ ââ¬â¢ÃÆââ¬Å¡Ãâés
ChandÃÆÃâÃâ ââ¬â¢ÃÆââ¬Å¡ÃâÃÂa"
[mailto:andres at chandia.net]
Sent: 13 October
2016 20:05
To: Hardie,
Andrew
Cc: Open source
development of the Corpus WorkBench
Subject: CQPweb
could not create a directory for the frequency index
I get this
error when I try to "Cretate the CWB Frequency table"
CQPweb could not create
a directory for the frequency index. Check filesystem permissions!
But there is
no error for text position records or frequency tables,
directories: registry,
data and cqpweb are all owned: www-data:www-data
but then if I refresh the Corpus
frequency list controls page, I can see: The text-by-text list for this corpus
has already been created.
Another problem that I still
have is the creation of new users:
A MySQL query did not run successfully!
Original query: INSERT INTO user_info ( username, realname, email, passhash,
acct_status, expiry_time, acct_create_time, conc_kwicview, conc_corpus_order, cqp_syntax,
context_with_tags, use_tooltips,
thin_default_reproducible, coll_statistic,
coll_freqtogether, coll_freqalone, coll_from, coll_to, max_dbsize, linefeed ) VALUES (
'jhuser', 'unknown person', 'jhuser at upf.edu',
'$2a$11$......................yl0Kl8h3xhUATbrHP7sKOq.GXmLA0AS', 0, 0, CURRENT_TIMESTAMP, 1, 1,
0, 0, 1, 1, , 5, 5, -3, 3, 1000000, 'au' ) /* from User: admin | Function:
do_append_mysql_comment() | 2016-Oct-13 19:02:22 */
Error # 1064: You have an
error in your SQL syntax; check the manual that corresponds to your MySQL server version for
the right syntax to use near ' 5, 5, -3, 3, 1000000, 'au' ) /* from User: admin |
Function: do' at line 38
Thanks
_______________________
andrÃÆÃâÃâ ââ¬â¢ÃÆââ¬Å¡Ãâés
chandÃÆÃâÃâ ââ¬â¢ÃÆââ¬Å¡ÃâÃÂa
[IMAGE REMOVED]
NMT |
Dungupeyem |
Corlexim
administrador de:
Parles.upf |
Delingua |
Amind terapia |
Mapuche koyaktu |
Nocando |
mail:
ONG Mapuche koyaktu |
Psicoaching |
P No imprima innecesariamente.
ÃÆââ¬Å¡ÃâáCuide el medio
ambiente!
_______________________
andr̮̩s
chandÃÆÃÂa
[IMAGE REMOVED]
NMT |
Dungupeyem |
Corlexim
administrador de:
Parles.upf |
Amind terapia |
Mapuche koyaktu |
Nocando |
mail:
ONG Mapuche koyaktu |
Psicoaching |
P
No imprima innecesariamente.
ÃâáCuide el medio ambiente!
_______________________
andrés chandÃÂa
NMT |
Dungupeyem |
Corlexim
administrador de:
Parles.upf |
Amind terapia | Mapuche koyaktu |
Nocando
|
mail: ONG Mapuche koyaktu |
Psicoaching |
P
No imprima innecesariamente. áCuide el medio
ambiente!
_______________________
            andrés
chandÃa
NMT |
Dungupeyem | Corlexim
administrador de:
Parles.upf | Amind
terapia | Mapuche koyaktu | Nocando |
mail: ONG Mapuche koyaktu | Psicoaching |
P No imprima innecesariamente. ¡Cuide el medio ambiente!
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20170124/f12069ca/attachment-0001.html>
More information about the CWB
mailing list