[CWB] other kind of annotations in cwb corpus

Yannick Versley yversley at gmail.com
Mon Feb 14 17:15:06 CET 2011


Hi Luigi,

it is possible to create additional "slots" (i.e. P-attributes) for the
whole corpus
by importing them using cwb-encode with the right settings, adding the
column
to the registry entry, and then re-running cwb-makeall. This is most useful
when
you have run it on the whole corpus (and you are sure that the tokenization
did
not change), as you need to have the additional slot values for the whole
corpus
(and the whole process is somewhat error-prone since there is no way to tell
cwb-encode "verify that column 1 is still equal to the word column, and add
column 2 as the morph column").

The (conceptually) simpler way to do this would be to dump the whole corpus
(using cwb-decode), run your favorite tools on it to get a version with the
additional
annotations, and then replace the old data directory with the cwb-encode'd
version
of your new, enriched version of the corpus.

Best wishes,
Yannick Versley

On Mon, Feb 14, 2011 at 4:25 PM, luigi.talamo at libero.it <
luigi.talamo at libero.it> wrote:

> Hi there,
> referring to figure 1 in the CQP tutorial, I was wondering if it is
> possible
> to annotate a corpus with further annotations, say, morphological
> derivation,
> prosodic information and so on...
> Actually, I think it is already realized in SSLMIT's released corpora,
> REPUBBLICA and ITWAC where you can get (some) inflectional information on a
> word:
>
> parlerebbero/VER:fin
>
> However, this kind of annotation is 'embedded' in the POS slot: it is
> possible
> to create further slots to accommodate above mentioned information ?
>
> I'm sorry if it was a trivial question, but I cannot find anything related
> to
> it in the documentation.
>
> Cheers,
>
> Luigi
> _______________________________________________
> CWB mailing list
> CWB at sslmit.unibo.it
> http://devel.sslmit.unibo.it/mailman/listinfo/cwb
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20110214/0f45bbdd/attachment.htm


More information about the CWB mailing list