[CWB] Alignment format
Alberto Simões
ambs at di.uminho.pt
Thu Feb 4 21:47:53 CET 2010
Hello, Serge.
Thanks for the answer.
I know how to use easyalign and align a bitext.
In this case I have the sentence aligned bitext and want to add it to
CQP. Was hoping there was a simple textual file format I could use for
the alignment.
Unfortunately the document you suggested seems not to include that
information. Especially because the section on alignment attributes is
empty ;)
Thanks,
Alberto
On 04/02/2010 19:38, Heiden Serge wrote:
> Alberto,
>
> the "Corpus Administrator’s Manual"
> that you can find here :
> http://bulba.sdsu.edu/technical-manual.ps
> gives you instructions on how to align to different
> CWB corpus on a specific structural attribute
> (not positional) like your 's' .
>
> Regards,
> Serge
>
> Selon Alberto Simões le 04/02/2010 16:46:
>> Hello
>>
>> I was looking to the encode tutorial but it misses the alignment part :)
>> I would like to know how is alignment encoded. Is it as a common
>> attribute?
>>
>> Let's say:
>>
>> <s>
>> I 1
>> saw 1
>> a 1
>> cat 1
>> </s>
>> <s>
>> The 2
>> house 2
>> is 2
>> blue 2
>> </s>
>>
>> and
>>
>> <s>
>> Eu 1
>> vi 1
>> um 1
>> gato 1
>> </s>
>> <s>
>> A 2
>> casa 2
>> é 2
>> azul 2
>> </s>
>>
>> Is this the case?
>> If so, identifiers can be used in multiple sentences?
>>
>> <s>
>> Yes 1
>> ! 1
>> </s>
>> <s>
>> Sure 1
>> ! 1
>> </s>
>>
>> and
>>
>> <s>
>> Sim 1
>> , 1
>> claro 1
>> ! 1
>> </s>
>>
>> Thanks
>> Alberto
>>
--
Alberto Simões
More information about the CWB
mailing list