[CWB] Alignment format

Alberto Simões ambs at di.uminho.pt
Thu Feb 4 21:47:53 CET 2010


Hello, Serge.

Thanks for the answer.

I know how to use easyalign and align a bitext.
In this case I have the sentence aligned bitext and want to add it to
CQP. Was hoping there was a simple textual file format I could use for
the alignment.

Unfortunately the document you suggested seems not to include that
information. Especially because the section on alignment attributes is
empty ;)

Thanks,
Alberto

On 04/02/2010 19:38, Heiden Serge wrote:
> Alberto,
> 
> the "Corpus Administrator’s Manual"
> that you can find here :
> http://bulba.sdsu.edu/technical-manual.ps
> gives you instructions on how to align to different
> CWB corpus on a specific structural attribute
> (not positional) like your 's' .
> 
> Regards,
> Serge
> 
> Selon Alberto Simões le 04/02/2010 16:46:
>> Hello
>>
>> I was looking to the encode tutorial but it misses the alignment part :)
>> I would like to know how is alignment encoded. Is it as a common
>> attribute?
>>
>> Let's say:
>>
>> <s>
>> I    1
>> saw  1
>> a    1
>> cat  1
>> </s>
>> <s>
>> The   2
>> house 2
>> is    2
>> blue  2
>> </s>
>>
>> and
>>
>> <s>
>> Eu    1
>> vi    1
>> um    1
>> gato  1
>> </s>
>> <s>
>> A     2
>> casa  2
>> é     2
>> azul  2
>> </s>
>>
>> Is this the case?
>> If so, identifiers can be used in multiple sentences?
>>
>> <s>
>> Yes 1
>> !   1
>> </s>
>> <s>
>> Sure 1
>> !    1
>> </s>
>>
>> and
>>
>> <s>
>> Sim   1
>> ,     1
>> claro 1
>> !     1
>> </s>
>>
>> Thanks
>> Alberto
>>    

-- 
Alberto Simões


More information about the CWB mailing list