[CWB] copying alignments

Stefan Evert stefanML at collocations.de
Thu Dec 17 08:27:11 CET 2015


> On 16 Dec 2015, at 18:59, Stefan Evert <stefanML at collocations.de> wrote:
> 
> (Side remark: cwb-align-encode and cwb-align-decode also use corpus positions.  Only the aligner software and cwb-align-import reference a structural grid.  If your alignment comes from an external software, it's definitely safer and better to re-import it for each corpus.)

I just remembered that there's also a cwb-align-export script, which pairs up with cwb-align-import.  This should be the safest way of copying a sentence alignment from one corpus to another – even if you've changed tokenization or reshuffled texts (as long as sentence IDs haven't changed).

Cheers,
Stefan


More information about the CWB mailing list