[CWB] Exporting an aligned corpus

Alberto Simões ambs at di.uminho.pt
Sun Oct 31 22:36:22 CET 2010


Hello

I would like to process a parallel corpus programmatically. I can see 
two different ways to do this:

- using some kind of C/Perl API that exposes a cursor or iterator and 
lets me look at one "sentence" pair at a time,

- or performing a textual dump of the aligned corpus to any suitable 
format, and dealing with that format in my application.

What is the best approach? What should I use?

Thank you
Alberto
-- 
Alberto Simões

