[CWB] Importing large CoNLL-U corpora

Fabricio Chalub fcbrbr at gmail.com
Thu Nov 30 16:53:08 CET 2017


Hi,

we want to import about 170K files of CoNLL-U files into CWB/CQPweb
(at least the POS and lemma parts as I understand that dependencies
are not supported).

I was wondering if anyone here has any scripts already written for
this task, even if they are temporary hacks.  Any pointers?

cheers,
Fabricio
-- 
Fabricio Chalub
http://fcbr.github.io/
http://researcher.ibm.com/person/br-fchalub


More information about the CWB mailing list