[CWB] Escape "<" and ">" symbols

Stefan Evert stefanML at collocations.de
Wed Feb 21 09:16:11 CET 2018



> On 20 Feb 2018, at 17:57, mansur <6688000 at gmail.com> wrote:
> 
> Could you explain how to escape "<" and ">" symbols in morphological tags, that produces Apertium's analyser? 
> cwb-encode tries to parse them as structural tags along with <s> and <text>.

Ruprecht gave the proper answer, but there's no need to do that unless the < character occurs at the start of the line.  If the morphological tags are always in the second column, cwb-encode will simply treat them as plain strings.

You'll get complaints about the empty <g/> elements, of course.

Best,
Stefan


More information about the CWB mailing list