[CWB] Escape "<" and ">" symbols
Stefan Evert
stefanML at collocations.de
Wed Feb 21 09:16:11 CET 2018
> On 20 Feb 2018, at 17:57, mansur <6688000 at gmail.com> wrote:
>
> Could you explain how to escape "<" and ">" symbols in morphological tags, that produces Apertium's analyser?
> cwb-encode tries to parse them as structural tags along with <s> and <text>.
Ruprecht gave the proper answer, but there's no need to do that unless the < character occurs at the start of the line. If the morphological tags are always in the second column, cwb-encode will simply treat them as plain strings.
You'll get complaints about the empty <g/> elements, of course.
Best,
Stefan
More information about the CWB
mailing list