[CWB] Assertion failed
Ruprecht von Waldenfels
ruprecht.waldenfels at gmx.net
Fri Dec 18 13:06:20 CET 2015
Yes, it involves parallel corpora. This is a mistake I get when I use a
lot of aligned corpora, part of which have a lot of positional
attributes with extensive word alignment information.
And yes, the command itself does not give a mistake:
BULGAKOVMASTER_RU; show +bulgakovmaster_uk; show +uk; show
+bulgakovmaster_uka; show +uka; show +bulgakovmaster_be; show +be; show
+bulgakovmaster_pl; show +pl; show +bulgakovmaster_pla; show +pla; show
+bulgakovmaster_cz; show +cz; show +bulgakovmaster_sk; show +sk; show
+bulgakovmaster_sl; show +sl; show +bulgakovmaster_hr; show +hr; show
+bulgakovmaster_sr; show +sr; show +bulgakovmaster_sra; show +sra; show
+bulgakovmaster_mk; show +mk; show +bulgakovmaster_bg; show +bg; show
+bulgakovmaster_de; show +bulgakovmaster_ee; show +bulgakovmaster_el;
show +bulgakovmaster_en; show +bulgakovmaster_eo; show
+bulgakovmaster_es; show +bulgakovmaster_fr; show +bulgakovmaster_hu;
show +bulgakovmaster_hy; show +bulgakovmaster_it; show
+bulgakovmaster_lv; show +bulgakovmaster_nl; show +bulgakovmaster_pt;
show +bulgakovmaster_ro; set PrintMode sgml; show +tag; show +tag2;
show +lemma; show +id; set Context 1s;
A=@[tag="^V.*" & ( ( lemma=".*^видеть$.*" & tag=".*^V.*.*" ) | (
lemma=".*^увидеть$.*" & tag=".*^V.*.*" ) | uk contains
"=w:.*?=i:.*?=l:бачити=t:V.*.*" | uk contains
"=w:.*?=i:.*?=l:побачити=t:V.*.*" | uka contains
"=w:.*?=i:.*?=l:бачити=t:V.*.*" | uka contains
"=w:.*?=i:.*?=l:побачити=t:V.*.*" | pl contains
"=w:.*?=i:.*?=l:widzieć=t:(fin|praet|inf|imps|impt|pact|ppas|pcon|pant|ger).*.*"
| pl contains
"=w:.*?=i:.*?=l:(zobaczyć|ujrzeć)=t:(fin|praet|inf|imps|impt|pact|ppas|pcon|pant|ger).*.*"
| pla contains
"=w:.*?=i:.*?=l:widzieć=t:(fin|praet|inf|imps|impt|pact|ppas|pcon|pant|ger).*.*"
| pla contains
"=w:.*?=i:.*?=l:(zobaczyć|ujrzeć)=t:(fin|praet|inf|imps|impt|pact|ppas|pcon|pant|ger).*.*"
| cz contains "=w:.*?=i:.*?=l:(ne)?vidět=t:V.*.*" | cz contains
"=w:.*?=i:.*?=l:(ne)?uvidět=t:V.*.*" | sk contains
"=w:.*?=i:.*?=l:(ne)?vidieť=t:Vm.*.*" | sk contains
"=w:.*?=i:.*?=l:(ne)?zazrieť=t:Vm.*.*" | sl contains
"=w:.*?=i:.*?=l:videti=t:Gg.*.*" | sl contains
"=w:.*?=i:.*?=l:zagledati=t:Gg.*.*" | hr contains
"=w:.*?=i:.*?=l:vidje.*=t:Vm.*.*" | hr contains
"=w:.*?=i:.*?=l:(ugleda.*|ugled)=t:(Vm|N).*.*" | sr contains
"=w:.*?=i:.*?=l:(vidj?e.*|video)=t:(Vm|N).*.*" | sr contains
"=w:.*?=i:.*?=l:(ugleda.*|ugled)=t:(Vm|N).*.*" | sra contains
"=w:.*?=i:.*?=l:(vidj?e.*|video)=t:(Vm|N).*.*" | sra contains
"=w:.*?=i:.*?=l:(ugleda.*|ugled)=t:(Vm|N).*.*" | mk contains
"=w:.*?=i:.*?=l:гледа.*=t:[vV].*.*" | mk contains
"=w:.*?=i:.*?=l:види=t:[vV].*.*" | bg contains
"=w:.*?=i:.*?=l:виждам?=t:V(L|2).*.*" | bg contains
"=w:.*?=i:.*?=l:видя=t:V(L|2).*.*" )];
rather,
reduce A to 50; cat A;
produces:
BULGAKOVMASTER_RU> cat A > "tst.txt"
cqp: concordance.c:572: compose_kwic_line: Assertion `match_start >= 0
&& match_start < text_size' failed.
Abgebrochen (Speicherabzug geschrieben)
I have to say this is after I copied the alignments from a different
version of BULGAKOVMASTER_RU. Maybe this is somehow connected. I am not
sure it all worked as it should, there is a glitch with the original
alignments with this corpus that I still have to look into in some detail.
Thanks! Best,
Ruprecht
More information about the CWB
mailing list