[CWB] Assertion failed

Ruprecht von Waldenfels ruprecht.waldenfels at gmx.net
Fri Dec 18 13:06:20 CET 2015


Yes, it involves parallel corpora. This is an error I get when I use a 
lot of aligned corpora, some of which have many positional 
attributes with extensive word alignment information.

And yes, the command itself does not produce an error:
BULGAKOVMASTER_RU;  show +bulgakovmaster_uk; show +uk; show 
+bulgakovmaster_uka; show +uka; show +bulgakovmaster_be; show +be; show 
+bulgakovmaster_pl; show +pl; show +bulgakovmaster_pla; show +pla; show 
+bulgakovmaster_cz; show +cz; show +bulgakovmaster_sk; show +sk; show 
+bulgakovmaster_sl; show +sl; show +bulgakovmaster_hr; show +hr; show 
+bulgakovmaster_sr; show +sr; show +bulgakovmaster_sra; show +sra; show 
+bulgakovmaster_mk; show +mk; show +bulgakovmaster_bg; show +bg; show 
+bulgakovmaster_de; show +bulgakovmaster_ee;  show +bulgakovmaster_el; 
show +bulgakovmaster_en;  show +bulgakovmaster_eo;  show 
+bulgakovmaster_es;  show +bulgakovmaster_fr;  show +bulgakovmaster_hu; 
  show +bulgakovmaster_hy;  show +bulgakovmaster_it;  show 
+bulgakovmaster_lv;  show +bulgakovmaster_nl;  show +bulgakovmaster_pt; 
  show +bulgakovmaster_ro; set PrintMode sgml; show +tag; show +tag2; 
show +lemma; show +id; set Context 1s;

A=@[tag="^V.*" & (  ( lemma=".*^видеть$.*" & tag=".*^V.*.*" )  |  ( 
lemma=".*^увидеть$.*" & tag=".*^V.*.*" ) | uk contains 
"=w:.*?=i:.*?=l:бачити=t:V.*.*" | uk contains 
"=w:.*?=i:.*?=l:побачити=t:V.*.*" | uka contains 
"=w:.*?=i:.*?=l:бачити=t:V.*.*" | uka contains 
"=w:.*?=i:.*?=l:побачити=t:V.*.*" | pl contains 
"=w:.*?=i:.*?=l:widzieć=t:(fin|praet|inf|imps|impt|pact|ppas|pcon|pant|ger).*.*" 
| pl contains 
"=w:.*?=i:.*?=l:(zobaczyć|ujrzeć)=t:(fin|praet|inf|imps|impt|pact|ppas|pcon|pant|ger).*.*" 
| pla contains 
"=w:.*?=i:.*?=l:widzieć=t:(fin|praet|inf|imps|impt|pact|ppas|pcon|pant|ger).*.*" 
| pla contains 
"=w:.*?=i:.*?=l:(zobaczyć|ujrzeć)=t:(fin|praet|inf|imps|impt|pact|ppas|pcon|pant|ger).*.*" 
| cz contains "=w:.*?=i:.*?=l:(ne)?vidět=t:V.*.*" | cz contains 
"=w:.*?=i:.*?=l:(ne)?uvidět=t:V.*.*" | sk contains 
"=w:.*?=i:.*?=l:(ne)?vidieť=t:Vm.*.*" | sk contains 
"=w:.*?=i:.*?=l:(ne)?zazrieť=t:Vm.*.*" | sl contains 
"=w:.*?=i:.*?=l:videti=t:Gg.*.*" | sl contains 
"=w:.*?=i:.*?=l:zagledati=t:Gg.*.*" | hr contains 
"=w:.*?=i:.*?=l:vidje.*=t:Vm.*.*" | hr contains 
"=w:.*?=i:.*?=l:(ugleda.*|ugled)=t:(Vm|N).*.*" | sr contains 
"=w:.*?=i:.*?=l:(vidj?e.*|video)=t:(Vm|N).*.*" | sr contains 
"=w:.*?=i:.*?=l:(ugleda.*|ugled)=t:(Vm|N).*.*" | sra contains 
"=w:.*?=i:.*?=l:(vidj?e.*|video)=t:(Vm|N).*.*" | sra contains 
"=w:.*?=i:.*?=l:(ugleda.*|ugled)=t:(Vm|N).*.*" | mk contains 
"=w:.*?=i:.*?=l:гледа.*=t:[vV].*.*" | mk contains 
"=w:.*?=i:.*?=l:види=t:[vV].*.*" | bg contains 
"=w:.*?=i:.*?=l:виждам?=t:V(L|2).*.*" | bg contains 
"=w:.*?=i:.*?=l:видя=t:V(L|2).*.*"  )];

Rather, it is printing the result with
reduce A to 50; cat A;
that produces:

BULGAKOVMASTER_RU> cat A > "tst.txt"
cqp: concordance.c:572: compose_kwic_line: Assertion `match_start >= 0 && match_start < text_size' failed.
Aborted (core dumped)

I should add that this happened after I copied the alignments from a 
different version of BULGAKOVMASTER_RU. Maybe this is somehow connected. 
I am not sure it all worked as it should; there is a glitch with the 
original alignments in this corpus that I still have to look into in 
some detail.
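
One way to test this suspicion might be to check that every alignment 
region lies inside the token range of both corpora, since the failed 
assertion is about a match start that falls outside the text. Below is a 
minimal sketch in Python. It assumes the alignment beads have already been 
exported as (source_start, source_end, target_start, target_end) tuples of 
corpus positions; the function name and the sample data are made up for 
illustration, and nothing here is part of CWB itself.

```python
from typing import List, Tuple

# Hypothetical bead layout: (source_start, source_end, target_start, target_end),
# all inclusive corpus positions. This is an assumption, not a CWB file format.
Bead = Tuple[int, int, int, int]


def find_out_of_range_beads(
    beads: List[Bead], source_size: int, target_size: int
) -> List[Bead]:
    """Return the beads whose positions fall outside either corpus."""
    bad = []
    for bead in beads:
        s_start, s_end, t_start, t_end = bead
        # Same kind of condition as the failed assertion: 0 <= start < size,
        # extended to the end position of each region.
        source_ok = 0 <= s_start <= s_end < source_size
        target_ok = 0 <= t_start <= t_end < target_size
        if not (source_ok and target_ok):
            bad.append(bead)
    return bad


if __name__ == "__main__":
    # Made-up data: the target corpus has only 100 tokens, so the last
    # bead points past its end.
    beads = [(0, 9, 0, 11), (10, 19, 12, 25), (20, 29, 95, 104)]
    for bead in find_out_of_range_beads(beads, source_size=30, target_size=100):
        print("out of range:", bead)
```

If such a check reported beads beyond the size of the current corpus, that 
would be consistent with alignments copied from a version with a different 
token count, but it would not prove that this is what triggers the crash.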

Thanks! Best,
Ruprecht

