[CWB] Short sentences inconsistent alignment

Hardie, Andrew a.hardie at lancaster.ac.uk
Thu Dec 27 03:00:31 CET 2018


Two possibilities come to mind: either there is something wrong with the a-attribute, or there is a bug in the rendering code. Before bug hunting: can you check that these s elements really are aligned with one another in the underlying a-attribute? Thanks
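
A minimal sketch of the kind of check meant here, assuming the alignment beads and the s regions of both corpora have already been exported as inclusive corpus-position ranges. The function names, the tuple layout, and all the numbers are made up for illustration; this is not CWB's own API, and how the beads are dumped from a real a-attribute is not shown.

```python
from typing import List, Tuple

# Assumed layouts (hypothetical, for illustration only):
Region = Tuple[int, int]          # (first position, last position), inclusive
Bead = Tuple[int, int, int, int]  # (src_start, src_end, tgt_start, tgt_end), inclusive


def regions_in_span(regions: List[Region], start: int, end: int) -> List[int]:
    """Return the indices of regions lying entirely inside [start, end]."""
    return [i for i, (a, b) in enumerate(regions) if a >= start and b <= end]


def aligned_targets(src_region: Region, beads: List[Bead],
                    tgt_regions: List[Region]) -> List[int]:
    """Return the indices of target s regions in the bead that covers src_region."""
    s_start, s_end = src_region
    for b_src_start, b_src_end, b_tgt_start, b_tgt_end in beads:
        if b_src_start <= s_start and s_end <= b_src_end:
            return regions_in_span(tgt_regions, b_tgt_start, b_tgt_end)
    return []  # no bead covers this source sentence


# Toy data: the second source sentence is a two-token sentence,
# the second target sentence a three-token one.
src_s = [(0, 4), (5, 6), (7, 10)]
tgt_s = [(0, 5), (6, 8), (9, 13)]

# Beads that pair the sentences one to one.
good_beads = [(0, 4, 0, 5), (5, 6, 6, 8), (7, 10, 9, 13)]

# Beads in which the short sentence was merged into its neighbour.
merged_beads = [(0, 6, 0, 5), (7, 10, 6, 13)]

result_good = aligned_targets(src_s[1], good_beads, tgt_s)
result_merged = aligned_targets(src_s[1], merged_beads, tgt_s)
```

If the lookup on the real data returns the expected target sentence, the a-attribute is probably fine and the rendering code becomes the more likely suspect; if it returns a neighbouring sentence, the problem is in the alignment data itself.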

best

Andrew

From: cwb-bounces at sslmit.unibo.it <cwb-bounces at sslmit.unibo.it> On Behalf Of "Andrés Chandía"
Sent: 26 December 2018 15:27
To: cwb at sslmit.unibo.it
Subject: [CWB] Short sentences inconsistent alignment

Hi there, I have a corpus in which some of the sentences are really short, some only one word long. When I run a query, the parallel-corpus display usually shows me either the previous or the following sentence, even though the alignment is correct in the corpus.

I have the whole corpus configured this way: Display setting: show 1 of XML element: Structure ``s'' (s)

For instance, if I search for the word "wentru" (the corpora are Mapudungun/English and Mapudungun/Spanish):

in md/en I get: küme wentru | the other side / to the other side

in md/es I get: küme wentru | llegó a un lago

while the aligned sentences are:

md
<s id="73">
küme    [@AJ]    [@AJ][küme=bueno.-agradable]
wentru    [@NN]    [@NN][wentru=hombre]
</s>

en
<s id="73">
a    a    DT    0.998827
good    good    JJ    0.967564
man    man    NN    0.989314
</s>


es
<s id="73">
buen    bueno    AQ0MS00    1
hombre    hombre    NCMS000    0.990108
</s>


Is there a trick to get the correctly aligned sentences?



_______________________
            andrés chandía