[CWB] "Segmentation fault (core dumped)" on various versions

Scott Sadowsky ssadowsky at gmail.com
Wed Jul 24 05:30:16 CEST 2013


Something very strange is going on. I've replaced my index for this corpus
with a third backup copy, and the following happened:

PERS-DIVER-USENET> "jai"
0 matches.
PERS-DIVER-USENET> ".+ai"
Segmentation fault (core dumped)

Here the search for "jai", which previously caused a segfault, worked. So
all seemed good. But the search returned 0 hits, instead of the 1 which is
returned by the command cwb-lexdecode -f -p '.ai' PERS-DIVER-USENET. So
something isn't adding up here.

Immediately after this, I ran the folloiwng

$ cqp -eC
Welcome to CQP -- the  Colourful  Query  Processor .
[30: NBUS  31: NBUS  32: NBUS  33: NBUS  34: NBUS  35: NBUS  36: NBUS  37:
NBUS  ]
[40: NBUS  41: NBUS  42: NBUS  43: NBUS  44: NBUS  45: NBUS  46: NBUS  47:
NBUS  ]
[no corpus]> PERS-DIVER-USENET
PERS-DIVER-USENET> "jai"
Segmentation fault (core dumped)
$

Now when I try the same search for "jai" that worked a minute ago, it
segfaults.

I suspect the next step is to rebuild the index from scratch, but that
involves decompressing a ZIP file with 1.2 million files inside it, which
I'd rather avoid if at all possible.

Cheers,
Scott



On Tue, Jul 23, 2013 at 11:11 AM, Stefan Evert <stefanML at collocations.de>wrote:

> Thanks.
>
> Now, could you please try the following queries in CQP on this corpus?
>
>         [word = "Tai"]
>         [word = "dai"]
>         [word = "tai"]
>         etc.
>
> If some of them crash, but others don't, your index is probably damaged.
> Otherwise, we'll have to dig deeper.  How did you compile and install CWB?
>
> Cheers,
> Stefan
>
>
> On 23 Jul 2013, at 14:57, Scott Sadowsky <ssadowsky at gmail.com> wrote:
>
> > That worked just fine. Here's the output:
> >
> > $ cwb-lexdecode -f -p '.ai' PERS-DIVER-USENET
> >     165 Tai
> >      57 dai
> >     357 tai
> >       7 Mai
> >       3 Kai
> >     357 vai
> >      23 Vai
> >       6 rai
> >      81 cai
> >      13 sai
> >       4 Dai
> >       1 uai
> >      32 hai
> >       1 Jai
> >      23 mai
> >       7 fai
> >       2 lai
> >       2 Wai
> >       2 wai
> >       6 nai
> >       2 pai
> >       5 Sai
> >       4 bai
> >       4 Cai
> >       5 kai
> >       4 gai
> >       1 Bai
> >       2 Rai
> >       1 Lai
> >       1 Fai
> >       1 jai
> > $
> >
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20130723/91677d46/attachment.html>


More information about the CWB mailing list