[CWB] "Segmentation fault (core dumped)" on various versions
Scott Sadowsky
ssadowsky at gmail.com
Wed Jul 24 05:30:16 CEST 2013
Something very strange is going on. I've replaced my index for this corpus
with a third backup copy, and the following happened:
PERS-DIVER-USENET> "jai"
0 matches.
PERS-DIVER-USENET> ".+ai"
Segmentation fault (core dumped)
Here the search for "jai", which previously caused a segfault, worked. So
all seemed good. But the search returned 0 hits, instead of the 1 which is
returned by the command cwb-lexdecode -f -p '.ai' PERS-DIVER-USENET. So
something isn't adding up here.
Immediately after this, I ran the folloiwng
$ cqp -eC
Welcome to CQP -- the Colourful Query Processor .
[30: NBUS 31: NBUS 32: NBUS 33: NBUS 34: NBUS 35: NBUS 36: NBUS 37:
NBUS ]
[40: NBUS 41: NBUS 42: NBUS 43: NBUS 44: NBUS 45: NBUS 46: NBUS 47:
NBUS ]
[no corpus]> PERS-DIVER-USENET
PERS-DIVER-USENET> "jai"
Segmentation fault (core dumped)
$
Now when I try the same search for "jai" that worked a minute ago, it
segfaults.
I suspect the next step is to rebuild the index from scratch, but that
involves decompressing a ZIP file with 1.2 million files inside it, which
I'd rather avoid if at all possible.
Cheers,
Scott
On Tue, Jul 23, 2013 at 11:11 AM, Stefan Evert <stefanML at collocations.de>wrote:
> Thanks.
>
> Now, could you please try the following queries in CQP on this corpus?
>
> [word = "Tai"]
> [word = "dai"]
> [word = "tai"]
> etc.
>
> If some of them crash, but others don't, your index is probably damaged.
> Otherwise, we'll have to dig deeper. How did you compile and install CWB?
>
> Cheers,
> Stefan
>
>
> On 23 Jul 2013, at 14:57, Scott Sadowsky <ssadowsky at gmail.com> wrote:
>
> > That worked just fine. Here's the output:
> >
> > $ cwb-lexdecode -f -p '.ai' PERS-DIVER-USENET
> > 165 Tai
> > 57 dai
> > 357 tai
> > 7 Mai
> > 3 Kai
> > 357 vai
> > 23 Vai
> > 6 rai
> > 81 cai
> > 13 sai
> > 4 Dai
> > 1 uai
> > 32 hai
> > 1 Jai
> > 23 mai
> > 7 fai
> > 2 lai
> > 2 Wai
> > 2 wai
> > 6 nai
> > 2 pai
> > 5 Sai
> > 4 bai
> > 4 Cai
> > 5 kai
> > 4 gai
> > 1 Bai
> > 2 Rai
> > 1 Lai
> > 1 Fai
> > 1 jai
> > $
> >
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20130723/91677d46/attachment.html>
More information about the CWB
mailing list