[CWB] [cwb:bugs] #59 cwb-makeall aborts uninformatively if there are too many attributes
Andrew Hardie
andrewhardie at users.sf.net
Mon Jun 30 00:09:04 CEST 2014
- **status**: open --> closed-wont-fix
---
** [bugs:#59] cwb-makeall aborts uninformatively if there are too many attributes**
**Status:** closed-wont-fix
**Group:** TODO-4.0
**Labels:** CL low-level library
**Created:** Fri Sep 13, 2013 11:15 AM UTC by Andrew Hardie
**Last Updated:** Sun Jun 29, 2014 10:08 PM UTC
**Owner:** Andrew Hardie
It is possible to create a corpus using cwb-encode that cannot be processed by cwb-makeall (or any tool that uses the CL, but cwb-makeall is the one where you notice it!) because it has too many attributes.
The bug is as follows: the YACC parser produces an error, with the message printed twice:
syntax error
Parse Error
(plus boiler plate inserted by cregerror_cleanup())
This seems to arise from the "error" component in the Yacc grammar in parser.y.
I triggered this error with S-attributes (I was indexing *lots* of XML elements, each of which had 3 or 4 attributes). Removing most of them from the registry file made the problem go away. Thus, I infer that the problem was caused by too many attributes - I assume it could not have been a syntax error in the reg file, because that was written by cwb-encode!
Solution: I am not even sure one is needed because this entire area of the code will go away in v 3.9/4.0, and it is clearly not hampering most users. If there *is* an effective maximum number of attributes, perhaps that should be mentioned in the cwb-encode man file for 3.5.
---
Sent from sourceforge.net because cwb at sslmit.unibo.it is subscribed to https://sourceforge.net/p/cwb/bugs/
To unsubscribe from further messages, a project admin can change settings at https://sourceforge.net/p/cwb/admin/bugs/options. Or, if this is a mailing list, you can unsubscribe from the mailing list.
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20140629/de4294fc/attachment.html>
More information about the CWB
mailing list