[CWB] [ cwb-Feature Requests-2891967 ] Read undump files without explicit line count
SourceForge.net
noreply at sourceforge.net
Wed Nov 4 15:57:52 CET 2009
Feature Requests item #2891967, was opened at 2009-11-04 15:57
Message generated for change (Tracker Item Submitted) made by schtepf
You can respond by visiting:
https://sourceforge.net/tracker/?func=detail&atid=722306&aid=2891967&group_id=131809
Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: CWB engine
Group: None
Status: Open
Priority: 6
Private: No
Submitted By: Stefan Evert (schtepf)
Assigned to: Stefan Evert (schtepf)
Summary: Read undump files without explicit line count
Initial Comment:
The "undump" command in CQP requires an explicit line count header in the first line of the undump file so that arrays can be pre-allocated. This is a major hassle when exchanging data with spreadsheets, SQL database engines, R, and other software that would otherwise work quite well with the TAB-delimited format of dump/undump files. Without this restriction, dump files could also serve as a platform-independent serialization format for query results (unlike "save", which produces non-portable binary files that even store the registry directory of the base corpus).
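
To illustrate the workaround this restriction currently forces on users, here is a minimal Python sketch that prepends the required line count to TAB-delimited rows exported from another tool. It is not part of CWB: the function names are hypothetical, and the row layout (integer corpus positions separated by TABs) is an assumption made for the example.

```python
def add_line_count(rows_text: str) -> str:
    """Prepend a line-count header to TAB-delimited rows.

    Hypothetical helper, not a CWB function. Blank lines are dropped
    so that the header matches the number of data rows.
    """
    rows = [line for line in rows_text.splitlines() if line.strip()]
    return "\n".join([str(len(rows)), *rows]) + "\n"


def parse_rows(rows_text: str) -> list:
    """Parse header-less rows into tuples of integers.

    Assumes every field is an integer corpus position. The list simply
    grows as rows are read, so no count is needed up front.
    """
    return [
        tuple(int(field) for field in line.split("\t"))
        for line in rows_text.splitlines()
        if line.strip()
    ]


if __name__ == "__main__":
    # Example rows as they might come out of a spreadsheet or SQL export.
    exported = "10\t12\n20\t25\n"
    print(add_line_count(exported), end="")
    print(parse_rows(exported))
```

Having to run a step like add_line_count on every exported file is exactly the extra work this request aims to remove.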
----------------------------------------------------------------------