[CWB] [ cwb-Feature Requests-2898822 ] CQPweb: preserve metadata table design through re-indexing

SourceForge.net noreply at sourceforge.net
Tue Nov 17 02:15:52 CET 2009


Feature Requests item #2898822, was opened at 2009-11-17 01:15
Message generated for change (Tracker Item Submitted) made by andrewhardie
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=722306&aid=2898822&group_id=131809

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: CQPweb
Group: None
Status: Open
Priority: 4
Private: No
Submitted By: Andrew Hardie (andrewhardie)
Assigned to: Andrew Hardie (andrewhardie)
Summary: CQPweb: preserve metadata table design through re-indexing

Initial Comment:
At the moment, if a corpus is deleted, its metadata table (including all specifications of the fields) are wiped completely.

It might be desirable to be able preserve the design when the actual data (the rows of the metadata table, plus of course the CWB data if it's an internal corpus)  is deleted, so it doesn't all have to be re-typed (handles, descriptions etc.) when you re-index a corpus e.g. if texts have been added or altered.

One way to implement this: have a db of metadata table **designs**. Allow them to be generated from existing corpora. This would contain the create, insert statements etc. used when that metadata table was set up. Once generated, the designs exist indepedently of any corpus and persist when their parent corpus is deleted. And can also be applied to new corpora (ie, instead of specifying all the fields on the usual "install text metadata" form in "manage metadata", the design template is indicated instead.) -- so the design template created pre-deletion is applied to the new version of the corpus when it is set up again, or indeed any other corpus for which it is appropriate.

This would make re-indexing of a corpus much quicker and would enhance the possibility of using CQPweb with corpora that are in any way "under development" and not completely fixed datasets (ie require frequent refreshing/reindexing). Not an urgent feature, however, although probably quite straightforward to code up.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=722306&aid=2898822&group_id=131809


More information about the CWB mailing list