[CWB] [ cwb-Bugs-3586224 ] cwb-s-encode should also check character encoding

SourceForge.net noreply at sourceforge.net
Sun Nov 11 18:39:39 CET 2012


Bugs item #3586224, was opened at 2012-11-11 09:39
Message generated for change (Tracker Item Submitted) made by schtepf
You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=722303&aid=3586224&group_id=131809

Please note that this message will contain a full copy of the comment thread,
including the initial issue submission, for this request,
not just the latest update.
Category: Command-line utilities
Group: None
Status: Open
Resolution: None
Priority: 6
Private: No
Submitted By: Stefan Evert (schtepf)
Assigned to: Nobody/Anonymous (nobody)
Summary: cwb-s-encode should also check character encoding

Initial Comment:
In CWB 3.4+, cwb-encode requires character encoding to be declared, validates input (to some extent) and converts UTF-8 strings to standard normalized form.  cwb-s-encode should do the same for an attribute with annotations and require an encoding declaration in this case.

----------------------------------------------------------------------

You can respond by visiting: 
https://sourceforge.net/tracker/?func=detail&atid=722303&aid=3586224&group_id=131809


More information about the CWB mailing list