[CWB] Finding bad non-category-handle values

Hardie, Andrew a.hardie at lancaster.ac.uk
Sat Sep 24 08:07:54 CEST 2016


Try a CQP query for

<whichever_att=".*[^a-zA-Z0-9_].*">[]

and then  tabulate match whichever_att ?

best

Andrew.

From: cwb-bounces at sslmit.unibo.it [mailto:cwb-bounces at sslmit.unibo.it] On Behalf Of Scott Sadowsky
Sent: 24 September 2016 04:10
To: Open source development of the Corpus WorkBench
Cc: Open source development of the Corpus WorkBench
Subject: [CWB] Finding bad non-category-handle values

I'm attempting to import a corpus into CQPweb, and when I try to change one of the s-attributes from "free text" to "classification", I get the following error:

The datatype of text_source cannot be changed to [classification], because there are non-category-handle values in the CWB index.

I understand this to mean that in one or more values of text_source, there's a character that's not a-z or _. My question is simply how do I get a list of these values in order to figure out which one is causing the problem and then fix it?

Thanks in advance!
Scott
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20160924/05216bb5/attachment.html>


More information about the CWB mailing list