<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<meta name="Generator" content="Microsoft Word 15 (filtered medium)">
<style><!--
/* Font Definitions */
@font-face
        {font-family:"Cambria Math";
        panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
        {font-family:Calibri;
        panose-1:2 15 5 2 2 2 4 3 2 4;}
@font-face
        {font-family:Verdana;
        panose-1:2 11 6 4 3 5 4 4 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
        {margin:0cm;
        font-size:11.0pt;
        font-family:"Calibri",sans-serif;
        mso-fareast-language:EN-US;}
a:link, span.MsoHyperlink
        {mso-style-priority:99;
        color:#0563C1;
        text-decoration:underline;}
span.E-MailFormatvorlage20
        {mso-style-type:personal-reply;
        font-family:"Calibri",sans-serif;
        color:windowtext;}
.MsoChpDefault
        {mso-style-type:export-only;
        font-size:10.0pt;}
@page WordSection1
        {size:612.0pt 792.0pt;
        margin:70.85pt 70.85pt 2.0cm 70.85pt;}
div.WordSection1
        {page:WordSection1;}
--></style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]-->
</head>
<body lang="DE-CH" link="#0563C1" vlink="#954F72" style="word-wrap:break-word">
<div class="WordSection1">
<p class="MsoNormal"><span lang="EN-US">I figured it out </span><span style="font-family:"Segoe UI Emoji",sans-serif">😊</span>
<span lang="EN-US">We had differently formatted the text tags because we had collected data from different sources. After “standardizing” the tags, everything works. The problems with searching by lemma were due to the windows formatting of line breaks; as
soon as we changed it to linux line breaks, it worked again.<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US">Thank you for you help and best regards<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US">Bojan<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US"><o:p> </o:p></span></p>
<div>
<p class="MsoNormal"><span style="mso-fareast-language:DE-CH">------------------------------------------------------------<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:DE-CH"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:DE-CH">lic. phil. Bojan Peric<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:DE-CH">Wissenschaftlicher Mitarbeiter<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:DE-CH"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="mso-fareast-language:DE-CH">ZHAW School of Management and Law<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:DE-CH">Gertrudstrasse 15<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:DE-CH">CH-8400 Winterthur<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:DE-CH"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:DE-CH">perc@zhaw.ch<o:p></o:p></span></p>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<div>
<div style="border:none;border-top:solid #E1E1E1 1.0pt;padding:3.0pt 0cm 0cm 0cm">
<p class="MsoNormal"><b><span lang="DE" style="mso-fareast-language:DE-CH">Von:</span></b><span lang="DE" style="mso-fareast-language:DE-CH"> cwb-bounces@sslmit.unibo.it <cwb-bounces@sslmit.unibo.it>
<b>Im Auftrag von </b>Peric Bojan (perc)<br>
<b>Gesendet:</b> Mittwoch, 18. Mai 2022 15:51<br>
<b>An:</b> Open source development of the Corpus WorkBench <cwb@sslmit.unibo.it><br>
<b>Betreff:</b> [CWB] Bad metadata value on input file<o:p></o:p></span></p>
</div>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal">Hi Andrew<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal"><span lang="EN-US">Thank you very much for your reply. I’m not sure if “blanks” means “space characters” or “empty attributes”, so I tried to remove space characters and fill in dummy values if there is an empty attribute. So now there
are neither space characters nor empty attributes, however, the problem persists. By the way, I can’t change the data type to classification under “Manage text metadata”, all handles are automatically set to free text and are not changeable. It doesn’t matter
which boxes I check, I get the same error.<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US">Funnily enough, the corpus works in CWB – at least more or less, searching by lemma does not.<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US">What am I missing?<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US">Many thanks<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US">Bojan<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US"><o:p> </o:p></span></p>
<div>
<p class="MsoNormal"><span style="mso-fareast-language:DE-CH">------------------------------------------------------------<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:DE-CH"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:DE-CH">lic. phil. Bojan Peric<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:DE-CH">Wissenschaftlicher Mitarbeiter<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:DE-CH"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="mso-fareast-language:DE-CH">ZHAW School of Management and Law<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:DE-CH">Gertrudstrasse 15<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:DE-CH">CH-8400 Winterthur<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:DE-CH"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:DE-CH"><a href="mailto:perc@zhaw.ch">perc@zhaw.ch</a><o:p></o:p></span></p>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<div>
<div style="border:none;border-top:solid #E1E1E1 1.0pt;padding:3.0pt 0cm 0cm 0cm">
<p class="MsoNormal"><b><span lang="DE" style="mso-fareast-language:DE-CH">Von:</span></b><span lang="DE" style="mso-fareast-language:DE-CH">
<a href="mailto:cwb-bounces@sslmit.unibo.it">cwb-bounces@sslmit.unibo.it</a> <<a href="mailto:cwb-bounces@sslmit.unibo.it">cwb-bounces@sslmit.unibo.it</a>>
<b>Im Auftrag von </b>Hardie, Andrew<br>
<b>Gesendet:</b> Mittwoch, 18. Mai 2022 13:07<br>
<b>An:</b> Open source development of the Corpus WorkBench <<a href="mailto:cwb@sslmit.unibo.it">cwb@sslmit.unibo.it</a>><br>
<b>Betreff:</b> Re: [CWB] Bad metadata value on input file<o:p></o:p></span></p>
</div>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal"><span lang="EN-GB" style="font-size:10.0pt;font-family:"Verdana",sans-serif;color:#1F4B7D">It
<i>is</i> the blanks. If you specify a metadata field as being of datatype “classification”, then every text needs a value for that field.<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB" style="font-size:10.0pt;font-family:"Verdana",sans-serif;color:#1F4B7D"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB" style="font-size:10.0pt;font-family:"Verdana",sans-serif;color:#1F4B7D">best<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB" style="font-size:10.0pt;font-family:"Verdana",sans-serif;color:#1F4B7D"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB" style="font-size:10.0pt;font-family:"Verdana",sans-serif;color:#1F4B7D">Andrew.<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB" style="font-size:10.0pt;font-family:"Verdana",sans-serif;color:#1F4B7D"><o:p> </o:p></span></p>
<div>
<div style="border:none;border-top:solid #E1E1E1 1.0pt;padding:3.0pt 0cm 0cm 0cm">
<p class="MsoNormal"><b><span lang="EN-US" style="mso-fareast-language:EN-GB">From:</span></b><span lang="EN-US" style="mso-fareast-language:EN-GB">
<a href="mailto:cwb-bounces@sslmit.unibo.it">cwb-bounces@sslmit.unibo.it</a> <<a href="mailto:cwb-bounces@sslmit.unibo.it">cwb-bounces@sslmit.unibo.it</a>>
<b>On Behalf Of </b>Peric Bojan (perc)<br>
<b>Sent:</b> 17 May 2022 15:20<br>
<b>To:</b> <a href="mailto:cwb@sslmit.unibo.it">cwb@sslmit.unibo.it</a><br>
<b>Subject:</b> [CWB] Bad metadata value on input file<o:p></o:p></span></p>
</div>
</div>
<p class="MsoNormal"><span lang="EN-GB"><o:p> </o:p></span></p>
<div>
<p class="MsoNormal">Hi<o:p></o:p></p>
<p class="MsoNormal"><o:p> </o:p></p>
<p class="MsoNormal"><span lang="EN-US">When I try to import a corpus in CQPweb, I get a whole lot of “bad metadata value” errors:<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US">Bad metadata value on input file line 13915 in column 0: n``'' .<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US">I can’t figure out where the problem is. I thought maybe it’s the blanks in the text tag attributes, but the problem persists when the blanks are removed. Any idea how to pinpoint the issue?<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US">Here’s what a typical text tag looks like:<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US"><text id="DEB5780" author="" title="" source="BA" page="1-24" topics="" subtopics="" language="de" date="1891" description="" type="Parlamentsdebatten" file="1891_001(AB1891N1-24).tetml" year="1891" decade="1890" url="debatten_data/debatten_tetml/1891_001(AB1891N1-24).tetml"><o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US">Any help is greatly appreciated.<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US">Best<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US">Bojan<o:p></o:p></span></p>
<p class="MsoNormal"><span lang="EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:DE-CH">------------------------------------------------------------<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:DE-CH"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:DE-CH">lic. phil. Bojan Peric<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:DE-CH">Wissenschaftlicher Mitarbeiter<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:DE-CH"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-US" style="mso-fareast-language:DE-CH">ZHAW School of Management and Law<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:DE-CH">Gertrudstrasse 15<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:DE-CH">CH-8400 Winterthur<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:DE-CH"><o:p> </o:p></span></p>
<p class="MsoNormal"><span lang="EN-GB"><a href="mailto:perc@zhaw.ch"><span lang="DE-CH" style="mso-fareast-language:DE-CH">perc@zhaw.ch</span></a></span><span style="mso-fareast-language:DE-CH"><o:p></o:p></span></p>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
</div>
</body>
</html>