[CWB] Strange issue with character encoding (?) in frequency lists

Scott Sadowsky ssadowsky at gmail.com
Sat May 25 22:38:32 CEST 2019


On Sat, May 25, 2019 at 2:20 PM Hardie, Andrew <a.hardie at lancaster.ac.uk>
wrote:

Hi Andrew,

> One possibility is that the wrong charset/collation is being activated for
> the frequency tables. Could you check this?
>
> If you run  show create table freq_corpus_*nameofyrcorpus*_word;  at the mysql
> command prompt, then the character set / collation should be stated either
> for the table as a whole, or for the “item” column.
>

That shows "ENGINE=InnoDB DEFAULT CHARSET=utf8". All my source texts are
UTF-8, and the database was created as UTF-8 too, by the way.

Cheers,
Scott
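
For reference, the check discussed above could be run roughly as follows at
the mysql prompt. This is only a sketch: the table name is the placeholder
from the quoted message (substitute the real corpus name), and the last two
statements are an extra assumption about what might be worth inspecting, not
something suggested in the thread. A table-level "DEFAULT CHARSET=utf8" does
not by itself state the collation of an individual column, so the per-column
view may be more informative.

```sql
-- Placeholder table name from the thread; replace nameofyrcorpus with the real corpus name.
SHOW CREATE TABLE freq_corpus_nameofyrcorpus_word;

-- Per-column view: the Collation field shows what applies to each text column,
-- including the "item" column mentioned above.
SHOW FULL COLUMNS FROM freq_corpus_nameofyrcorpus_word;

-- Assumption: also look at the character set and collation of the current connection.
SHOW VARIABLES LIKE 'character_set_%';
SHOW VARIABLES LIKE 'collation_%';
```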