[CWB] Format for metadata files?
Graham Ranger -- UAPV
graham.ranger at univ-avignon.fr
Sat Dec 3 18:19:15 CET 2016
Hello,
I'm getting the following error message when I try to load the metadata
file for a corpus:
The data source you specified for the text metadata contains
badly-formatted text ID codes, as follows: <strong>
'assollant_rose_d_amour'; 'bruno_le_tour_de_la_france';
'bruyere_l_epee_de_charlemagne'; 'daudet_lettres_de_mon_moulin';
'malot_sans_famille'; 'marcel_les_petits_vagabonds';
'robida_les_assieges_de_compiegne'; 'segur_malheurs_de_sophie';
'segur_un_bon_petit_diable'; 'verne_cinq_semaines_en_ballon';
'verne_le_tour_du_monde'; 'zola_nouveaux_contes_a_ninon';</strong>
(text ids can only contain unaccented letters, numbers, and underscore).
The metadata is in a file called jeunesse.meta in which each line begins
with the text id of the texts in the corpus.
Inside the metadata file, the lines read as follows:
assollant_rose_d_amour alfred_assollant rose_d_amour 1889
1850_1899 roman avance
bruno_le_tour_de_la_france bruno le_tour_de_la_france 1877
1850-1899 manuel_scolaire elementaire
etc.
with text id, author, title, date, period, genre and level.
I can't see what is wrong with the file: the error message suggests that
it's formatted as <strong>, but it's just plain text!
Thanks as always for any help.
Best,
Graham.
More information about the CWB
mailing list