[Sigwac] Gold standard data set corrupted - missing 103.txt
Serge Sharoff
s.sharoff at leeds.ac.uk
Fri Mar 4 12:25:37 CET 2016
I have the files from the time we prepared the exercise:
http://corpus.leeds.ac.uk/serge/cleaneval/en-cleaned.tgz
http://corpus.leeds.ac.uk/serge/cleaneval/en-original.tgz
I hope the id's are the same as in the published version.
Best,
Serge
On Thursday 03 Mar 2016 13:47:53 Tom Morris wrote:
> Does anyone have the 103.txt which is supposed to be in the Gold Standard
> data set (http://cleaneval.sigwac.org.uk/GoldStandard.tar.gz) ?
>
> The current 103.txt is, despite it's name, actually a tar file made up of
> all the other files. My guess is that someone typed:
>
> $ tar cvf *.txt
>
> and the shell expanded that to
>
> $ tar cvf 103.txt 104.txt 105.txt ...
>
> overwriting the original contents of the file with the tar containing all
> the other files.
>
> If a corrected version of GoldStandard.tar.gz could be made available, that
> would be great.
>
> Best regards,
> Tom Morris
> _______________________________________________
> Sigwac mailing list
> Sigwac at sslmit.unibo.it
> http://devel.sslmit.unibo.it/mailman/listinfo/sigwac
More information about the Sigwac
mailing list