[Sigwac] Gold standard data set corrupted - missing 103.txt

Serge Sharoff s.sharoff at leeds.ac.uk
Fri Mar 4 12:25:37 CET 2016


I have the files from the time we prepared the exercise:
http://corpus.leeds.ac.uk/serge/cleaneval/en-cleaned.tgz
http://corpus.leeds.ac.uk/serge/cleaneval/en-original.tgz

I hope the id's are the same as in the published version.

Best,
Serge

On Thursday 03 Mar 2016 13:47:53 Tom Morris wrote:
> Does anyone have the 103.txt which is supposed to be in the Gold Standard
> data set (http://cleaneval.sigwac.org.uk/GoldStandard.tar.gz) ?
> 
> The current 103.txt is, despite it's name, actually a tar file made up of
> all the other files. My guess is that someone typed:
> 
>     $ tar cvf *.txt
> 
> and the shell expanded that to
> 
>     $ tar cvf 103.txt 104.txt 105.txt ...
> 
> overwriting the original contents of the file with the tar containing all
> the other files.
> 
> If a corrected version of GoldStandard.tar.gz could be made available, that
> would be great.
> 
> Best regards,
> Tom Morris
> _______________________________________________
> Sigwac mailing list
> Sigwac at sslmit.unibo.it
> http://devel.sslmit.unibo.it/mailman/listinfo/sigwac


More information about the Sigwac mailing list