[Sigwac] Gold standard data set corrupted - missing 103.txt

Miloš Jakubíček milos.jakubicek at sketchengine.co.uk
Fri Mar 4 14:41:23 CET 2016


Hi Tom,

I just checked and on the server we only have the archive -- I will try to
see whether we have any old backups, but the file comes from 2007, so the
chances are not very high :(

Best
Milos

Milos Jakubicek

CEO, Lexical Computing
Brighton, UK | Brno, CZ
http://www.lexicalcomputing.com
http://www.sketchengine.co.uk

2016-03-03 19:47 GMT+01:00 Tom Morris <tfmorris at gmail.com>:

> Does anyone have the 103.txt which is supposed to be in the Gold Standard
> data set (http://cleaneval.sigwac.org.uk/GoldStandard.tar.gz) ?
>
> The current 103.txt is, despite its name, actually a tar file made up of
> all the other files. My guess is that someone typed:
>
>     $ tar cvf *.txt
>
> and the shell expanded that to
>
>     $ tar cvf 103.txt 104.txt 105.txt ...
>
> overwriting the original contents of the file with the tar containing all
> the other files.
>
> If a corrected version of GoldStandard.tar.gz could be made available, that
> would be great.
>
> Best regards,
> Tom Morris
> _______________________________________________
> Sigwac mailing list
> Sigwac at sslmit.unibo.it
> http://devel.sslmit.unibo.it/mailman/listinfo/sigwac
>
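
The diagnosis in the quoted message can be checked in a scratch directory.
The following is a minimal sketch, not a record of what actually happened on
the server: the file contents and the archive name all.tar are invented for
illustration, and only the 103.txt/104.txt/105.txt names come from the thread.
It assumes a tar that accepts old-style "cf"/"tf" options (GNU tar and bsdtar
both do) and a system that provides "mktemp -d".

```shell
#!/bin/sh
# Reproduce the suspected mishap in a throwaway directory.
set -eu
workdir=$(mktemp -d)
cd "$workdir"

# Stand-in files; the real Gold Standard contents are not reproduced here.
for n in 103 104 105; do
    printf 'contents of %s\n' "$n" > "$n.txt"
done

# Intended form: the archive name comes first, then the glob.
# all.tar is a hypothetical name that does not match *.txt.
tar cf all.tar *.txt
safe_members=$(tar tf all.tar)

# Suspected mistake: no archive name. The shell expands this to
#   tar cf 103.txt 104.txt 105.txt
# so the first match is taken as the archive name and is overwritten.
tar cf *.txt
clobbered_members=$(tar tf 103.txt)

# If tar can list a .txt file, it is an archive rather than plain text.
if tar tf 103.txt >/dev/null 2>&1; then
    echo "103.txt is a tar archive containing:"
    echo "$clobbered_members"
fi

# Clean up the scratch directory.
cd /
rm -rf "$workdir"
```

The same "tar tf" check can be run against the 103.txt extracted from
GoldStandard.tar.gz to confirm whether it is an archive of the other files.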

