[Sigwac] orthographic errors in web pages paper

Marco Baroni baroni at sslmit.unibo.it
Thu Oct 12 19:12:13 CEST 2006



> I'm amazed that they managed to collect a reasonable corpus at all.
> Zipf's law naturally drives such searches towards finding "very large
> lists of general keywords".

Indeed, that's what they find, right?

M


More information about the Sigwac mailing list