[CWB] Public Access to CWB
Tomaž Erjavec
Tomaz.Erjavec at ijs.si
Fri Oct 17 10:52:36 CEST 2014
Hi,
our concordancer is open to the public, and I think that should be the
default unless you have some really sensitive corpora.
However, it is a good idea to have a robots.txt telling crawlers not to
index the concordancer. In Slovenia we already had a problem where
somebody complained to our Information Commissioner because the first
Google hit for her name returned concordances in which she was mentioned
in an old newspaper article in some unsavoury context.
Also, our concordancer once crashed because a search engine sent so many
queries that the cache filled up the system disk.
So my advice would be: .htaccess no, robots.txt yes.
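
A minimal sketch of what such a robots.txt could look like, assuming the concordancer is served under a hypothetical /concordancer/ path (adjust it to wherever your installation actually lives). The rules are held in a Python string and checked with the standard library's urllib.robotparser, so you can confirm that crawlers are kept away from query URLs while the rest of the site stays crawlable. The helper name is_allowed and the example.org URLs are illustrative only.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt contents; "/concordancer/" is an assumed path,
# not something taken from a real CWB or CQPweb installation.
ROBOTS_TXT = """\
User-agent: *
Disallow: /concordancer/
"""


def is_allowed(robots_txt: str, url: str, agent: str = "Googlebot") -> bool:
    """Return True if the given robots.txt rules let `agent` fetch `url`."""
    parser = RobotFileParser()
    # parse() takes an iterable of lines, so no network access is needed.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(agent, url)
```

With these rules, is_allowed(ROBOTS_TXT, "http://example.org/concordancer/?q=test") should come back False and is_allowed(ROBOTS_TXT, "http://example.org/index.html") should come back True. Keep in mind that robots.txt is only honoured by well-behaved crawlers.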
Best,
Tomaž
On 17.10.2014 at 10:15, Stephen Barrett wrote:
> Dear All,
>
> We're about to launch a website which makes use of the excellent CWB. One rather basic question: if we remove the .htaccess restriction to allow full public access, I assume that leaves the site open to abuse. Is there a recommended strategy for public access that keeps the admin side secure?
>
> Many thanks (and many thanks for such an excellent product!)
>
> Regards
>
> Stephen Barrett
> _______________________________________________
> CWB mailing list
> CWB at sslmit.unibo.it
> http://devel.sslmit.unibo.it/mailman/listinfo/cwb