[CWB] Public Access to CWB

Tomaž Erjavec Tomaz.Erjavec at ijs.si
Fri Oct 17 10:52:36 CEST 2014


Hi,
we have our concordancer open, and I think that should be the default, 
except if you have some really sensitive corpora.
However, it is a good idea to have a robots.txt telling crawlers not to 
index the concordancer; in Slovenia we already had a problem where 
somebody complained to our Information Commissioner,  because the first 
hit on Google on her name returned concordances where she was mentioned 
in an old newspaper article in some unsavoury context.
Also, our concordancer once crashed because a search engine sent so many 
queries that the cache filled up the sys disk.
So, my advice would be .htaccess no, robots yes.
Best,
Tomaž

Dne 17.10.2014 ob 10:15 je Stephen Barrett zapisal(a):
> Dear All,
>
> We're about to launch a website which makes use of the excellent CWB. One rather basic question: if we remove htaccess in order for full public access I assume that leaves the site open to abuse. Is there a recommended strategy for public access that keeps the admin side secure?
>
> Many thanks (and many thanks for such an excellent product!)
>
> Regards
>
> Stephen Barrett
> s
> _______________________________________________
> CWB mailing list
> CWB at sslmit.unibo.it
> http://devel.sslmit.unibo.it/mailman/listinfo/cwb



More information about the CWB mailing list