[CWB] Announcement: Another CWB/CQPweb setup in China

Hardie, Andrew a.hardie at lancaster.ac.uk
Thu Oct 25 20:12:33 CEST 2012


Sorry, that should be

{M}_N

(d'oh!)

Andrew.
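
For anyone wondering why the order matters, here is a minimal Python sketch of how a simple-query token of this shape could map onto CQP syntax: the secondary annotation goes first, in braces, followed by an underscore and the primary tag. It is only an illustration under stated assumptions, not the actual CQPweb translation code. The attribute names "pos" and "gender" are placeholders (each corpus maps its own attributes to the primary and secondary slots), the helper names are hypothetical, and details such as case-folding flags are ignored.

```python
import re

# Assumed attribute names, for illustration only: they are not taken from
# any corpus mentioned in this thread.
PRIMARY_ATTR = "pos"       # queried as _TAG
SECONDARY_ATTR = "gender"  # queried as {VALUE}

# One token: optional {secondary}, then optional _primary, in that order.
TOKEN = re.compile(
    r"^(?:\{(?P<secondary>[^{}]+)\})?(?:_(?P<primary>[^\s{}]+))?$"
)


def wildcard_to_regex(pattern):
    """Turn simple-query wildcards into a regex: * becomes .*, ? becomes ."""
    pieces = []
    for ch in pattern:
        if ch == "*":
            pieces.append(".*")
        elif ch == "?":
            pieces.append(".")
        else:
            pieces.append(re.escape(ch))
    return "".join(pieces)


def token_to_cqp(token):
    """Translate one token such as '{M}_N' into a CQP-style constraint."""
    match = TOKEN.match(token)
    if match is None or not (match["secondary"] or match["primary"]):
        raise ValueError("cannot parse token: " + token)
    constraints = []
    if match["secondary"]:
        value = wildcard_to_regex(match["secondary"])
        constraints.append('%s="%s"' % (SECONDARY_ATTR, value))
    if match["primary"]:
        value = wildcard_to_regex(match["primary"])
        constraints.append('%s="%s"' % (PRIMARY_ATTR, value))
    return "[" + " & ".join(constraints) + "]"


def query_to_cqp(query):
    """Translate a whitespace-separated sequence of tokens."""
    return " ".join(token_to_cqp(t) for t in query.split())


if __name__ == "__main__":
    print(query_to_cqp("_N* _A*"))
    print(token_to_cqp("{M}_N"))
```

In this sketch a token written the other way round, such as _N{M}, is rejected rather than reinterpreted.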

From: Hardie, Andrew
Sent: 25 October 2012 19:12
To: 'Open source development of the Corpus WorkBench'
Subject: RE: [CWB] Announcement: Another CWB/CQPweb setup in China

{N}_M

best

Andrew.

From: cwb-bounces at sslmit.unibo.it On Behalf Of "Andrés Chandía"
Sent: 25 October 2012 19:10
To: Open source development of the Corpus WorkBench
Subject: Re: [CWB] Announcement: Another CWB/CQPweb setup in China

Hi JM
We've been able to run a query like the one you describe. Our tagset is customised, so if we want to look for a "noun followed by an adjective" we enter the following in the simple query language: "_N* _A*" (without the double quotes).

If we want to look for a secondary tag, let's say gender, the query looks like this: {M} (for masculine).

That means:
query for primary annotation tag = _*
query for secondary annotation tag = {*}

What we haven't been able to do is find the right combination to query for a "masculine noun", for instance. We have tried many combinations with no success ( _N{M} - {N/M}, etc.), so if somebody could help us with this we would appreciate it a lot.

@ch


On Thu, 25 October 2012, 13:04, Josep M. Fontana wrote:
Hi,

I am a little (or quite) confused about the syntax of CQPweb queries (simple query language). I went to the wonderful resource Ray Wu has made available so that I could see how it works since we are in the process of installing CQPweb as an interface for our corpora. I wasn't able to complete any search using the simple query language, though. I'm sure it is something very simple that I am missing. From what I understand reading the document 'simple query language syntax', I should be able to do the following in the simple query mode:

_JJ _NN1

which would supposedly look for sequences of an adjective followed by noun according to the CLAWS tag set.

OK, I'm conducting the searches in the Old Icelandic Corpus, which has supposedly been tagged using the CLAWS7 tagset (according to the information in "View corpus metadata"). When I do this, however, I get a message saying "Your query had no results. There are no matches for your query." This is very puzzling because you would expect there to be occurrences of adjectives followed by nouns. Doing it in the opposite order (_NN1 _JJ) gives me the same result. What is even more puzzling is that I also get nothing using single POS labels such as _NN1 by itself or _JJ.

Am I doing something wrong, or is this due to the fact that this particular corpus uses a completely different tagset? When you access a CQPweb corpus, is there any way to retrieve the tags that have been used in the corpus? The only relevant info I find in this corpus is the link to the CLAWS7 tagset but, as I said, this doesn't seem to be the right information. Going into the CQP syntax mode and doing "show +pos" doesn't work.


JM
Dear members,

We are pleased to announce another CWB/CQPweb setup in China and we dub it BFSU CQPweb. It is closely modelled after Hardie's own (sorry Andrew, we're badly in need of imagination) and currently features more than 20 corpora, including two Brown family cousins (CLOB and Crown) developed at Beijing Foreign Studies University by Dr. Xu Jiajing and Professor Liang Maocheng.

You may access it from http://124.193.83.252/cqp/ using test/test as username/password.

We'd like to take this opportunity to thank the CWB team for their wonderful work and generosity. It is great fun to build our work on their shoulders.
Best,
Ray





_______________________________________________
CWB mailing list
CWB at sslmit.unibo.it
http://devel.sslmit.unibo.it/mailman/listinfo/cwb




_______________________
            andrés chandía

