[CWB] Announcement: Another CWB/CQPweb setup in China
Josep M. Fontana
josepm.fontana at upf.edu
Fri Oct 26 07:46:17 CEST 2012
Hi Jiajin,
This solution will definitely be of help for the most careful and
knowledgeable users. Not that I'm either of those but I remember I went
back to the "view corpus metadata" page a couple of times so I would
probably have noticed those links you added now and I would have found
the info about the right tagset. But definitely, having the link to the
CLAWS7 tagset next to "parts-of-speech-tag" might still confuse some.
JM
> Hi JM, Andrew, and Ray,
>
> The only thing I could do for the Corpus Info section of the lefthand
> menu of CQPweb is the Corpus documentation.
>
> In our Icelandic corpus interface (http://124.193.83.252/cqp/IcePaHC/,
> ID: test, Pass: test), I added the link of the official site of the
> Historical Icelandic corpus to
> http://www.linguist.is/icelandic_treebank/Icelandic_Parsed_Historical_Corpus_%28IcePaHC%29,
> which provides all useful information of the corpus, including the
> tagset used (http://www.linguist.is/icelandic_treebank/Tagset), and
> the download links. Andrew's trial use of Q-A tag has to be the parsed
> part of the corpus, as the corpus has been both PoS-tagged and parsed.
>
> I hope the information above helps.
>
> Best,
>
> Jiajin
>
> Jiajin XU
> Ph.D., associate professor
> National Research Centre for Foreign Language Education
> Beijing Foreign Studies University
> Beijing 100089
> China
> Email: xujiajin at bfsu.edu.cn <mailto:xujiajin at bfsu.edu.cn>
>
>
>
> On Fri, Oct 26, 2012 at 2:10 AM, "Andrés Chandía" <andres at chandia.net
> <mailto:andres at chandia.net>> wrote:
>
> Hi JM
> We've been able of making a query like the one you describe, our
> tagset is personalized so if we want to look for a "Name folloewd
> by an Adjective" we do next at the SQL: "_N* _A*" (wothout the
> doublequotes
>
> if we want to look for a secondary tag like, let say gender, the
> query is like this: {M} (for masculine)
>
> that means:
> query for primary anotation tag = _*
> query for secondary anotation tag = {*}
>
> What we haven't been able to do is find the combination to query
> for a "Noun Masculine" for instance, we have tried many
> combinations with no success ( _N{M} - {N/M}, etc.) so if somebody
> could help us with this we would appreciate it a lot.
>
> @ch
>
>
> El Jue, 25 de Octubre de 2012, 13:04, Josep M. Fontana escribió:
> Hi,
>
> I am a little (or quite) confused about the syntax of CQPweb
> queries (simple query language). I went to the wonderful resource
> Ray Wu has made available so that I could see how it works since
> we are in the process of installing CQPweb as an interface for our
> corpora. I wasn't able to complete any search using the simple
> query language, though. I'm sure it is something very simple that
> I am missing. From what I understand reading the document 'simple
> query language syntax', I should be able to do the following in
> the simple query mode:
>
> _JJ _NN1
>
> which would supposedly look for sequences of an adjective followed
> by noun according to the CLAWS tag set.
>
> OK, I'm conducting the searches in the Old Icelandic Corpus which
> has been supposedly tagged using the CLAWS7 tagset (according to
> the information in "View corpus metadata". When I do this,
> however, I get a message saying "Your query had no results. There
> are no matches for your query." This is very puzzling because you
> would imagine that there would be occurrences of adjectives
> followed by nouns. Doing it the opposite order (_NN1 _JJ) gives me
> the same results. What is even more puzzling is that I also get
> nothing using single POS labels such as _NN1 by itself or _JJ.
>
> Am I doing something wrong or is this due to the fact that this
> particular corpus uses a completely different tagset? When you
> access a CQPWeb corpus, is there any way to retrieve the tags that
> have been used in the corpus? The only relevant info I find in
> this corpus is the link to the CLAWS7 tagset but, as I said, this
> doesn't seem to be the right information. Going into the CQP
> syntax mode and doing "show +pos" doesn't work.
>
>
> JM
>
>> Dear members,
>>
>> We are pleased to announce another CWB/CQPweb setup in China and
>> we dub it BFSU CQPweb. It is closely modelled after Hardie's own
>> (sorry Andrew, we're badly in need of imagination) and currently
>> features more than 20 corpora, including two Brown family cousins
>> (CLOB and Crown) developed at Beijing Foreign Studies Unversity
>> by Dr. Xu Jiajing and Professor Liang Maocheng.
>>
>> You may access it from http://124.193.83.252/cqp/ using test/test
>> as username/password.
>>
>> We'd like to take this opportunity to thank the CWB team for
>> their wonderful work and generosity. It is great fun to build our
>> work on their shoulders.
>>
>> Best,
>> Ray
>
>
>
> _______________________________________________
> CWB mailing list
> CWB at sslmit.unibo.it
> http://devel.sslmit.unibo.it/mailman/listinfo/cwb
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20121026/57d03c0b/attachment.html>
More information about the CWB
mailing list