[CWB] Fwd: annonce de rcqp

Sylvain Loiseau sylvain.loiseau at wanadoo.fr
Mon Jun 25 17:10:56 CEST 2012


Dear all,

We are very pleased to announce the release of an R plugin called 'rcqp' for accessing CWB corpora through R. This plugin has just been published on CRAN (the R repository of plugins) and can be installed fairly easily as any standard R plugin.

This plugin lets you query the corpora of a CWB registry using the CQi set of functions and provides some higher level functions for retrieving kwic, frequency lists or lexical tables in R. This plugin aims at making easier the analysis of CWB corpora with statistical methods. Moreover, CWB has been turned into a shared library and is directly loaded by R, so the rcqp package is a standalone package which does not require the CWB software to be installed on the machine. R data structures are just wrappers for CWB data structures and the access is thus quite efficient. At last, CWB structures (often vectors) are quite easy to manipulate with a vector-oriented language such as R. 

Once installed and loaded, you can access the documentation using :
> vignette("rcqp")

This document may be accessed on line as well at :

http://cran.at.r-project.org/web/packages/rcqp/vignettes/rcqp.pdf

The rcqp plugin requires (as does CWB itself) two libraries: glib2 and pcre. These libraries have to be installed on linux and mac os x in order for the plugin to function properly. No version for Windows is available on CRAN, but a cross-compiled version of the library exists and can be distributed.

We are planning, for a future release, to make the installation simpler on mac and possibly on Windows, as well as to include more and maybe more carefully-designed functions. 

Please give it a try and feel free to make comments and suggestions!
Best regards,
Bernard Desgraupes
Sylvain Loiseau

-------------- next part --------------
An HTML attachment was scrubbed...
URL: http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20120625/ec8e46f8/attachment.htm


More information about the CWB mailing list