[CWB] News texts in CQPWeb

Kurt Sultana kurtanatlus at gmail.com
Thu Jan 24 22:39:53 CET 2013


Hi all,

I have a news corpus which I'd like to load into CQPweb.

I'm currently representing a news text (in Maltese) like this:
<text id="1">
<s>
L NP
- PUN
armi VV
nxtraw VV
separatament MV
minn PRP
l- DDC
istess MJ
kollezzjonista NN
anonimu NN
minn PRP
Texas NP
. PUN
</s>
<s>
Dan PD
ifisser VV
li CMP
l- DDC
armi NN
anke CC
wara PRP
li CMP
nbiegħu VV
se PAF
jibqgħu VV
flimkien MV
. PUN
</s>
</text>

Apart from the body text, a news text usually has a title and a date of
publication. How could I include this information in the example above?
Would these take the form of attributes? And could I run queries
against these new attributes?
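
To make the question concrete, here is a minimal Python sketch of what I
have in mind: it writes one text in the same one-token-per-line format,
with the title and date carried as extra attributes on the <text> tag.
The function name to_vertical, the attribute names "title" and "date",
the tab separator and the sample values are all my own assumptions for
illustration; whether CQPweb would pick up such attributes or expects the
metadata to be supplied some other way is exactly what I am unsure about.

```python
from xml.sax.saxutils import escape


def to_vertical(text_id, title, date, sentences, sep="\t"):
    """Render one news text, one token per line, as in the sample above.

    `sentences` is a list of sentences, each a list of (word, tag) pairs.
    `title` and `date` are hypothetical extra attributes on <text>.
    `sep` is the column separator (a tab is assumed here).
    """
    def attr(value):
        # Escape &, <, > and double quotes so the tag stays well-formed.
        return '"' + escape(value, {'"': "&quot;"}) + '"'

    lines = [
        f"<text id={attr(text_id)} title={attr(title)} date={attr(date)}>"
    ]
    for sentence in sentences:
        lines.append("<s>")
        lines.extend(f"{word}{sep}{tag}" for word, tag in sentence)
        lines.append("</s>")
    lines.append("</text>")
    return "\n".join(lines)


if __name__ == "__main__":
    # Placeholder metadata, not the real title or date of the article.
    sample = [[("Dan", "PD"), ("ifisser", "VV"), (".", "PUN")]]
    print(to_vertical("1", "Placeholder title", "2013-01-24", sample))
```

Called as in the last lines, this prints a <text> element whose opening
tag carries id, title and date, followed by the tokens exactly as before.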

Thanks in advance,
Kurt