[CWB] News texts in CQPWeb
Martí Quixal
marti.quixal at gmail.com
Sun Jan 27 12:34:35 CET 2013
Yes, sir, that's what you did, and it is right. I recommend that you have a
look at the great documentation manuals.
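As a rough illustration of why the compact declaration quoted below works, here is a minimal Python sketch. It assumes the usual CWB convention that a declaration such as news:0+title+source+date yields one s-attribute for the element itself plus one per XML attribute, named element_attribute. The helper name expand_s_attribute_spec is hypothetical and not part of CWB or CQPweb. Under that assumption, declaring news_title on its own would describe a separate XML element called news_title, which never occurs in the text, and that may be why it ended up without annotated values.

```python
def expand_s_attribute_spec(spec: str) -> list[str]:
    """Expand a declaration like 'news:0+title+source+date' into the
    s-attribute names it is assumed to produce (hypothetical helper)."""
    # The first '+'-separated field is the element, optionally with ':<depth>'.
    head, *xml_attrs = spec.split("+")
    element = head.split(":")[0]
    # One attribute for the element itself, then one per XML attribute.
    return [element] + [f"{element}_{attr}" for attr in xml_attrs]


if __name__ == "__main__":
    # The three declarations reported as working in the quoted message.
    for spec in ["news:0+title+source+date", "s", "p:0+id"]:
        print(spec, "->", ", ".join(expand_s_attribute_spec(spec)))
```

The names it prints for the news declaration are the ones to select when creating the metadata table from XML.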
Best,
Martí
On Sun, Jan 27, 2013 at 5:00 AM, <cwb-request at sslmit.unibo.it> wrote:
>
> Today's Topics:
>
> 1. Re: News texts in CQPWeb (Kurt Sultana)
> 2. Re: News texts in CQPWeb (Kurt Sultana)
>
>
> ----------------------------------------------------------------------
>
> Message: 1
> Date: Sat, 26 Jan 2013 19:41:42 +0100
> From: Kurt Sultana <kurtanatlus at gmail.com>
> To: Open source development of the Corpus WorkBench
> <cwb at sslmit.unibo.it>
> Subject: Re: [CWB] News texts in CQPWeb
> Message-ID:
> <CAEcwAkXzBbyQhZ1QMu581jU44RE_iAJS9FmhFy9O1Y=
> JvXbByg at mail.gmail.com>
> Content-Type: text/plain; charset="utf-8"
>
> Hi,
>
> I've dug around a bit and learned that the attributes I mentioned are
> stored as s-attributes. So, I have this example text:
>
> <news title="A Thrilling Experience" date="01/01/2013" source="www.timesofmalta.com">
> <text id="4">
> <p id="1">
> <s>
> Tick NN tick
> . SENT .
> </s>
> <s>
> A DT a
> clock NN clock
> . SENT .
> </s>
> <s>
> Tick VB tick
> , , ,
> tick VB tick
> . SENT .
> </s>
> </p>
> </text>
> </news>
>
> As s-attributes (XML elements) I put in *p*, *p_id*, *news*, *news_title*,
> *news_source* and *news_date*. Upon installing the corpus, I chose to
> install metadata from the XML annotation within the corpus and selected
> *news_title*, *news_source* and *news_date*. However, when I click on
> "Create metadata table from XML using settings above", I get this error:
>
> *Error message*
> **** CQP ERROR ****
> CQP Error:
> No annotated values for s-attribute ``news_title'' in named query c_M_F_xml
>
>
> I'm not 100% confident of what I'm doing since it's my first time, so I
> might have easily misunderstood something. What am I doing wrong?
>
> Many thanks in advance,
> Kurt
>
>
> On Thu, Jan 24, 2013 at 10:39 PM, Kurt Sultana <kurtanatlus at gmail.com> wrote:
>
> > Hi all,
> >
> > I have a news corpus which I'd like to put in CQPWeb.
> >
> > I'm currently representing a news text (in Maltese) like this:
> > <text id="1">
> > <s>
> > L NP
> > - PUN
> > armi VV
> > nxtraw VV
> > separatament MV
> > minn PRP
> > l- DDC
> > istess MJ
> > kollezzjonista NN
> > anonimu NN
> > minn PRP
> > Texas NP
> > . PUN
> > </s>
> > <s>
> > Dan PD
> > ifisser VV
> > li CMP
> > l- DDC
> > armi NN
> > anke CC
> > wara PRP
> > li CMP
> > nbiegħu VV
> > se PAF
> > jibqgħu VV
> > flimkien MV
> > . PUN
> > </s>
> > </text>
> >
> > A news text, apart from text, usually contains the title and date of
> > publication. How could I include this information in the above, for
> > example? Would these take the form of attributes? And could I run queries
> > against these new attributes?
> >
> > Thanks in advance,
> > Kurt
> >
>
> ------------------------------
>
> Message: 2
> Date: Sun, 27 Jan 2013 10:21:25 +0100
> From: Kurt Sultana <kurtanatlus at gmail.com>
> To: Open source development of the Corpus WorkBench
> <cwb at sslmit.unibo.it>
> Subject: Re: [CWB] News texts in CQPWeb
> Message-ID:
> <CAEcwAkVDfN2W9hd_0Z0+ZM0s3bJ7rAB=
> 4e3KLMe8Tgdi8_74hw at mail.gmail.com>
> Content-Type: text/plain; charset="utf-8"
>
> I bumped into an interesting mailing-list post and have now put in
> *news:0+title+source+date*, *s* and *p:0+id* as s-attributes. It seems to
> be working now. Could anyone confirm I'm doing this right?
>
> Thanks,
> Kurt
>
>
>
--
Martí Quixal, Ph.D.
Computational Linguist & Educational Technologist
http://www.iqubo.org/quixal