[CWB] Embedded sattributes
Maarten Janssen
maartenpt at gmail.com
Wed Apr 24 14:57:36 CEST 2019
Hi Andrew,
>>> there is a conceptually odd pattribute nbc in there
>
> This is in the 0.99 version, but not the 1.0 version ? so yes, you?re right, it is odd and it has been removed! The 1.0 files are in the repo here:
> https://sourceforge.net/p/cwb/code/HEAD/tree/doc/corpora/dickens/release/ <https://sourceforge.net/p/cwb/code/HEAD/tree/doc/corpora/dickens/release/>
Right - then it would prob. be good to update the link here, since that is where I got the corpus today, and which is where most other people are also likely to get it, and which indeed points to version 0.99:
http://cwb.sourceforge.net/download.php#corpora
> The current approach is a better-than-nothing attempt to deal with embedding by encoded embedded instances of an XML tag onto the separate attributes created by the numbers. So CQP doesn?t really ?know? about these in any sense. np1 and np2 are as different as text and chapter. Thus no support in search.
Understood. But why are they then simple np1_h ? I see that makes no difference for the system, but it seems the more logical naming somehow…
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20190424/ce5fef06/attachment.html>
More information about the CWB
mailing list