<html><head><meta http-equiv="Content-Type" content="text/html; charset=utf-8"></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; line-break: after-white-space;" class=""><div class="">Hi Andrew,</div><div class=""><br class=""><blockquote type="cite" class=""><blockquote type="cite" class=""></blockquote></blockquote><blockquote type="cite" class=""><blockquote type="cite" class=""><blockquote type="cite" class="">there is a conceptually odd pattribute nbc in there<br class=""></blockquote></blockquote><br class="">This is in the 0.99 version, but not the 1.0 version ? so yes, you?re right, it is odd and it has been removed! The 1.0 files are in the repo here:<br class=""><a href="https://sourceforge.net/p/cwb/code/HEAD/tree/doc/corpora/dickens/release/" class="">https://sourceforge.net/p/cwb/code/HEAD/tree/doc/corpora/dickens/release/</a></blockquote><div class=""><br class=""></div><div class="">Right - then it would prob. be good to update the link here, since that is where I got the corpus today, and which is where most other people are also likely to get it, and which indeed points to version 0.99:&nbsp;</div><div class=""><br class=""></div><div class=""><a href="http://cwb.sourceforge.net/download.php#corpora" class="">http://cwb.sourceforge.net/download.php#corpora</a></div><br class=""><blockquote type="cite" class="">The current approach is a better-than-nothing attempt to deal with embedding by encoded embedded instances of an XML tag onto the separate attributes created by the numbers. So CQP doesn?t really ?know? about these in any sense. np1 and np2 are as different as text and chapter. Thus no support in search.<br class=""></blockquote><br class=""></div><div class="">Understood. But why are they then simple np1_h ? I see that makes no difference for the system, but it seems the more logical naming somehow…</div><div class=""><br class=""></div><div class=""><br class=""></div></body></html>