[CWB] xml fiels

Ruprecht von Waldenfels ruprecht.waldenfels at gmx.net
Wed Dec 17 15:16:46 CET 2014


Hi,
this is really not TEI, but it's a simple XML format  - here's an XSLT 
script that should work:

<?xml version="1.0" encoding="UTF-8"?>
<xsl:stylesheet version="2.0" 
xmlns:xsl="http://www.w3.org/1999/XSL/Transform" 
xmlns:fn="http://www.w3.org/2003/11/xpath-functions">
     <xsl:output method="txt" omit-xml-declaration="yes"/>
     <xsl:strip-space elements="*"/>


<xsl:template match="w">
<xsl:value-of select="."/>
<xsl:value-of select="'&#9;'"/>
<xsl:value-of select="@lemma"/>
<xsl:value-of select="'&#9;'"/>
<xsl:value-of select="@msd"/>
<xsl:value-of select="'&#10;'"/>
</xsl:template>


</xsl:stylesheet>


If you want any more w attributes to be preserved in the CWB input, just 
add attributes in an analogous way.

Best!
Ruprecht



Am 17.12.2014 um 14:00 schrieb Ingrid Sör:
> Thanks for your reply Ruprecht.
> I am sending you a short excerpt of the beginning of one corpus, as I 
> can't find information regarding if they are TEI or not and can't tell 
> myself. If you can see that it is TEI, I would be very happy to try 
> your XSLT script - very kind of you to share your code.
>
> Best, Ingrid
>
>
> On 17 December 2014 at 12:21, Ruprecht von Waldenfels 
> <waldenfels at issl.unibe.ch <mailto:waldenfels at issl.unibe.ch>> wrote:
>
>     Hi,
>     if this is TEI, I can send you my XSLT script.
>     Best,
>     Ruprecht
>     Am 17.12.2014 um 12:00 schrieb cwb-request at sslmit.unibo.it
>     <mailto:cwb-request at sslmit.unibo.it>:
>>     Send CWB mailing list submissions to
>>     	cwb at sslmit.unibo.it  <mailto:cwb at sslmit.unibo.it>
>>
>>     To subscribe or unsubscribe via the World Wide Web, visit
>>     	http://devel.sslmit.unibo.it/mailman/listinfo/cwb
>>     or, via email, send a message with subject or body 'help' to
>>     	cwb-request at sslmit.unibo.it  <mailto:cwb-request at sslmit.unibo.it>
>>
>>     You can reach the person managing the list at
>>     	cwb-owner at sslmit.unibo.it  <mailto:cwb-owner at sslmit.unibo.it>
>>
>>     When replying, please edit your Subject line so it is more specific
>>     than "Re: Contents of CWB digest..."
>>
>>
>>     Today's Topics:
>>
>>         1. Bug report-CQPweb 3.1.11 (Umut Demirhan)
>>         2. Re: Bug report-CQPweb 3.1.11 (Hardie, Andrew)
>>         3. xml files (Ingrid S?r)
>>
>>
>>     _______________________________________________
>>     CWB mailing list
>>     CWB at sslmit.unibo.it  <mailto:CWB at sslmit.unibo.it>
>>     http://devel.sslmit.unibo.it/mailman/listinfo/cwb
>
>
>     _______________________________________________
>     CWB mailing list
>     CWB at sslmit.unibo.it <mailto:CWB at sslmit.unibo.it>
>     http://devel.sslmit.unibo.it/mailman/listinfo/cwb
>
>
>
> _______________________________________________
> CWB mailing list
> CWB at sslmit.unibo.it
> http://devel.sslmit.unibo.it/mailman/listinfo/cwb

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://devel.sslmit.unibo.it/pipermail/cwb/attachments/20141217/0c6e0520/attachment-0001.html>


More information about the CWB mailing list