[CWB] Datatype of xml attributes in CQPweb

Meier-Vieracker, Simon simon.meier at tu-berlin.de
Tue Mar 14 18:13:34 CET 2017


Addendum: My registry file looks as follows:

--------

##
## p-attributes (token annotations)
##

ATTRIBUTE word
ATTRIBUTE pos
ATTRIBUTE lemma


##
## s-attributes (structural markup)
##

# <corpus> ... </corpus>
STRUCTURE corpus

# <text id=".." quelle=".." liga=".." saison=".." match=".." url=".."> ... </te$
# (no recursive embedding allowed)
STRUCTURE text
STRUCTURE text_id              # [annotations]
STRUCTURE text_quelle          # [annotations]
STRUCTURE text_liga            # [annotations]
STRUCTURE text_saison          # [annotations]
STRUCTURE text_match           # [annotations]
STRUCTURE text_url             # [annotations]

# <p> ... </p>
STRUCTURE p


# Yours sincerely, the Encode tool.

------------

Best, Simon



Am 14.03.2017 um 17:58 schrieb Meier-Vieracker, Simon <simon.meier at tu-berlin.de<mailto:simon.meier at tu-berlin.de>>:

Hi,

my question of this morning has been resolved (it was a permission problem indeed).

CQPweb works fine now, but I still have a problem with my xml attributes that I want to use as classificatory metadata.

This is what my text template looks like:

<text id="1" quelle="weltfussball" liga="bundesliga" saison="2006_2007" match="bayern-muenchen-borussia-dortmund" url="http://www.weltfussball.de/spielbericht/bundesliga-2006-2007-bayern-muenchen-borussia-dortmund/liveticker/">
<p>
…
</text>

At least the attributes „quelle“ „liga“ and „saison“ I should be able to use as text classifications, for all the values have less than 20 characters, [a-z][A-Z][0-9] and underscore. However, when I try to install metadata from within-corpus XML annotation, the datatype of all field handles is already fixed as free text and cannot be changed.

Thanks in advance
Simon



Am 14.03.2017 um 10:36 schrieb Meier-Vieracker, Simon <simon.meier at tu-berlin.de<mailto:simon.meier at tu-berlin.de>>:

Hi everyone,

I’m new to CQPweb, which I have installed on a Debian server, and I have the following problem with the installation of a new corpus (called ‚herrndorf‘):

I chose "Click here to install a corpus you have already indexed in CWB“ (and the CWB runs perfectly with the corpus), and CQPweb tells me that the corpus was successfully installed. But when trying to design and insert the metadata-table I get the following error:

#####

Not Found

The requested URL /herrndorf/index.php was not found on this server.

Apache/2.4.10 (Debian) Server at fussballlinguistik.linguistik.tu-berlin.de<http://fussballlinguistik.linguistik.tu-berlin.de/> Port 443

#####

Do you have any idea what could have gone wrong? Is there a problem with the permissions? In this tutorial http://chozelinek.github.io/sacoco/cqpwebsetup.html I found that I "have to check the ownership of the folder/file:
• the owner should be wwwrun
• the group should be www“

Thanks in advance

Simon


-------

Dr. Simon Meier

Technische Universität Berlin
Institut für Sprache und Kommunikation
Fachgebiet Allgemeine Linguistik
Sekretariat H42
Straße des 17. Juni 135, 10623 Berlin
+49 (0) 30 314 22323
simon.meier at tu-berlin.de<mailto:simon.meier at tu-berlin.de>
http://www.linguistik.tu-berlin.de/menue/mitarbeiterinnen/wiss_mitarbeiterinnen/simon_meier/




_______________________________________________
CWB mailing list
CWB at sslmit.unibo.it<mailto:CWB at sslmit.unibo.it>
http://liste.sslmit.unibo.it/mailman/listinfo/cwb
_______________________________________________
CWB mailing list
CWB at sslmit.unibo.it<mailto:CWB at sslmit.unibo.it>
http://liste.sslmit.unibo.it/mailman/listinfo/cwb

-------

Dr. Simon Meier

Technische Universität Berlin
Institut für Sprache und Kommunikation
Fachgebiet Allgemeine Linguistik
Sekretariat H42
Straße des 17. Juni 135, 10623 Berlin
+49 (0) 30 314 22323
simon.meier at tu-berlin.de<mailto:simon.meier at tu-berlin.de>
http://www.linguistik.tu-berlin.de/menue/mitarbeiterinnen/wiss_mitarbeiterinnen/simon_meier/




-------------- n�chster Teil --------------
Ein Dateianhang mit HTML-Daten wurde abgetrennt...
URL: <http://liste.sslmit.unibo.it/pipermail/cwb/attachments/20170314/2476e550/attachment-0001.html>


More information about the CWB mailing list