Mercurius Treebank

    The aim of this project was to enrich an Early New High German newspaper text with syntactic information. We used the TIGER annotation scheme, which was developed out of the NEGRA scheme and facilitates semi-automatic annotation of constituents and syntactic functions while also permitting crossing edges. The annotation process was meant to identify the specific problems posed by the syntactic interpretation of historical texts and to adjust the processes involved accordingly. At the same time, the project created the first building block for a treebank of Early New High German. The treebank has since been expanded to include another year of newspapers. With the Mercurius of 1667 and the Annus Christi of 1597, we now have a syntactically annotated newspaper corpus containing 170,000 word forms.

    Project duration: 2003 to 2005


    • Demske, Ulrike, Frank, Nicola, Laufer, Stefanie, Stiemer, Hendrik (2004). Syntactic Interpretation of an Early New High German Corpus. In: Sandra Kübler et al. (eds.): Proceedings of the Third Workshop on Treebanks and Linguistic Theories (TLT 2004). Tübingen, 175-182.
    • Demske, Ulrike (2007). Das Mercurius-Projekt: eine Baumbank für das Frühneuhochdeutsche. In: Gisela Zifonun & Werner Kallmeyer (eds.): Sprachkorpora - Datenmengen und Erkenntnisfortschritt. Berlin: de Gruyter (= Jahrbuch des Instituts für deutsche Sprache 2006), 91-104.