[Corpora-List] BulTreeBank data release: Dependency Part and Morphologically Annotated Part

From: Kiril Simov (kivs@bultreebank.org)
Date: Mon Oct 23 2006 - 16:34:38 MET DST

  • Next message: Dr. Lothar Lemnitzer: "[Corpora-List] CALL FOR PAPERS: Lexical-Semantic and Ontological Resources"

    Dear Colleagues,

    I would like to inform you that we have released the dependancy part and the
    morphologically annotated part of our treebank.

    The dependancy part of the treebank contains above 196000 tokens (13200
    sentences).
    It was used for the CoNNL-X shared task this year
    (http://nextens.uvt.nl/~conll/).

    The morphologically annotated part of the treebank contains above 214000
    tokens (15000 sentences).
    It was used for training of the TreeTagger for Bulgarian
    (http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/).

    Both datasets are available from our web page:

    http://www.bultreebank.org/Resources.html

    With best regards,

    Kiril Simov

    -----------------------------------------------------------------
    Kiril Simov
    BulTreeBank Project
    Linguistic Modelling Laboratory, IPP,
    Bulgarian Academy of Sciences
    Acad. G.Bonchev St. 25A
    1113 Sofia, Bulgaria
    E-mail: kivs@bultreebank.org
    Web: http://www.bultreebank.org/
    -----------------------------------------------------------------



    This archive was generated by hypermail 2b29 : Mon Oct 23 2006 - 16:38:56 MET DST