[Corpora-List] Annotation Tool for German corpora/NE recognition task

From: Michael Sonntag (sonntag_michael@hotmail.com)
Date: Tue Oct 17 2006 - 15:57:44 MET DST

  • Next message: Frank.Schilder@thomson.com: "RE: [Corpora-List] Annotation Tool for German corpora/NE recognition task"

    Dear all,

    I am currently undertaking a master thesis in the area of toponym
    recognition within German texts.
    I have already quite large German corpora for this endevour, and I have
    build some models with the help of UIMA and Gazetteers to extract toponyms.

    What I am really missing is:
    - a good tool to annotate some documents quickly, i.e. with information
    about : toponym, first and surname, and other NE´s. This, to get an idea
    (prec.+recall) about the quality of my models.
    - still better: an annotated corpus. Is there any out there?

    To get an idea of my model(s) and toponym extraction, I put together a
    Google Map with my extraction results. For the interested :
    www.msonntag.de/map/map.html
    (quite a lot of data, so it might take a while; results are very very bad at
    the time being, but there are still some things to do, so I am not worried
    about that)

    Cheers & thx for your time, yours
    Dr. Michael Sonntag
    Univ. Bamberg



    This archive was generated by hypermail 2b29 : Tue Oct 17 2006 - 15:55:38 MET DST