Re: [Corpora-List] POS Tagger for German / Java

From: Ciarán Ó Duibhín (ciaran@oduibhin.freeserve.co.uk)
Date: Wed Jan 10 2007 - 14:49:00 MET

  • Next message: Maria Esteva: "[Corpora-List] language sort"

    Michael,
    I cannot judge how good it is, but you might look at the Stuttgart Tree
    Tagger
    http://www.ims.uni-stuttgart.de/projekte/corplex/TreeTagger/DecisionTreeTagg
    er.html which has been trained for German (among other languages).
    I think it is written in C, but I'm not sure of that.
    Ciarán Ó Duibhín.

    ----- Original Message -----
    From: "Michael Sonntag" <sonntag_michael@hotmail.com>
    To: <CORPORA@UIB.NO>
    Sent: Tuesday, January 09, 2007 7:51 PM
    Subject: [Corpora-List] POS Tagger for German / Java

    > Hi all,
    >
    > I am currently working on a system for toponym recognition in natural
    german
    > (web-based) text documents, as my master thesis.
    > The system uses a POS tagger for extracting good NE candidates for a
    > gazetteer.
    >
    > Now, here my question arises
    > 1. Do you know of any good POS tagger for German language, best
    Java-based?
    > (I need only the NE-tagged tokens.)
    > 2. I used tnt, but that one is based on perl/C, and it is not easy to
    > integrate into my java framework.
    > 3. I also used qtag. But it comes only with a, for my task too small data
    > base (lexicon and matrix).
    >
    > So, is there any POS tagger out there that is easy to use and up for the
    > task?
    >
    > Cheers & thx for listening in, yours
    > Mike Sonntag
    >
    > _________________________________________________________________
    > Sie suchen E-Mails, Dokumente oder Fotos? Die neue MSN Suche Toolbar mit
    > Windows-Desktopsuche liefert in sekundenschnelle Ergebnisse. Jetzt neu!
    > http://desktop.msn.de/ Jetzt gratis downloaden!
    >
    >
    >



    This archive was generated by hypermail 2b29 : Wed Jan 10 2007 - 13:52:02 MET