Re: [Corpora-List] Phrase extraction

From: Antti Arppe (aarppe@ling.helsinki.fi)
Date: Tue Oct 25 2005 - 11:32:49 MET DST

    On Mon, 24 Oct 2005, Helge Thomas Karset Hellerud wrote:
    > PoS (Part of Speech) tagging is often used to extract phrases from text
    > (like Noun Phrases). But that approach assumes you have a PoS tagger
    > available. My document collection is in Norwegian, but I don't have a
    > Norwegian tagger.
    >
    > 1) Is there a way to create a simple PoS tagger to recognize verbs,
    > nouns and adjectives (in Norwegian)?

    Before creating your own tagger, have you or your department
    considered obtaining or licensing Multitagger (a PoS tagger for
    Norwegian created at Universitetet i Oslo's Textlaboratoriet by
    Janne Bondi Johannessen) or an academic version of Connexor's
    dependency parser (Machinese) for Norwegian?

             -Antti Arppe


