Re: [Corpora-List] Phrase extraction

From: Marco Baroni (baroni@sslmit.unibo.it)
Date: Mon Oct 24 2005 - 21:22:52 MET DST

Next message: David Brooks: "[Corpora-List] [Corpora List] looking for "Intonation Phrase" corpora"

Previous message: Helge Thomas Karset Hellerud: "[Corpora-List] Phrase extraction"
In reply to: Helge Thomas Karset Hellerud: "[Corpora-List] Phrase extraction"
Next in thread: Antti Arppe: "Re: [Corpora-List] Phrase extraction"

Hi there.

Regarding the first option (creating a tagger for Norwegian):

Perhaps this is obvious, but if you are willing to tag a certain number
of documents by hand (say, about 15,000 words), you can then "train" a
part-of-speech tagger on them, e.g., one or more of the acopost taggers
(http://sourceforge.net/projects/acopost/). Alternatively, you could try
to contact somebody who has already done this (just look for information
on annotated Norwegian corpora on the Web) and see whether they can let
you use their tagger, or at least let you train a tagger on their
annotated data.
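
To make the idea of "training" on hand-tagged text concrete, here is a
minimal sketch in Python. It is not how the acopost taggers work; it is
only a most-frequent-tag baseline, which remembers the commonest tag seen
for each word and falls back to the overall commonest tag for unknown
words. The function names, the toy sentences and the tag labels (PRON,
VERB, ...) are all assumptions made up for illustration.

```python
from collections import Counter, defaultdict


def train_baseline_tagger(tagged_sentences):
    """Learn the most frequent tag per word from hand-tagged sentences.

    tagged_sentences is a list of sentences, each a list of (word, tag)
    pairs. Returns (lexicon, default_tag).
    """
    tag_counts_per_word = defaultdict(Counter)
    overall_tag_counts = Counter()
    for sentence in tagged_sentences:
        for word, tag in sentence:
            tag_counts_per_word[word.lower()][tag] += 1
            overall_tag_counts[tag] += 1
    # For each word, keep only the tag it was seen with most often.
    lexicon = {
        word: counts.most_common(1)[0][0]
        for word, counts in tag_counts_per_word.items()
    }
    # Words never seen in training get the overall most frequent tag.
    default_tag = overall_tag_counts.most_common(1)[0][0]
    return lexicon, default_tag


def tag_words(words, lexicon, default_tag):
    """Tag a list of words using the learned lexicon."""
    return [(w, lexicon.get(w.lower(), default_tag)) for w in words]


# Toy hand-tagged data (hypothetical tag set, far smaller than the
# roughly 15,000 words suggested above).
training = [
    [("jeg", "PRON"), ("leser", "VERB"), ("en", "DET"), ("bok", "NOUN")],
    [("hun", "PRON"), ("kjøper", "VERB"), ("en", "DET"), ("avis", "NOUN")],
    [("boken", "NOUN"), ("og", "CONJ"), ("avisen", "NOUN")],
]

lexicon, default_tag = train_baseline_tagger(training)
result = tag_words(["hun", "leser", "en", "avis"], lexicon, default_tag)
print(result)
```

A real tagger also uses the surrounding context and word endings to
guess the tags of ambiguous and unknown words, which is why a trained
statistical tagger should do better than this baseline on the same
hand-tagged data.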

Regards,

Marco

