[Corpora-List] chinese pos tagger/lemmatizer

From: Marco Baroni (baroni@sslmit.unibo.it)
Date: Thu Jan 19 2006 - 14:28:23 MET

  • Next message: Bruce L. Lambert, Ph.D.: "Re: [Corpora-List] efficient decision tree tool?"

    Dear all,

    Does anybody know of a tokenizer/POS tagger for the Chinese language,
    ideally with these characteristics:

    - documented in English
    - free or cheap
    - runs on the Unix command line, more or less out-of-the-box

    Moreover, we are also looking for a tool/electronic resource that, given a
    tokenized word, would provide a pinyin transcription of the word. Does such
    a tool exist?

    Thanks in advance for the advice.

    Regards,

    Marco

    -- 
    Marco Baroni
    SSLMIT, University of Bologna
    http://sslmit.unibo.it/~baroni
    



    This archive was generated by hypermail 2b29 : Thu Jan 19 2006 - 14:45:16 MET