RE: [Corpora-List] Chinese POS tagger and syntactic parser.

From: Xiao, Zhonghua (z.xiao@lancaster.ac.uk)
Date: Sun May 29 2005 - 11:39:11 MET DST

  • Next message: Andy Roberts: "Re: [Corpora-List] Arabic language under Linux"

     
    The best Chinese POS tagger I have tried is ICTCLAS (the Chinese Lexical Analysis System developed by the Institute of Computing Technologies, the Chinese Academia, Beijing), which is available (source codes and binary compiled for Windows) at
    http://mtgroup.ict.ac.cn/~zhp/ICTCLAS/index.html <http://mtgroup.ict.ac.cn/~zhp/ICTCLAS/index.html>
     
    I haven't tried their Chinese parser extensively, but the output of the online demo seems alright.
    http://mtgroup.ict.ac.cn/parserform.html <http://mtgroup.ict.ac.cn/parserform.html>
     
     
    Richard Xiao

    ________________________________

    From: owner-corpora@lists.uib.no on behalf of Yuanyong Wang
    Sent: Sun 29/05/2005 10:18
    To: CORPORA@UIB.NO
    Subject: [Corpora-List] Chinese POS tagger and syntactic parser.

           Dear list memebers, I'm a research student at UNSW (university of
    New South Wales, Australia) doing research on NLP, (WSD)word sense
    disambiguation in particular. Recently, I'm attempting to utilize the
    information provided by bilingual approaches (such as machine translation)
    back on WSD in English. For that reason, I'm trying to set up the
    environment for machine translation between English and Chinses. But I
    found it quite cumbersome to set up such an environment under Linux,
    should I just switch to Windows? I also tried a couple of Chinese POS
    taggers and parsers, some are ok, but apparently I am expecting something
    better. Could anyone kindly suggest me with some top Chinses POS taggers
    and syntactic parsers? I'm also looking forward to exchanging ideas and
    knowledge with anyone who's interested in similar topic. Thanks very much.

           Regards
           Robin



    This archive was generated by hypermail 2b29 : Sun May 29 2005 - 11:48:17 MET DST