RE: [Corpora-List] POS-tagging learner corpora

From: Rayson, Paul (rayson@exchange.lancs.ac.uk)
Date: Fri Jan 27 2006 - 12:07:11 MET

  • Next message: Vorontsov Alexander: "[Corpora-List] copus of toponyms"

    Hi,

     

    There were a couple of messages last July on corpora about POS tagging
    for learner data (and spoken language). Please see my message at:

     

    http://www.uib.no/mailman/public/corpora/2005-July/001361.html

     

    You should also look at the work of John Milton:

    http://webtools.ust.hk/mmw/Mark_My_Words.htm

    http://webtools.ust.hk/mmw/downloadcmw/

    Resource-Rich Web-Based Feedback: helping learners become independent
    writers

    To appear in Hyland, K & Hyland, F. (Eds) Feedback on ESL Writing.
    Cambridge University Press.

     

    Regards,

    Paul.

     

    Dr. Paul Rayson

    Director of UCREL

    Computing Department, Infolab21, South Drive, Lancaster University,
    Lancaster, LA1 4WA, UK.

    Web: http://www.comp.lancs.ac.uk/computing/users/paul/
    <http://www.comp.lancs.ac.uk/computing/users/paul/>

    Tel: +44 1524 510357 Fax: +44 1524 510492

     

    ________________________________

    From: owner-corpora@lists.uib.no [mailto:owner-corpora@lists.uib.no] On
    Behalf Of Xiaotian Guo
    Sent: 26 January 2006 23:32
    To: corpora@uib.no
    Subject: [Corpora-List] POS-tagging learner corpora

     

    Dear All

     

    Has anybody reviewed or assessed the rate of accuracy of POS-tagging
    (not error tagging) to learner corpora, or simply reported the use of
    POS-tagging in their learner corpora studies? Due to the unexpected
    non-native features of learner English, POS tagging systems designed for
    NSs will need some modification or adjustment when used on learner
    corpora. I wonder whether somebody has already worked out some better
    solutions than simply using POS tagging software for general purposes. I
    am aware that this thorny issue of learner corpora study was addressed
    somewhat before (for example, Granger in Granger 2002) and there were
    few studies based on POS tagging to learner corpora, but in this fast
    growing area there might be some progress which has already largely
    outdated the problem (POS-tagging to learner corpora) (fingers crossed).

     

     

    All the best

     

    Xiaotian Guo

     

    PhD candidate

    The Department of English

    The University of Birmingham



    This archive was generated by hypermail 2b29 : Fri Jan 27 2006 - 12:21:44 MET