[Corpora-List] POS-tagging for spoken English and learner English

From: Adam Kilgarriff (adam@lexmasterclass.com)
Date: Thu Jul 21 2005 - 12:37:13 MET DST

  • Next message: Marco Baroni: "[Corpora-List] slides from cl 2005 web-as-corpus workshop"

                      POS-tagging spoken and learner English
                      ======================================
     
          We have a corpus of spoken English (BASE
    http://www.rdg.ac.uk/AcaDepts/ll/base_corpus/ a British equivalent of the
    American MICASE http://www.hti.umich.edu/m/micase/ ) and are now assessing
    how to (automatically) POS-tag it. We are also interested in automatic
    POS-tagging of learner English (which may involve some of the same
    'robustness' issues, even if the linguistics is different)

          Do you have recent experiences of using available taggers on either of
    these kinds of data?

            Reports including accuracy figures would be particularly useful.

            Thank you in advance,

                    Adam Kilgarriff

    ====================================================
    Adam Kilgarriff
    Lexicography MasterClass http://lexmasterclass.com
    Lexical Computing Ltd http://sketchengine.co.uk
    University of Sussex
    mailto:adam@lexmasterclass.com +44 (0)1273 705773
    ====================================================



    This archive was generated by hypermail 2b29 : Thu Jul 21 2005 - 12:49:48 MET DST