[Corpora-List] CCGbank

From: Julia Hockenmaier (juliahr@cis.upenn.edu)
Date: Wed Jun 08 2005 - 20:18:55 MET DST

    CCGbank is now available from the LDC:
    http://www.ldc.upenn.edu/Catalog/CatalogEntry.jsp?catalogId=LDC2005T13

    CCGbank is a translation of the Penn Treebank into a corpus of
    Combinatory Categorial Grammar derivations. It pairs syntactic
    derivations with sets of word-word dependencies which approximate the
    underlying predicate-argument structure. CCGbank covers 99.44% of
    the sentences in the Penn Treebank and corrects a number of
    inconsistencies and errors in the original annotation of those
    sentences.
    CCGbank can also be searched with Douglas Rohde's TGrep2, version 1.15 or higher.

    Julia Hockenmaier and Mark Steedman
    juliahr@cis.upenn.edu, steedman@inf.ed.ac.uk

    http://groups.inf.ed.ac.uk/ccg


