RE: [Corpora-List] KILO

From: Adam Kilgarriff (adam@lexmasterclass.com)
Date: Fri Aug 12 2005 - 08:11:59 MET DST

  • Next message: Lars Nygaard: "Re: [Corpora-List] Extracting only editorial content from a HTML page"

    HECTOR was a research project between Oxford University Press (for
    lexicography) and DEC (for computing) in the early 1990s. It explored the
    potential for improving dictionaries and computational lexicons through
    close engagement with corpus data and use of sophisticated computing and NLP
    software. Lexicographers manually sense-tagged large sets of corpus
    examples for a few hundred words, using hardware and software with what was,
    for the time, a ground-breaking corpus search facility (and later turned
    into Altavista, the Google of its day). The project was the inspiration for
    my work in the WASPS project (http://wasps.itri.bton.ac.uk
    <http://wasps.itri.bton.ac.uk/> ) and, more recently, the Sketch Engine
    (http://sketchengine.co.uk <http://sketchengine.co.uk/> ) Project leaders
    from OUP's side were Sue Atkins (http://www.lexmasterclass.com
    <http://www.lexmasterclass.com/> ) and Patrick Hanks
    http://www.patrickhanks.com/

     

    It provided the English SENSEVAL-1 dictionary and corpus, used to evaluate
    Word Sense Disambiguation (WSD) systems (http://www.senseval.org
    <http://www.senseval.org/> )

     

    SEMCOR is a manually sense-tagged corpus produced by the Princeton WordNet
    team, based on the WordNet sense inventory, freely available and which has
    been very widely used for WSD and related tasks. KILO was also a WordNet
    team project - see http://wordnet.princeton.edu
    <http://wordnet.princeton.edu/> for more info (to state the obvious)

     

    Adam

     

     

    -----Original Message-----
    From: owner-corpora@lists.uib.no [mailto:owner-corpora@lists.uib.no] On
    Behalf Of InuH
    Sent: 11 August 2005 23:22
    To: CORPORA@UIB.NO
    Subject: [Corpora-List] KILO

     

    Hi list members,

    I wonder if anyone could give me detailed information about KILO, HECTOR and
    SEMCOR?



    This archive was generated by hypermail 2b29 : Fri Aug 12 2005 - 08:25:54 MET DST