RE: [Corpora-List] KILO

From: Adam Kilgarriff (adam@lexmasterclass.com)
Date: Fri Aug 12 2005 - 08:11:59 MET DST

Next message: Lars Nygaard: "Re: [Corpora-List] Extracting only editorial content from a HTML page"

Previous message: InuH: "[Corpora-List] KILO"
In reply to: InuH: "[Corpora-List] KILO"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

HECTOR was a research project between Oxford University Press (for
lexicography) and DEC (for computing) in the early 1990s. It explored the
potential for improving dictionaries and computational lexicons through
close engagement with corpus data and use of sophisticated computing and NLP
software. Lexicographers manually sense-tagged large sets of corpus
examples for a few hundred words, using hardware and software with what was,
for the time, a ground-breaking corpus search facility (and later turned
into Altavista, the Google of its day). The project was the inspiration for
my work in the WASPS project (http://wasps.itri.bton.ac.uk
<http://wasps.itri.bton.ac.uk/> ) and, more recently, the Sketch Engine
(http://sketchengine.co.uk <http://sketchengine.co.uk/> ) Project leaders
from OUP's side were Sue Atkins (http://www.lexmasterclass.com
<http://www.lexmasterclass.com/> ) and Patrick Hanks
http://www.patrickhanks.com/

It provided the English SENSEVAL-1 dictionary and corpus, used to evaluate
Word Sense Disambiguation (WSD) systems (http://www.senseval.org
<http://www.senseval.org/> )

SEMCOR is a manually sense-tagged corpus produced by the Princeton WordNet
team, based on the WordNet sense inventory, freely available and which has
been very widely used for WSD and related tasks. KILO was also a WordNet
team project - see http://wordnet.princeton.edu
<http://wordnet.princeton.edu/> for more info (to state the obvious)

Adam

-----Original Message-----
From: owner-corpora@lists.uib.no [mailto:owner-corpora@lists.uib.no] On
Behalf Of InuH
Sent: 11 August 2005 23:22
To: CORPORA@UIB.NO
Subject: [Corpora-List] KILO

Hi list members,

I wonder if anyone could give me detailed information about KILO, HECTOR and
SEMCOR?

Next message: Lars Nygaard: "Re: [Corpora-List] Extracting only editorial content from a HTML page"
Previous message: InuH: "[Corpora-List] KILO"
In reply to: InuH: "[Corpora-List] KILO"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

This archive was generated by hypermail 2b29 : Fri Aug 12 2005 - 08:25:54 MET DST