RE: [Corpora-List] About multiple sense tags used in the answer key file of Senseval

From: Adam Kilgarriff (adam@lexmasterclass.com)
Date: Mon May 09 2005 - 10:13:54 MET DST

  • Next message: paul.mcnamee@jhuapl.edu: "Re: [Corpora-List] Corpus of academic German?"

    Dear Robin,

    The trouble with word sense disambiguation is word senses. They just won't
    behave.

    Sometimes, the best that a human can do is to say that a corpus instance is
    related to more than one word sense (so it is tagged with multiple sense
    tags) or that it is unassignable (U) or that it is like one of the senses in
    one way but not in others (combination of U and one or more regular sense
    tag.) This is the scheme we have used for English for all three Sensevals,
    you can find descriptions in the SENSEVAL 1 Special Issue of Computers and
    the Humanities 34 (1-2) amongst other places, here are links to papers that
    discuss it

            Best
                    Adam

    2000 (with Joseph Rosenzweig) "English Framework and Results
    <http://www.lexmasterclass.com/people/Publications/2000-KilgRosenzweig-Sense
    val1frame.pdf> ." Computers and the Humanities 34 (1-2), Special Issue on
    SENSEVAL.
    2000 (with Martha Palmer) Introduction to the Special Issue on SENSEVAL
    <http://www.lexmasterclass.com/people/Publications/2000-KilgPalmer-Senseval1
    Intro.pdf> . Computers and the Humanities 34 (1-2). (Also guest editors for
    the Special Issue)

    -----Original Message-----
    From: owner-corpora@lists.uib.no [mailto:owner-corpora@lists.uib.no] On
    Behalf Of Yuanyong Wang
    Sent: 09 May 2005 07:34
    To: corpora@uib.no
    Subject: [Corpora-List] About multiple sense tags used in the answer key
    file of Senseval

             Dear list memebers, I'm a research student doing research on NLP,
    word sense disambiguation in particular. I am planning to conduct
    experiment on the Senseval-3 data. But after reading the answer key file,
    one fact appears a bit confusing, sometimes for one test case, multiple
    sense
    tags are given, and one of the multiple sense tags could be simply a
    letter "U". I tried and looked into the documentation, but haven't found
    any relevant information. Could anyone kindly tell me how to make sense
    of those multiple sense tag cases or probably point me to the relevant
    documentation?

           Regards
           Robin.



    This archive was generated by hypermail 2b29 : Mon May 09 2005 - 15:04:30 MET DST