RE: [Corpora-List] wordlist-similarity tools in Java?

From: Andy Roberts (andyr@comp.leeds.ac.uk)
Date: Sat Feb 24 2007 - 16:25:20 MET

    Nuno,

    That's a very handy library. However, may I ask why you don't use POS
    information in your index (and therefore in your similarity algorithms)?
    Your library permits comparisons at the sense level, but I don't see how
    you can compare sense 1 of "live" with "dwell" without first telling the
    program that you're only interested in the verb and not the other two
    homographs.
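
    To illustrate what I mean, here is a minimal sketch of an index keyed
    on lemma, POS and sense number. It is not your library's API: the names
    SenseKeyDemo, SenseKey, Pos and buildToyIndex are hypothetical, and the
    stored values are placeholder strings rather than real WordNet synset
    identifiers. It assumes Java 16 or later (for records).

```java
import java.util.HashMap;
import java.util.Map;

public class SenseKeyDemo {

    // Parts of speech distinguished by WordNet.
    enum Pos { NOUN, VERB, ADJECTIVE, ADVERB }

    // Hypothetical lookup key: a sense number is only meaningful
    // once the part of speech is fixed, so POS is part of the key.
    record SenseKey(String lemma, Pos pos, int senseNumber) {}

    // Toy index mapping each key to a placeholder identifier.
    // The values are made up for illustration, not WordNet offsets.
    static Map<SenseKey, String> buildToyIndex() {
        Map<SenseKey, String> index = new HashMap<>();
        index.put(new SenseKey("live", Pos.VERB, 1), "toy-verb-live-1");
        index.put(new SenseKey("live", Pos.ADJECTIVE, 1), "toy-adj-live-1");
        index.put(new SenseKey("live", Pos.ADVERB, 1), "toy-adv-live-1");
        index.put(new SenseKey("dwell", Pos.VERB, 1), "toy-verb-dwell-1");
        return index;
    }

    public static void main(String[] args) {
        Map<SenseKey, String> index = buildToyIndex();

        // "Sense 1 of live" is ambiguous across homographs; adding the
        // POS selects exactly one entry for each side of the comparison.
        String left = index.get(new SenseKey("live", Pos.VERB, 1));
        String right = index.get(new SenseKey("dwell", Pos.VERB, 1));
        System.out.println("Comparing " + left + " with " + right);
    }
}
```

    With a key like this, "live" sense 1 as a verb and "live" sense 1 as an
    adjective are distinct entries, so the caller has to say which one is
    meant before any similarity measure is applied.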

    Regards,
    Andy

    On Fri, 23 Feb 2007, Nuno Seco wrote:

    > Take a look at:
    > http://wordnet.princeton.edu/links#extensions
    >
    > You should find a library computing similarity using Information Theoretic
    > models in both Java and Prolog.
    >
    > Cheers,
    >
    > --
    > Nuno Seco
    >
    >> -----Original Message-----
    >> From: owner-corpora@lists.uib.no
    >> [mailto:owner-corpora@lists.uib.no] On Behalf Of Eric Atwell
    >> Sent: Friday, 23 February 2007 13:53
    >> To: CORPORA@UIB.NO
    >> Cc: Stella Kleanthous
    >> Subject: [Corpora-List] wordlist-similarity tools in Java?
    >>
    >>
    >> Stella Kleanthous, Leeds PhD student, asked me for a Java
    >> tool/resource to measure the semantic similarity between two
    >> lists of (English) words.
    >> I directed her at WordNet-Similarity:
    >>
    >> http://www.d.umn.edu/~tpederse/similarity.html
    >> http://sourceforge.net/projects/wn-similarity
    >> http://search.cpan.org/dist/WordNet-Similarity/
    >>
    >> BUT this is Perl software; has anyone implemented something similar
    >> in Java (which Stella and others could integrate into Java programs)?
    >>
    >> thanks
    >>
    >>
    >> Eric Atwell,
    >> Senior Lecturer, Language research group leader, School of
    >> Computing, Faculty of Engineering, University of Leeds, LEEDS
    >> LS2 9JT, England
    >> TEL: 0113-3435430 FAX: 0113-3435468 WWW: just Google eric atwell
    >>
    >
    >


