RE: [Corpora-List] Portuguese thesaurus

From: Adam Kilgarriff (adam@lexmasterclass.com)
Date: Tue May 03 2005 - 11:46:47 MET DST

    Mark,

    Or, you could load a large Portuguese corpus into the Word Sketch Engine,
    which then automatically produces a distributional thesaurus: see
    http://sketchengine.co.uk. We have processed corpora, and produced
    thesauruses of this kind, for English, Chinese, Czech and Irish to date.
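
    For readers unfamiliar with the term, here is a minimal sketch of what a
    distributional thesaurus does: words count as near-synonyms when they
    occur in similar contexts. This is not the Sketch Engine's own
    implementation. It assumes plain window-based co-occurrence counts and
    cosine similarity, and the function names and the toy Portuguese corpus
    are hypothetical, chosen only for illustration.

```python
from collections import Counter, defaultdict
import math


def build_context_vectors(sentences, window=2):
    """Count, for each word, the words seen within `window` tokens of it."""
    vectors = defaultdict(Counter)
    for tokens in sentences:
        for i, word in enumerate(tokens):
            lo = max(0, i - window)
            hi = min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i:
                    vectors[word][tokens[j]] += 1
    return vectors


def cosine(a, b):
    """Cosine similarity between two sparse count vectors (Counters)."""
    dot = sum(a[k] * b[k] for k in a.keys() & b.keys())
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def thesaurus_entry(word, vectors, top_n=5):
    """Return the `top_n` words whose contexts are most similar to `word`."""
    target = vectors.get(word, Counter())
    scored = [
        (other, cosine(target, vec))
        for other, vec in vectors.items()
        if other != word
    ]
    scored.sort(key=lambda pair: pair[1], reverse=True)
    return scored[:top_n]


# Toy corpus (assumption: already tokenised, one sentence per list).
sentences = [
    "o gato bebe leite".split(),
    "o cão bebe água".split(),
    "o gato come peixe".split(),
    "o cão come carne".split(),
]
vectors = build_context_vectors(sentences)
print(thesaurus_entry("gato", vectors, top_n=3))
```

    On this toy corpus "cão" ranks first for "gato" because both follow "o"
    and precede "bebe" and "come". A real corpus would need far more data,
    plus lemmatisation and some weighting of the raw counts.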

    But, you may say, are distributional thesauruses as good as traditional
    ones? Needless to say, it all depends on what you want to do with them. We
    have some evidence that, for PP-attachment, for Spanish, a distributional
    thesaurus outperforms Spanish WordNet.

    Best,

                Adam

    -----Original Message-----
    From: owner-corpora@lists.uib.no [mailto:owner-corpora@lists.uib.no] On
    Behalf Of Andrew Harley
    Sent: 03 May 2005 09:21
    To: Mark Davies
    Cc: corpora@uib.no; owner-corpora@lists.uib.no
    Subject: Re: [Corpora-List] Portuguese thesaurus

    We have a Portuguese version of our Word Selector title available for
    licensing in XML form. See
    <http://www.cambridge.org/elt/elt_projectpage.asp?id=2500260> and
    <http://dictionary.cambridge.org/researchers.htm>. There would be an annual
    licence fee, the exact amount depending on whether the licensee is a single
    researcher or an institution and, in this case, on some third parties as
    well. The title is only a "mini-thesaurus" with about 10,000 words grouped
    into semantic categories, so it may well not have comprehensive enough
    coverage for your needs. Contact me directly if interested.

    Andrew Harley
    Business Systems & Electronic Product Development Manager
    English Language Teaching
    Cambridge University Press

    <http://www.cambridge.org/elt/cdrom>
    <http://dictionary.cambridge.org>
    - the web's favourite learner dictionaries

    owner-corpora@lists.uib.no wrote on 02/05/2005 23:36:01:

    > I'm looking for a machine-readable thesaurus of Portuguese. I've
    > already tried two links to Portuguese WordNet
    > (http://www.clul.ul.pt/WordNet/index.jsp,
    > http://www.instituto-camoes.pt/WordNet/index.jsp) but neither is
    > operational. I've also tried the links at
    > http://www.linguateca.pt/enciclopedias.html, but no luck.
    >
    > Thanks in advance.
    >
    > Mark Davies
    >
    > =================================================
    >
    > Mark Davies
    > Assoc. Prof., Linguistics
    > Brigham Young University
    > (phone) 801-422-9168 / (fax) 801-422-0906
    >
    > http://davies-linguistics.byu.edu
    >
    > ** Corpus design and use // Linguistic databases **
    > ** Historical linguistics // Language variation **
    > ** English, Spanish, and Portuguese **
    >
    > =================================================
    >



    This archive was generated by hypermail 2b29 : Wed May 04 2005 - 12:01:44 MET DST