Re: [Corpora-List] Portuguese thesaurus

From: Rob Freeman (lists@chaoticlanguage.com)
Date: Mon May 09 2005 - 08:48:54 MET DST

  • Next message: Knut Hofland: "[Corpora-List] Request for updated/new ICAME material"

    Hi Adam,

    On Tuesday 03 May 2005 17:46, you wrote:
    > ...We have
    > some evidence that, for PP-attachment, for Spanish, a distributional
    > thesaurus outperforms Spanish WordNet.

    That's an interesting comment...

    How do you use your "distributional thesaurus" to do PP-attachment?

    I have a parsing engine which works along what may be the same lines.

    In general I believe the approach should be more powerful than one based on
    any fixed classification (like WordNet), on the grounds that you can store
    more (and especially contradictory) grammatical/semantic information in a
    distributed representation.

    You can see a general discussion in:

    Freeman R. J., Example-based Complexity--Syntax and Semantics as the
    Production of Ad-hoc Arrangements of Examples, Proceedings of the ANLP/NAACL
    2000 Workshop on Syntactic and Semantic Complexity in Natural Language
    Processing Systems, pp. 47-50. (http://acl.ldc.upenn.edu/W/W00/W00-0108.pdf)

    The parser has so far been implemented for English, Danish, and Chinese. I
    haven't done much benchmarking against specific class-based systems, however.
    I'd be interested to see your exact results?

    Best regards,

    Rob Freeman



    This archive was generated by hypermail 2b29 : Mon May 09 2005 - 09:11:47 MET DST