Re: [Corpora-List] lexicographic tools for parallel/comparable corpora

From: tufis@racai.ro
Date: Wed Feb 07 2007 - 10:39:06 MET

  • Next message: Mihai Surdeanu: "[Corpora-List] CLEF-QA QAst: Release of the development dataset and second call for participation"

    Hi Jörg,

    Here are some references to our work in exploiting parallel corpora:

    Best regards,

    Dan

    ---------------
    Dan Tufiș: Word Senses: The Stepping Stones in Semantic-Based Natural Language
    Processing. In IFIP International Federations for Information Processing,
    Volume 204, Artificial Intelligence Applications and Innovations, eds.
    Maglogianis, I., Karpouzis, K., Bramer, M. (Boston, Springer), 2006, pp.
    575-582, ISBN 0-387-34223-0

    Alexandru Ceaușu, Dan Ștefănescu, Dan Tufiș:Acquis Communautaire sentence
    alignment using Support Vector Machines. In proceedings of the 5th LREC
    Conference, Genoa, Italy, 22-28 May, 2006, pp2134-2137 , ISBN 2-9517408-2-4,
    EAN 9782951740822

    Dan Tufiș, Radu Ion, Alexandru Ceaușu, Dan Ștefănescu: Improved Lexical
    Alignment by Combining Multiple Reified Alignments. In Proceedings of the 11th
    Conference of the European Chapter of the Association for Computational
    Linguistics (EACL2006), Trento, Italy, 3-7 April, 2006, pp. 153-160, ISBN
    1-9324-32-61-2

    Dan Tufiș, Radu Ion: Evaluating the word sense disambiguation accuracy with
    three different sense inventories. In Proceedings of the Natural Language
    Understanding and Cognitive Systems Symposium, Miami, Florida, May 2005, pp.
    118-127, ISBN 972-8865-23-6

    Dan Tufiș, Radu Ion, Nancy Ide: Fine-Grained Word Sense Disambiguation Based on
    Parallel Corpora, Word Alignment, Word Clustering and Aligned Wordnets. In
    proceedings of the 20th International Conference on Computational Linguistics,
    COLING2004, Geneva, 2004, pp. 1312-1318, ISBN 1-9324432-48-5

    Dan Tufiș. Term Translations in Parallel Corpora: Discovery and Consistency
    Check. In Proceedings of the 4th LREC Conference, Lisabona, 2004, pp. 1981-1984

    Dan Tufiș, Ana Maria Barbu, Radu Ion, Extracting Multilingual Lexicons from
    Parallel Corpora, Computers and the Humanities, Volume 38, Issue 2, May 2004,
    Pages 163 – 189 (http://dx.doi.org/10.1023/B:CHUM.0000031172.03949.48) ISSB
    0010-4817

    Dan Tufiș ”A cheap and fast way to build useful translation lexicons” in
    Proceedings of the 19th International Conference on Computational Linguistics,
    COLING2002, Taipei, 25-30 August, 2002, pp. 1030-1036, ISBN 1-55860-894

    Quoting Joerg Tiedemann <tiedeman@let.rug.nl>:

    >
    > Dear list members,
    >
    >
    > I'm looking for information about tools for the lexicographic use of
    > parallel and comparable corpora. Any links are appreciated (including
    > research tools, commercial tools and open-source alternatives). I'm also
    > interested in user experience and evaluation.
    >
    > Many thanks in advance!
    >
    >
    > Jörg Tiedemann
    >
    > ***********/\/\/\/\/\/\/\/\/\/\/\************************************
    > ** Jörg Tiedemann tiedeman@let.rug.nl **
    > ** Alfa-Informatica http://www.let.rug.nl/~tiedeman **
    > ** Rijksuniversiteit Groningen Harmoniegebouw, room 1311-429 **
    > ** Oude Kijk in 't Jatstraat 26 phone: +31 (0)50-363 5935 **
    > ** 9712 EK Groningen fax: +31 (0)50-363 6855 **
    > *************************************/\/\/\/\/\/\/\/\/\/\/\**********

    -----------------------------------------------------------------------
    This mail was sent through IMP (http://horde.org/imp/) @ valhalla.racai.ro

    -- 
    This message was scanned for spam and viruses by BitDefender.
    For more information please visit http://linux.bitdefender.com/
    



    This archive was generated by hypermail 2b29 : Wed Feb 07 2007 - 07:28:25 MET