[Corpora-List] Re: learning affix rules from wordlist

From: xuri tang (tangxuriyz@yahoo.com.cn)
Date: Thu May 11 2006 - 05:10:42 MET DST


    Hi, list members.
      Several weeks ago, I posted an inquiry about statistical learning of affix rules from a wordlist. Thanks to the kindness of Noah Smith of Johns Hopkins University, Eric Atwell of Leeds University, Peter Adolphs, Leonid Kontorovich of CMU, and some others, I was able to obtain a list of articles and other relevant information in the field. My heartfelt gratitude goes to all of them.
    Here is a summary:
      R. Wicentowski. "Multilingual Noise-Robust Supervised Morphological Analysis using the WordFrame Model." In Proceedings of Seventh Meeting of the ACL Special Interest Group on Computational Phonology (SIGPHON), pp. 70-77, 2004.
    Sharon Goldwater and David McClosky. Improving Statistical MT Through Morphological Analysis. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Vancouver, 2005.
      Antal van den Bosch and Walter Daelemans. Memory-based morphological analysis. In: Proceedings of the 37th Annual Meeting of the Association for Computational Linguistics, ACL'99, University of Maryland, USA, June 20-26, 1999, pp. 285-292. ILK pub: ILK-9909
      Leonid Kontorovich et al. 2003. A Markov Model for the Acquisition of Morphological Structure. Available at http://reports-archive.adm.cs.cmu.edu/anon/2003/CMU-CS-03-147.pdf
      The PASCAL MorphoChallenge contest results at http://www.cis.hut.fi/morphochallenge2005/results.shtml
      The following is attributed to Peter Adolphs:
    * Bosch & Daelemans (1999). A. van den Bosch & Walter Daelemans:
    "Memory-Based Morphological Analysis". Proceedings of the 37th Annual
    Meeting of the ACL. San Francisco/CA 1999: Morgan Kaufmann, 285-292.
    * Cavar et al (2006). Ćavar, Damir; Rodriguez, Paul & Schrementi,
    Giancarlo: "Unsupervised morphology induction for
    part-of-speech-tagging". In: Penn Working Papers in Linguistics:
    Proceedings of the 29th Annual Penn Linguistics Colloquium. Vol. 12.1.
    2006. pp. 29ff.
    * Creutz (2003). Mathias Creutz: "Unsupervised Segmentation of Words
    Using Prior Distributions of Morph Length and Frequency". Proceedings of
    the 41st Annual Meeting of the Association for Computational
    Linguistics, July 2003, pp. 280-287.
    * Creutz & Lagus (2002). Mathias Creutz and Krista Lagus:
    "Unsupervised Discovery of Morphemes". Morphological and Phonological
    Learning: Proceedings of the 6th Workshop of the ACL Special Interest
    Group in Computational Phonology (SIGPHON), Philadelphia, July 2002, pp.
    21-30. Association for Computational Linguistics.
    * Creutz & Lagus (2005a). Mathias Creutz and Krista Lagus:
    "Unsupervised Morpheme Segmentation and Morphology Induction from Text
    Corpora Using Morfessor 1.0". Publications in Computer and Information
    Science, Report A81, Helsinki University of Technology, March 2005.
    * Davies (2003). Mark Davies: "Annotation without lexicons: an
    alternative to the standard bootstrapping approach". In: Dawn Archer,
    Paul Rayson, Andrew Wilson and Tony McEnery (eds.): Proceedings of the
    Corpus Linguistics 2003 conference. UCREL technical paper number 16.
    UCREL, Lancaster University. pp. 583-590.
    * Federici & Pirelli (1992). Stefano Federici & Vito Pirelli (1992):
    "A Bootstrapping strategy for Lemmatisation: Learning Through Examples".
    In: Kiefer et al (1992). pp. 123-135.
    * Freitag (2005). Dayne Freitag: "Morphology Induction from Term
    Clusters". Proceedings of the Ninth Conference on Computational Natural
    Language Learning (CoNLL-2005), pp. 128-135. Ann Arbor, MI, June 2005.
    * Goldsmith (2000). Goldsmith, John. "Linguistica: An Automatic
    Morphological Analyzer". To appear in: John Boyle, Jung-Hyuck Lee, and
    Arika Okrent: Papers from the 36th Meeting of the Chicago Linguistics
    Society [CLS 36], Volume 1: The Main Session. 2000.
    * Goldsmith (2001). Goldsmith, John: "Unsupervised learning of the
    morphology of a natural language". In: Computational Linguistics Vol.
    27, Nr. 2, 2001, pp. 153-198.
    * Goldsmith et al (2005). Goldsmith, John; Hu, Yu; Matveeva, Irina &
    Sprague, Colin. A heuristic for morpheme discovery based on string edit
    distance. Technical report TR-2005-04, Department of Computer Science,
    University of Chicago.
    * Maxwell (2002). Mike Maxwell: Resources for Morphology Learning
    and Evaluation. In: Gonzalez Rodriguez, Manuel; Suarez Araujo, Carmen
    Paz (eds.): LREC 2002: Third International Conference on Language
    Resources and Evaluation, Vol. III. Paris 2002: ELRA, 967-974.
    * Novák et al (2003). Attila Novák, Viktor Nagy & Csaba Oravecz:
    "Corpus assisted development of a Hungarian morphological analyser and
    guesser". In: Dawn Archer, Paul Rayson, Andrew Wilson and Tony McEnery
    (eds.): Proceedings of the Corpus Linguistics 2003 conference. UCREL
    technical paper number 16. UCREL, Lancaster University. pp. 583-590.
    * Novák et al (2004). Attila Novák, Viktor Nagy & Csaba Oravecz:
    "Combining symbolic and statistical methods in morphological analysis
    and unknown word guessing". In: Proceedings of LREC 2004, Lisbon, 2004.
    * Oflazer et al (2001). Kemal Oflazer, Sergei Nirenburg, Marjorie
    McShane: "Bootstrapping Morphological Analyzers by Combining Human
    Elicitation and Machine Learning". Computational Linguistics 27.1, 2001,
    59-86.
    * Reichel & Weilhammer (2004). Uwe D. Reichel & Karl Weilhammer:
    "Automated Morphological Segmentation and Evaluation". In: Proceedings
    of LREC 2004, Lisbon, 2004.
    * Stroppa & Yvon (2005). Nicolas Stroppa, François Yvon: "An
    Analogical Learner for Morphological Analysis". In: Proceedings of the
    Ninth Conference on Computational Natural Language Learning
    (CoNLL-2005). Ann Arbor, Michigan, 2005: Association for Computational
    Linguistics. pp. 120-127.
    * Yarowsky & Wicentowski (2000). D. Yarowsky & R. Wicentowski:
    "Minimally Supervised Morphological Analysis by Multimodal Alignment".
    Proceedings of ACL-2000. San Francisco/CA 2000: Morgan Kaufmann, 207-216.

      Xuri Tang
       
      Wuhan University of Science and Engineering
      Wuhan, P.R. China

                    



    This archive was generated by hypermail 2b29 : Thu May 11 2006 - 05:10:12 MET DST