[Corpora-List] Language codes and spelling reform

From: Lars Aronsson (lars@aronsson.se)
Date: Fri Jan 26 2007 - 20:23:14 MET

  • Next message: Satoshi Sekine: "[Corpora-List] CFP: ACL-PASCAL Workshop on Textual Entailment and Paraphrasing"

    According to http://en.wikipedia.org/wiki/List_of_ISO_639-2_codes
    there are separate language codes for Old English (ang), Middle
    English (enm), Low German (nds), Old High German (goh), Middle
    High German (gmh) and Alemannic (gsw), in addition to today's
    English (en, eng) and German (de, deu, ger).

    But is there any systematic approach to name and identify the
    variants of German from the 19th century (Thier, illustrirte),
    20th century (Tier, illustrierte), and 1996 reform (f, sss)? Are
    there names and codes for the various historic stages of Nynorsk,
    Danish with "maae", pre-1906 Swedish with "dt", etc.?

    -- 
      Lars Aronsson (lars@aronsson.se)
      Aronsson Datateknik - http://aronsson.se
    



    This archive was generated by hypermail 2b29 : Fri Jan 26 2007 - 20:20:28 MET