[Corpora-List] Croatian Corpus

From: Anthony Weaver (aweaver@cs.sunysb.edu)
Date: Wed Apr 09 2003 - 18:13:41 MET DST

    I am working on speech recognition for Croatian and would like to know
    whether a freely available Croatian corpus exists. I am specifically
    looking for a text corpus, ideally no smaller than about 20K words.

    I would also like to know whether there are any papers discussing
    Croatian pronunciation. More specifically, a native speaker and various
    websites have told me that Croatian pronunciation is mostly unambiguous,
    but I have been unable to find any papers or research that would support
    or refute this claim. In English, a letter can have multiple
    pronunciations, but this does not seem to occur in Croatian. All help is
    greatly appreciated.

    Tony


