Re: [Corpora-List] starting a machine translation project

From: Nano Surbakti (nano.surbakti@gmail.com)
Date: Wed Sep 13 2006 - 12:58:28 MET DST

  • Next message: nadamides@aslib.com: "[Corpora-List] Conference: Translating and the Computer 28"

    Hi,

    On 9/13/06, Philipp Koehn <pkoehn@inf.ed.ac.uk> wrote:
    > While documentation is written with the assumption of foreign-English
    > translation, you may use it for any language direction. We have built
    > many MT systems with target languages other than English.
    Great..!!

    > Given the parallel corpus, about a day :)
    > Practically there will be many issues in preparing the data
    > in appropiate form etc. There may be spelling and font issues
    > with Indonesian, you may not have the data in the required
    > setence-aligned format, getting familiar with the tools may take
    > a while...
    Based on your experience, is it a minimum number of words or sentences
    in a corpus to produce a basic translation service? If the purpose is
    for daily language use, is it enough to use an English-Indonesian
    Bible as a corpus?

    >
    > Regards,
    > Philipp Koehn

    Thanks for helping,

    --
    Nano Surbakti
    



    This archive was generated by hypermail 2b29 : Wed Sep 13 2006 - 12:56:19 MET DST