RE: [Corpora-List] American and British English spelling converter

From: Harold Somers (harold.somers@manchester.ac.uk)
Date: Fri Nov 03 2006 - 11:47:29 MET

  • Next message: Martin Krallinger: "Re: [Corpora-List] American and British English spelling converter"

    It would be a grave mistake to think that the only difference between
    British and American English is a few wayward spellings. There are
    considerable and extensive lexical, grammatical and idiomatic
    differences. The 1st and 3rd of those are more or less well known, but
    the grammatical differences never cease to surprise me. I'd be
    moderately interested to see what other examples corpora listers come up
    with (though no doubt they will also remind me that there are
    significant differences in usage between American dialects, not to
    mention Canadian etc)

    To give just one example of each:

    Lift vs elevator
    Have you got vs do you have
    Half four vs 4:30

    Harold Somers

    > -----Original Message-----
    > >
    > > Martin Krallinger wrote:
    > >
    > >> Dear all,
    > >>
    > >> I was looking for some simple tool (preferable in Python) which is
    > >> able to do automatic conversion of texts (or words) from British
    > >> English (UK) to American (US) English and vice versa.
    > >> (Example: realize <-> realise)
    > >>
    > >> This seems to be an easy task, but I could not find any
    > ready to use
    > >> stand alone tool capable of performing this task.
    > >>
    > >> I want to integrate this application into an Information
    > extraction
    > >> system which handles scientific literature.
    > >>
    > >> I am also interested in references where aspects related to US/UK
    > >> English spelling has been analyzed in the context of information
    > >> extraction, text mining and terminology extraction.
    > >>
    > >> Best regards,
    > >>
    > >>
    > >> Martin
    > >>
    > >>
    > >
    > >



    This archive was generated by hypermail 2b29 : Fri Nov 03 2006 - 11:45:07 MET