[Corpora-List] Matching software

From: Nigel Bruce (njbruce@hkucc.hku.hk)
Date: Wed Dec 28 2005 - 08:56:07 MET

  • Next message: Uwe Quasthoff: "[Corpora-List] CFP: LREC2006 Workshop on "Quality assurance and quality measurement for language and speech resources""

    I am looking for software that will operate a match between input text - up
    to 2,000 words, say - and a corpus in a similar manner to Turnitin
    - except that I will build and control the relevant d-bases the programme
    will use.
    Another key difference is that whereas Turnitin searches for matches of
    strings of over 7 words between input and d-base, I'm looking for "match
    failure" of either 2 or 3 words between input and d-base.
    I guess you could say I'm looking for an engine that will give me Turnitin
    in reverse, picking up the absence of a match between input and corpus, and
    colour-coding it.
    Any suggestions appreciated.
    Nigel Bruce, Hong Kong
    .

    _________________________________________

    Nigel Bruce
    English Centre
    7/F, K.K. Leung Bdg.
    University of Hong Kong,
    Pokfulam Road,
    HONG KONG

    E-mail: njbruce@hku.hk
    http://ec.hku.hk/njbruce/
    Office Tel.: (852) 2859.2023; Fax: (852) 2547.3409



    This archive was generated by hypermail 2b29 : Wed Dec 28 2005 - 09:24:09 MET