[Corpora-List] Matching software

From: Nigel Bruce (njbruce@hkucc.hku.hk)
Date: Wed Dec 28 2005 - 08:56:07 MET

Next message: Uwe Quasthoff: "[Corpora-List] CFP: LREC2006 Workshop on "Quality assurance and quality measurement for language and speech resources""

Previous message: Kevin Duh: "[Corpora-List] Tagset mapping (Negra -> Penn Treebank)"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

I am looking for software that will operate a match between input text - up
to 2,000 words, say - and a corpus in a similar manner to Turnitin
- except that I will build and control the relevant d-bases the programme
will use.
Another key difference is that whereas Turnitin searches for matches of
strings of over 7 words between input and d-base, I'm looking for "match
failure" of either 2 or 3 words between input and d-base.
I guess you could say I'm looking for an engine that will give me Turnitin
in reverse, picking up the absence of a match between input and corpus, and
colour-coding it.
Any suggestions appreciated.
Nigel Bruce, Hong Kong
.

_________________________________________

Nigel Bruce
English Centre
7/F, K.K. Leung Bdg.
University of Hong Kong,
Pokfulam Road,
HONG KONG

E-mail: njbruce@hku.hk
http://ec.hku.hk/njbruce/
Office Tel.: (852) 2859.2023; Fax: (852) 2547.3409

Next message: Uwe Quasthoff: "[Corpora-List] CFP: LREC2006 Workshop on "Quality assurance and quality measurement for language and speech resources""
Previous message: Kevin Duh: "[Corpora-List] Tagset mapping (Negra -> Penn Treebank)"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

This archive was generated by hypermail 2b29 : Wed Dec 28 2005 - 09:24:09 MET