Re: Corpora: Corpus Linguistics User Needs

Torbjörn Lager (lager@ling.gu.se)
Thu, 30 Jul 1998 13:35:14 +0200

Oliver & Ylva,

You might also want to have a look at my thesis:

http://www.ling.gu.se/~lager/taglog.html

Here's a part of the abstract:

"The purpose of this thesis is to build a corpus theory development
environment -- to discuss its design, use, and implementation. The
proposed system is based on a logical approach to computational corpus
linguistics where sentences of logic are used to express statements
about texts and logical inference is used to manipulate these sentences
in order to analyse the texts.

The thesis demonstrates the remarkable ease with which the
functionalities needed in a corpus system can be implemented when based
upon adequate means of representing, querying, and reasoning. The
proposed system implements hand coding, searching, concordancing,
parsing, counting, tabling, collocating, automatic part-of-speech
tagging, lemmatizing, excerpting, interpreting, treebanking,
explanation, and various kinds of learning.

By linking all this functionality into a common representational
framework characterised by high expressive power, declarativity, and
explicit reasoning strategies, and by embedding the whole concept in a
particular philosophical and methodological context, including an
ontology of text, an analysis of the notion of theory, an explication of
the notion of truth, and other foundational issues, we arrive at an
interactive system which is multi-functional and general, yet simple,
consistent, and highly usable."

Best regards,
Torbjörn

Torbjörn Lager
Dept. of Linguistics
Uppsala University