Re: Corpora: Corpus Linguistics User Needs

Henning Reetz (Henning.Reetz@uni-konstanz.de)
Wed, 29 Jul 1998 12:50:41 +0200

in reply to Geoffrey Sampson:

1)
Writing a program is one thing. Testing it and proving its correctness is
another. Even for simple statistical problems I prefer to use
standard statistical packages because I expect their algorithms to be
better tested than my own code (but I always compare their results with
examples from textbooks; if the two disagree, I compute the problem on the
example data by hand, and I have found bugs more often in the textbooks than
in the programs). As an experienced programmer who has written many thousands
of lines of code, I prefer to use standard software.
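
A minimal sketch of this kind of comparison, assuming Python and its standard
`statistics` module as the "standard package". The data set is a made-up
illustration, chosen so that the hand computation is easy to follow; it is not
taken from any particular textbook.

```python
import math
import statistics

# Illustrative data set (an assumption, small enough to check by hand).
data = [2, 4, 4, 4, 5, 5, 7, 9]

# Hand computation, written out step by step.
mean_by_hand = sum(data) / len(data)                     # 40 / 8
squared_devs = [(x - mean_by_hand) ** 2 for x in data]   # sums to 32
pstdev_by_hand = math.sqrt(sum(squared_devs) / len(data))  # sqrt(32 / 8)

# The same quantities from the standard library.
mean_pkg = statistics.mean(data)
pstdev_pkg = statistics.pstdev(data)  # population standard deviation

# If these disagree, one of the two computations has a bug.
assert math.isclose(mean_pkg, mean_by_hand)
assert math.isclose(pstdev_pkg, pstdev_by_hand)
```

If a textbook printed a different value for the same data, the hand
computation is what decides which side is wrong.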

2)
I don't have to be a car mechanic to drive a car. Why should I have to be a
programmer to use a corpus? But as a driver I do have to know what petrol
my car takes, how good the brakes are, etc. As a user of a program, I
cannot simply trust the program but have to be aware of its bugs and
problems. I think it is good policy to test a function by hand on a small
data set and to do cross-checks and plausibility tests on large data sets.
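
A rough sketch of that policy in Python, under stated assumptions:
`word_frequencies` is a hypothetical stand-in for whatever corpus function is
being tested, and the "large" data set is randomly generated for illustration
rather than drawn from a real corpus.

```python
import random
from collections import Counter

def word_frequencies(tokens):
    """Hypothetical function under test: count occurrences of each token."""
    return Counter(tokens)

# 1) Small data set, counted by hand beforehand.
small = "the cat saw the dog".split()
hand_counted = {"the": 2, "cat": 1, "saw": 1, "dog": 1}
assert dict(word_frequencies(small)) == hand_counted

# 2) Large data set: a hand count is not feasible, so check plausibility.
random.seed(0)  # fixed seed so the run is repeatable
vocabulary = ["the", "cat", "saw", "dog", "a"]
large = [random.choice(vocabulary) for _ in range(100_000)]
freqs = word_frequencies(large)

assert sum(freqs.values()) == len(large)   # cross-check: totals must match
assert set(freqs) <= set(vocabulary)       # no word types appear from nowhere
assert all(count > 0 for count in freqs.values())  # no zero or negative counts
```

The plausibility checks do not prove the function correct; they only catch
results that cannot possibly be right.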

3)
Why re-invent the wheel?

--
Henning Reetz
Allgemeine Sprachwissenschaft
Universitaet Konstanz
Fach D186, D-78457 Konstanz
Phone:(49)7531-882928, Fax:883095

Anything that is good and useful is made of chocolate.