The ability to handle (ignore?) structure tags, such as title, body of
document, etc., is not a big deal, as I think that they would be easy
to strip out in a preprocessing step.
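
To illustrate the kind of preprocessing step I have in mind, here is a
minimal sketch in Python. It assumes the structure tags are SGML-style
angle-bracket markup such as <title> and <body>; that format is only an
assumption for the example, and the names TAG_RE and strip_structure_tags
are hypothetical. A corpus with a different tagging convention would need
a different pattern.

```python
import re

# Assumed tag shape: <name ...> or </name>. This is an illustrative
# pattern, not a full SGML parser (it ignores comments, entities, etc.).
TAG_RE = re.compile(r"</?[A-Za-z][^>]*>")

def strip_structure_tags(text: str) -> str:
    """Replace each tag with a space, then collapse runs of whitespace."""
    without_tags = TAG_RE.sub(" ", text)
    return " ".join(without_tags.split())

sample = "<title>Cheap parsers</title><body>Which parser works on a PC?</body>"
print(strip_structure_tags(sample))
```

Each tag is replaced with a space rather than deleted outright so that
words on either side of a tag boundary do not get run together before the
text reaches the parser.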
I realize that there have been a few relevant postings to this mailing list
of late -- a list of parsers was posted in April 1996 in connection with
investigations into their use on PCs under Windows, and Miles Osborne
posted a response suggesting the Brill parser. Unfortunately, other
responses were emailed to the original poster, but no copy was sent
to corpora and no summary was posted.
It seems that the topic of "what is a good cheap parser" comes up
periodically, so I would like to volunteer to gather people's
experiences -- good and bad -- and then post them.
Email your thoughts to me if you prefer (to save bandwidth) -- I will
post a summary of responses that I receive via email. If you prefer to
have your comments summarized anonymously, please indicate this
in your email.
Thanks.
Ray Liere
Department of Computer Science
Oregon State University, Corvallis, Oregon, USA
lierer@mail.cs.orst.edu