[Corpora-List] RE: Web/Corpora Questions

From: peetm (peet.morris@comlab.ox.ac.uk)
Date: Tue Sep 14 2004 - 12:09:27 MET DST

Next message: A.DeRoeck: "RE: [Corpora-List] corpus homogeneity"

Previous message: Adam Kilgarriff: "RE: [Corpora-List] corpus homogeneity"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

Hi - sorry if this isn't exactly corpora-specific, but, I've a few questions
that I think members of this list might be able to help me with.

1. I'm looking for any articles/information on the application of
'grammar-analysis' to determine text-type/genre/style/register/other [delete
as appropriate!]

2. Are there any machine-readable lexicons/databases that contain
information like:

o - Indication of writer's/reader's 'Reading Age' (correct term?)

o - Common Synonyms

o - Common Misspellings

o - Rarity/Density (e.g., *this* word is used infrequently)

o - I know the last of these is typically ascertained through
corpus analysis, but I thought I'd ask anyway!

3. Does the WordNet database exist in any 'popular' format, e.g., Oracle,
SQL-Server, Access?

peetm

Next message: A.DeRoeck: "RE: [Corpora-List] corpus homogeneity"
Previous message: Adam Kilgarriff: "RE: [Corpora-List] corpus homogeneity"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

This archive was generated by hypermail 2b29 : Tue Sep 14 2004 - 12:03:56 MET DST