[Corpora-List] ERRATUM: ELRA - Language Resources Catalogue - Update

From: ELDA (info@elda.org)
Date: Tue Jun 20 2006 - 14:54:57 MET DST

    ERRATUM: An earlier posting of this announcement, sent today, had a
    faulty layout. This posting uses a corrected layout. Please discard
    the previous posting. We apologise for any inconvenience this may have
    caused you.

    Our apologies if you have received multiple copies of this announcement.

    *******************************************************************
    ELRA - Language Resources Catalogue - Update
    *******************************************************************
    We are happy to announce that new Text and Speech Language Resources are
    now available in our catalogue.
    To view all the Language Resources available, you can visit our on-line
    catalogue: http://catalog.elda.org/index.php?language=en

    *** L0067 English lexicon with morphological information ***
    This English lexicon is made up of 174,000 inflected forms corresponding
    to 68,000 simple word lemmas (including 31,900 nouns, 11,800 verbs,
    19,900 adjectives, 4,100 adverbs, 300 pronouns, articles,
    prepositions/postpositions and conjunctions). Each line in the resource
    file shows an inflected form, its part of speech, its related lemma and
    its morphological information.
    For more information, see
    http://catalog.elda.org:8080/product_info.php?products_id=867&osCsid=0a57b78fd3504ecf1c75825782d061de
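
    As an illustration of the one-entry-per-line layout described above, here
    is a minimal Python sketch for reading such a file. The announcement does
    not specify the delimiter or the field order, so the tab separator, the
    order (inflected form, part of speech, lemma, morphological information),
    the names LexiconEntry and parse_lexicon, and the sample lines are all
    assumptions made for illustration, not the actual ELRA file format.

```python
from typing import Iterable, Iterator, NamedTuple


class LexiconEntry(NamedTuple):
    """One lexicon line; field names and order are assumed, not documented."""
    form: str   # inflected form
    pos: str    # part of speech
    lemma: str  # related lemma
    morph: str  # morphological information


def parse_lexicon(lines: Iterable[str], sep: str = "\t") -> Iterator[LexiconEntry]:
    """Yield one LexiconEntry per non-blank line, assuming `sep`-separated fields."""
    for number, raw in enumerate(lines, start=1):
        line = raw.rstrip("\r\n")
        if not line.strip():
            continue  # skip blank lines
        fields = line.split(sep, 3)
        if len(fields) != 4:
            raise ValueError(f"line {number}: expected 4 fields, got {len(fields)}")
        yield LexiconEntry(*fields)


# Hypothetical sample lines, invented for this sketch (not taken from the resource).
sample = ["cats\tN\tcat\tplural\n", "\n", "ran\tV\trun\tpast\n"]
entries = list(parse_lexicon(sample))
```

    If the distributed files use a different separator or column order, only
    the sep argument and the field order in LexiconEntry would need to change.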

    *** L0068 French lexicon with morphological information ***
    This French lexicon is made up of 424,000 inflected forms corresponding
    to 55,000 simple word lemmas (including 34,400 nouns, 7,300 verbs,
    11,700 adjectives, 1,400 adverbs, 200 pronouns, articles,
    prepositions/postpositions and conjunctions). Each line in the resource
    file shows an inflected form, its part of speech, its related lemma and
    its morphological information.
    For more information, see
    http://catalog.elda.org:8080/product_info.php?products_id=868&osCsid=0a57b78fd3504ecf1c75825782d061de

    *** L0069 Italian lexicon with morphological information ***
    This Italian lexicon is made up of 862,500 inflected forms corresponding
    to 112,000 simple word lemmas (including 66,340 nouns, 12,030 verbs,
    28,080 adjectives, 4,890 adverbs, 660 pronouns, articles,
    prepositions/postpositions and conjunctions). Each line in the resource
    file shows an inflected form, its part of speech, its related lemma and
    its morphological information.
    For more information, see
    http://catalog.elda.org:8080/product_info.php?products_id=869&osCsid=0a57b78fd3504ecf1c75825782d061de

    *** L0070 Italian lexicon with morphological information and clitic
    verbs ***
    This Italian lexicon is the same as the one described in ELRA-L0069, but
    with the addition of clitic verbs, which increases the number of
    inflected forms to 1,800,000 (still corresponding to 112,000 simple
    word lemmas). It contains 66,340 nouns, 12,030 verbs, 28,080
    adjectives, 4,890 adverbs, 660 pronouns, articles,
    prepositions/postpositions and conjunctions. Each line in the resource
    file shows an inflected form, its part of speech, its related lemma and
    its morphological information.
    For more information, see
    http://catalog.elda.org:8080/product_info.php?products_id=870&osCsid=0a57b78fd3504ecf1c75825782d061de

    *** L0071 Spanish lexicon with morphological information ***
    This Spanish lexicon is made up of 816,000 inflected forms corresponding
    to 104,000 simple word lemmas (including 52,000 nouns, 9,800 verbs,
    21,200 adjectives, 20,500 adverbs, 500 pronouns, articles,
    prepositions/postpositions and conjunctions). Each line in the resource
    file shows an inflected form, its part of speech, its related lemma and
    its morphological information.
    For more information, see
    http://catalog.elda.org:8080/product_info.php?products_id=871&osCsid=0a57b78fd3504ecf1c75825782d061de

    *** S0217 BITS Logatome Synthesis Corpus - BITS-LG ***
    This corpus contains 11,036 recordings of logatomes spoken by 4
    professional German speakers covering all German diphone combinations as
    well as the most prominent German-French-English combinations. Each
    logatome was recorded in three channels: close microphone, large
    membrane microphone and laryngographic signal. All diphones are
    segmented and labelled into phonemic units.
    For more information, see
    http://catalog.elda.org:8080/product_info.php?products_id=866&osCsid=0a57b78fd3504ecf1c75825782d061de

    For more information on the catalogue, please contact Valérie Mapelli
    mailto:mapelli@elda.org


