Corpora: Thank you-Results-Spanish Examples

From: Linda Bawcom (
Date: Thu Sep 27 2001 - 14:51:01 MET DST

  • Next message: Stina Karlsson: "Corpora: Compounds: Thank you!"

    First in English then in Spanish

    Dear colleagues:

    I would like to thank the people listed below for taking time out from their busy schedules to give me on-line addresses of Spanish corpra (which I will share with my Spanish teaching colleagues) and to send me examples of strings which I had requested. For those of you who may be interested I received the following information. If I have forgotten anyone, my sincerest apologies.
    Best wishes,

     Estimados colegas:
     Me gustaría dar las gracias a los siguientes por haber tomado el tiempo de sus apretadas agendas para dar me los 'on-line' direcciones de corpra en español (lo cual me compartiré con mis colegas que enseñan español) y para enviarme ejemplos que yo había pedido. Si estas interesado, recibe la siguiente información. Si he olvide alguien, mis mas sinceros disculpas.



    ON-LINE ADDRESSES Antoinette Renouf Bill Fletcher
    From Mark Davies I received the following:

    Feel free to take a look at, which is a
    small three million word sample corpus that shows some of the functionality
    that will be part of the 100 million word corpus of Spanish that I'm
    creating as part of an NEH-funded project.

    One of the nice things about this corpus is that for any one of 30,000
    different lemma, you can input the lemma and it will show you all of the
    synonyms in the corpus, and these can be limited by frequency (e.g. all of
    the synonyms of "triunfar" that occur at least 5 times in the 1900s). You
    can also search by part of speech (35 different categories) as well as by
    lemma (e.g. all forms of a given verb). In the case of your query, for
    example, you could look for < haber.* que > and it will give you the
    construction in all of the different tenses. Knut Hofland Knut Hofland Eliska Bokova and the person who maintains it is
    Antonio Ruiz Tinoco Constantin Orasan

    From Aquilino Sánchez Pérez I received the following information which I'll (try) to translate.

    Te aconsejo que adquiers el Gran Diccionario de Uso del Español Actual, de
    la editorial SGEL s.a. Con él te regalarán un corpus del español de 2
    millones de palabras, con el gestor incluido. En el corpus Cumbre, de 20 millones de palabras, TAL VEZ y QUIZÁ(s) ofrecen
    datos muy abundantes, como verás en los ficheros que te adjunto.

    I suggest you get thel Gran Diccionario de Uso del Español, from the publishers SGEL S.A. With this you will receive a Spanish corpus of 2 million words free of charge, including the program (I'm not sure I've translated that right). In the Cumbre corpus of 20 million words TAL VEZ y Quizá give abundant results, as you will see in the files which I have attached.

    Susana Sotillo offered to look in her corpus and Dr. Leonel Ruis who wrote the following:
    Tengo un corpus escrito de la prensa y escolares cubanos. Si desea, la podemos ayudar al respecto.

    In addition, Rachel Virgínia Xavier Aires has a Portugese corpus for those of you are interested.

    Again, thank you all very much for your response.

    This archive was generated by hypermail 2b29 : Thu Sep 27 2001 - 14:40:35 MET DST