Re: Corpora: Portugese Corpora

Tony Berber Sardinha (tony4@uol.com.br)
Thu, 22 Oct 1998 09:30:45 -0200

Hi,

Here's a few references:

(a)
Listings:
A catalogue of NLP resources for Portuguese, listing
corpora, dictionaries, terminological databases, tools and other possible
pointers of interest: http://www.oslo.sintef.no/portug/recursos.html
Diana Santos and Signe Oksefjell
projecto@informatics.sintef.no

(b)
Research Groups & contacts:
CENTRIA: Nuno Miguel Cavalheiro Marques (nmm@fct.unl.pt) or Gabriel P.
Lopes (gpl@di.fct.unl.pt)
GETA: Paltonio Daun Fraga (paltonio@uol.com.br)
NILC: Maria das Graças Volpe Nunes (mdgvnune@icmsc.sc.usp.br)
Tycho-Brahe Corpus: Charlotte Galves (galvesc@iel.unicamp.br)
Processamento Computacional do Português: Diana Santos
(Diana.Santos@informatics.sintef.no)
LAEL: Tony Berber Sardinha (tony4@uol.com.br)

(c)
Sources of textual material & corpora:
(i) 'Folha de S.Paulo' newspaper: 4 annual CDROMs with full text
(publifolha@uol.com.br) (www.publifolha.com.br)
(ii) Corpus Borba-Ramsay Corpus. European Corpus Initiative. Multilingual
Corpus 1. HRCR, University of Edinburgh, and ISSCO, University of Geneva.
(iii) CBMP 'Corpus of Brazilian Media Portuguese' (no longer available
online...)
(iv) 'Exame' magazine: 1 CDROM with full 1994-1995 issues
(exame@email.abril.com.br) *email may be outdated
Post: Editora Abril, R do Curtume 585 6o. andar, 05065-001 Sao Paulo SP,
Brazil
(v) 'Almanaque Abril' electronic encyclopedia: Post: Editora Abril, R do
Curtume 585 6o. andar, 05065-001 Sao Paulo SP, Brazil
(vi) 'Encarta 98' encyclopedia Brazilian edition: www.microsoft.com

Hope this helps.

tony.
----------------------------------------------------------------------------
-------------------------------
Dr Tony Berber Sardinha
Catholic University of Sao Paulo, Brazil
tony4@uol.com.br
http://sites.uol.com.br/tony4/homepage.html
http://homepages.infoseek.com/~corpuslinguistics/homepage.html
----------------------------------------------------------------------------
-------------------------------

----------
> From: Siemund, Rainer <siemund@acn.be.philips.com>
> To: 'CORPORA@HD.UIB.NO'
> Subject: Corpora: Italian and Portugese Corpora
> Date: 21 October 1998 14:18
>
> Dear list members,
> I am looking for corpora of both spoken and written Italian and
Portugese. I
> had a look at the archive of postings to this list and found a few
queries
> relating to the two languages, but no summaries. I also had a look at the
> usual suspects, i.e. LDC, ELRA and the OTA, which do have some material.
> Does anyone know of other resources somewhere out there?
> Thanks in advance,
> Rainer
>
> ___________________________________________________________
>
> Rainer Siemund Tel: +49
> (0)241-88 71-392
> Philips Speech Processing Fax: +49
> (0)241-88 71-141
> Language Resources E-Mail:
> siemund@acn.be.philips.com
> Kackertstr. 10
> http:\\www.speech.philips.com
> D-52072 Aachen
> ___________________________________________________________
>
>