Re: [Corpora-List] Query about corpora of spoken English

From: Dr Wendy Anderson (W.Anderson@englang.arts.gla.ac.uk)
Date: Mon Dec 05 2005 - 10:21:15 MET

    Dear all,
    I had sent a similar message to Nicolas Ballier, but since there seems to be
    some interest in aligning speech files and transcriptions I thought I should
    post to the list too.
    The SCOTS corpus, at the University of Glasgow, contains texts in spoken
    Scottish English, as well as written texts. Our website is at:
    www.scottishcorpus.ac.uk. At present we have about 50 spoken documents (and
    growing), ranging from Scottish Standard English to dialects of the Scots
    tongue. The corpus is freely available and can be searched online (or files
    downloaded if you prefer).
    The sound (or video) files are aligned with an orthographic transcription,
    which enables the user to click on a word and go directly to that utterance
    in the sound file, or, vice versa, go to a point in the audio file and
    scroll directly to the equivalent part of the transcription. We use Praat to
    make time-aligned transcriptions.
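
    For readers unfamiliar with time-aligned transcription, the two-way lookup
    described above (word to audio position, audio position to transcript
    segment) can be sketched roughly as follows. This is only an illustration
    and not the SCOTS implementation: the Interval class, the function names
    and the sample tier are all hypothetical, and the sketch assumes the
    alignment has already been exported as a sorted list of
    (start, end, text) intervals, much like an interval tier in Praat.

```python
from bisect import bisect_right
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Interval:
    """One time-aligned stretch of transcript (times in seconds)."""
    start: float
    end: float
    text: str


# Hypothetical sample tier; the text is made up, not taken from SCOTS.
TIER: List[Interval] = [
    Interval(0.00, 1.20, "it was a good day"),
    Interval(1.20, 1.85, ""),  # silence between utterances
    Interval(1.85, 3.40, "we went down to the shore"),
]


def time_for_word(tier: List[Interval], word: str) -> Optional[float]:
    """Return the start time of the first interval containing `word`."""
    for interval in tier:
        if word in interval.text.split():
            return interval.start
    return None


def interval_at(tier: List[Interval], t: float) -> Optional[Interval]:
    """Return the interval covering time `t`, assuming `tier` is sorted."""
    starts = [interval.start for interval in tier]
    i = bisect_right(starts, t) - 1  # last interval starting at or before t
    if i >= 0 and t < tier[i].end:
        return tier[i]
    return None


if __name__ == "__main__":
    print(time_for_word(TIER, "shore"))  # where to seek in the audio
    print(interval_at(TIER, 0.5))        # which segment to scroll to
```

    Clicking a word corresponds to time_for_word (seek the audio to the
    returned start time); moving to a point in the audio corresponds to
    interval_at (scroll the transcript to the returned segment).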
    Please do get in touch if you would like to know more. We gave a paper on
    this subject at this summer's Corpus Linguistics 2005 - it doesn't seem to
    be available online yet, but I can send a copy to anyone who is interested.

    regards
    Wendy Anderson
    ....................................
    Dr Wendy J. Anderson
    Research Assistant
    Scottish Corpus of Texts and Speech
    Department of English Language
    University of Glasgow
    12 University Gardens
    Glasgow
    G12 8QQ
    Scotland, UK

    Website: http://www.scottishcorpus.ac.uk
    ----- Original Message -----
    From: "joshua raclaw" <Joshua.Raclaw@colorado.edu>
    To: <R.M.Salkie@bton.ac.uk>
    Cc: <CORPORA@UIB.NO>
    Sent: Friday, December 02, 2005 4:02 PM
    Subject: Re: [Corpora-List] Query about corpora of spoken English

    > I'm not currently aware of any collection of spoken English corpora like
    > that - if you could, please send any responses to the list and to Nicolas.
    >
    > Joshua


