Resources

EEE_181 (1)

Speech corpora

The following corpora were developed as part of funded projects and are being made freely available to other researchers (for non-commercial purposes), but without further support. Please contact me for further information.

UCL Speaker Database: A corpus containing audio recordings of a wide range of speech materials (words, sentences, read texts, picture descriptions) for 45 speakers of South-Eastern British English.

LUCID: A corpus containing DiapixUK conversations in good and challenging conditions, read sentences and picture naming for 40 native southern British English speakers.

kidLUCID: A corpus  containing DiapixUK conversations in good and challenging conditions for 96 children and adolescents aged between 9 and 14 (46M, 50F, mean age: 11;8 years, range 9;0 to 15;0 years) who were native southern British English speakers.

elderLUCID: A corpus containing Diapix UK conversations and read BKB sentences produced in good and challenging conditions for 83 Southern British English adult talkers. These included ‘older adults’ (OA) between 64-84 years of age (N=57; 30 F) and ‘younger adults’ (YA) between 19-26 years of age (N=26, 15 F).  OA participants were further subdivided into two groups according to their hearing status. OANH participants (N=27; 14F) had normal hearing while OAHL participants (N=30; 16F) had a mild acquired hearing loss.

Test materials

DiapixUK picture materials: Picture materials used for the elicitation of spontaneous speech dialogues between two speakers in a collaborative task (see Baker and Hazan 2011 BRM). Examples of recordings made using Diapix can be found here

Short demo of how Diapix is used in our research