Corpus of European Portuguese Stops
Introduction
A corpus of 54 European Portuguese real words containing /p, b, t, d, k, g/ was
designed to study the temporal and spectral characteristics of stop consonants.
Corpus 1 and 2
The corpus contained an equal number (eighteen) of words with stops in: initial
position, followed by the vowels /a/, /i/ and /u/; medial position, preceded by the
vowels /a/, /i/ and /u/ and followed by the vowel /ɐ/; final position, preceded
by the vowels /ɔ/ and /a/. The words were produced without any context
(Corpus 1) and within the frame sentence "Diga,... por favor." (Corpus 2).
Recording Method
The subjects were six native speakers of European Portuguese, three men LJ, HR
and PA (aged: 25 to 34) and three women ML, IM and SC (aged: 24 to 42), all
without any speech, language or hearing problems. Subjects LJ, HR, ML and IM
are from the city of Aveiro (at the centre of Portugal) and Subjects PA and SC
are from Porto (in the north of Portugal).
The corpus was recorded using a Philips SBC ME 400 unidirectional condenser
microphone located 20 cm in front of the subject's mouth. A laryngograph signal (Lx)
was also collected using a laryngograph processor (model EG-PC3 produced by
Tiger DRS, Inc., USA). The acoustic and Lx signals were pre-amplified
(Rane MS 1-b) and recorded with a Sony PCM-R300 DAT recorder, each
with 16 bits and a sampling frequency of 48 kHz.
Segmentation and Annotation Method
The corpus was manually segmented with Adobe Audition. The words were then
analyzed using Speech Filling System (SFS) and the following acoustic events were
annotated:
-
The beginning of preceding vowel (IV1) was defined as the instant in time at which
the second formant intensity becomes characteristic for a vowel.
-
The end of preceding vowel and beginning of closure (IO) was marked where the
second formant was no longer visible.
-
The beginning of prevoicing (IPV) was defined as the instant in time at which
evidence of vocal fold vibration could be observed both in the acoustic and Lx signals.
-
The voice offset (FV) was marked at the point where the vocal folds ceased to
vibrate, as can be seen in the Lx signal in Figure 1 (the periodic signal ends).
-
The end of prevoicing (FPV) was defined as the instant in time where the burst started.
-
The end of closure and the beginning of the release (IR) was defined by a sudden
peak in the waveform and as a vertical bar in the spectrogram. When there were
multiple bursts, the one with the highest intensity was chosen.
-
The end of release and beginning of following vowel (FR) was marked where the
second formant amplitude reached typical values for a vowel.
-
The end of the following vowel (FV2) was set where the second formant was no
longer visible.
In the annotation files we also registered the position in word (initial (0), medial (1)
and final (2)) and the type of voicing according to criteria defined by
Jesus and Shadle (2002):
-
When the duration of voicing was absent, the voiceless stops (0).
-
When voicing was present during < 1/3 of the closure interval, the stop was
classified as devoiced (1).
-
When voicing lasted from 1/3 to 1/2 of the closure duration, the stop was classified
as partially devoiced (2).
-
When the duration of voicing was > 1/2 of the closure duration, the stop was
classified as voiced (3).
Corpora Documentation and Distribution
Complete listing of all recorded material can be found in the
M.Sc. Thesis by Lousada (2006).
The corpora can be stored in one CD-ROM that contains the following data:
-
The .wav files contain the acoustic speech signal in the left channel and the
laryngograph signal on the right channel, recorded at 16 bits, with a sampling
frequency of 48 kHz.
-
The files with the extension .seg are annotation files. In Corpus 2 there are
10 time values (in seconds) referring to:
-
The beginning of preceding vowel
- The end of preceding vowel and beginning of closure
- The beginning of prevoicing (IPV)
- The voice offset (FV)
- The end of prevoicing (FPV)
- The end of closure and the beginning of the release (IR)
- The end of release and beginning of following vowel (FR)
- The end of the following vowel (FV2)
- Position in word
- Type of voicing
- The .ipa files contain a SAMPA phonetic transcription of the speech segments
contained in the .wav with the same name.
When the annotations points 1 to 8 were not observed their value was set to zero.
Last updated 25/6/2007
lmtj@ua.pt
Luis Miguel Teixeira de Jesus
Back to Homepage