Calibration Method

Introduction

A 94dB, 1000Hz calibration tone produced by a Bruel & Kjaer 4620 calibrator was recorded on the same tape on which speech was recorded, with the amplification varied by a known amount (see Tables 1 and 2).

Table 1: Recordings' settings. Speakers LMTJ, CFGA, ACC and ISSS.
DateSpeakerDAT Rec. Level Input GainOutput Gain
SpeechToneSpeechTone
6/11/1998LMTJ6 20102010
25/1/1999LMTJNot registered 20102020
22/6/1999ACC6 20102010
19/11/1999CFGA3.5 20103010
19/11/1999ISSS5 20102010

Table 2: Recordings' settings. Speakers PS and RS.
DateSpeakerLanguage DAT Rec. LevelInput Gain Output Gain
SpeechToneSpeechTone
17/11/2000RSEnglish4 20102010
17/11/2000RSPortuguese4 20102010
17/11/2000PSEnglish4 20102010
17/11/2000PSPortuguese4 20102010

To obtain an absolute spectral amplitude we will start by calculating a factor A1 which, when added to the internal arbitrary amplitude of the recorded calibration tone, makes the sum equal to the known amplitude of the calibration tone:

A1 = 94.1 - 20log(Yarb(1000)) (dB)

where Yarb(1000) is the arbitrary internal amplitude of the Fourier transform at 1kHz of the calibration tone. We will also have to calculate a second A2 that will be equal to the difference in amplification for the tone and speech:

A2 = Gcal - Gsp (dB)

where Gcal is the gain applied when the calibration signal was recorded, and Gsp is the gain applied when the speech signal was recorded. Therefore the absolute spectral amplitude of the speech signal Xarb(1000) is given by

Xabs = 20log(Xarb(1000)) + A1 + A2 (dB)

The spectra shown in the thesis by Jesus(2001) do not present an absolute amplitude. We are currently working on a method that uses the calibration signal to calculate an absolute spectral amplitude that will be referred to a 1Hz interval and will thus allow comparison regardless of window lengths and averaging techniques.

The power spectrum (energy) of the speech signal is defined as:

E=∫|x(t)|²dt=∫|X(f)|²df

If we increase the number of points in x(t) (i.e. the size of the window) the value of the integral (area delimited by the function) also increases. Therefore, the window length used to calculate the power spectra affects the overall amplitude. All else being equal, the larger the size of the window the higher is the overall amplitude.

We used the same window size to calculate the power spectra of ambient noise, sustained fricatives, fricatives in nonsense words and real words. We used a larger number of windows to calculate the averaged power spectrum of a longer segment of signal (ambient noise and sustained fricatives). This allowed us to compare spectral amplitudes of Corpus 1a, 1b, 2, 3 and 4, for a given recording session.


Last updated 25/6/2007
lmtj@ua.pt
Luis Miguel Teixeira de Jesus

Back to Homepage