This node determines an optimum gain for how much of the power spectrum of estimated noise is to be removed from a power spectrum including signals + noise. Further, it outputs the probability of speech presence (See Section 6.3.7). However, this node constantly outputs 1 as the probability of speech presence. It outputs the difference between a separated sound’s power spectrum and the estimated noise’s power spectrum.
No files are required.
When to use
It is used when performing noise estimation with the HRLE node.
Typical connection
Figure 6.38 shows a connection example of CalcSpecSubGain . Inputs are power spectra after separation with GHDSS and those of the noise estimated in HRLE . Its outputs connect VOICE_PROB and GAIN to SpectralGainFilter .
Input
Map<int, ObjectRef> type. Vector<float> type data pair of a sound source ID and a power spectrum of the separated sound.
Map<int, ObjectRef> type. Vector<float> type data pair of a sound source ID and a power spectrum of the estimated sound.
Output
Map<int, ObjectRef> type. Vector<float> type data pair of a sound source ID and a power spectrum of the probability of speech presence.
Map<int, ObjectRef> type. Vector<float> type data pair of a sound source ID and a power spectrum of the optimum sound.
Map<int, ObjectRef> type. Vector<float> type data pair of the sound source ID and the power spectrum of the separated sound with the estimated noise deducted.
Parameter
No prameters.
This node determines an optimum gain for how much of the estimated noise’s power spectrum of the estimated noise is to be removed when a noise power spectrum is removed from a power spectrum of signals + noise. It also outputs the probability of speech presence. (See Section 6.3.7.) However, this node constantly outputs 1 as the probability of speech presence. It outputs the difference of the power spectrum of the separated sound and that of the estimated noise. Assuming that the power spectrum from which noise was is , the power spectrum of the separated sound is and that of the noise estimated is , the output from OUTPUT_POWER_SPEC is expressed as follows.
(29) |
Here, indicates an analysis frame number. indicates a frequency index. The optimum gain is expressed as follows.
(30) |
When processing simply with , power can become negative. The purpose of this node is to calculate a gain for removing power spectra of noise beforehand so that power cannot be negative, since it might become difficult to treat such a power spectrum in subsequent processing.