This node generates a mask value for dynamic logarithmic power, which is a kind of acoustic feature. The generated mask value is added as an extra element to the mask vector given as input.
No files are required.
When to use
Use this node when performing speech recognition based on missing feature theory, in which features are masked according to their reliability. It is usually connected after DeltaMask.
Typical connection
Input
Map<int, ObjectRef> type. Each entry pairs a sound source ID with the mask vector of the feature, stored as Vector<float> type data. Each mask value is a real number from 0.0 to 1.0, where 0.0 indicates that the feature is not reliable and 1.0 indicates that it is reliable.
Output
Map<int, ObjectRef> type. Each entry pairs a sound source ID with the mask vector of the feature, stored as Vector<float> type data. Each mask value is a real number from 0.0 to 1.0, where 0.0 indicates that the feature is not reliable and 1.0 indicates that it is reliable. The output dimension is one more than the input dimension.
Parameter
This node generates a mask value for the dynamic logarithmic power, which is one of the acoustic features. The generated mask value is always 1.0. The dimension of the output mask is the dimension of the input mask plus 1.
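The behaviour described above can be illustrated with a minimal Python sketch. It is not the node's actual implementation: the function name append_delta_power_mask is hypothetical, a plain dict of lists stands in for the Map<int, ObjectRef> of Vector<float>, and it assumes the new value is placed at the end of each vector, which the description does not state explicitly.

```python
from typing import Dict, List


def append_delta_power_mask(masks: Dict[int, List[float]]) -> Dict[int, List[float]]:
    """Return a copy of `masks` with a constant 1.0 added to each mask vector.

    Keys are sound source IDs; values are mask vectors with entries in [0.0, 1.0].
    Assumption: the dynamic-log-power mask is appended as the last element.
    """
    return {source_id: list(mask) + [1.0] for source_id, mask in masks.items()}


# Two sound sources, each with a 3-dimensional input mask.
masks_in = {0: [1.0, 0.0, 0.5], 1: [0.2, 0.8, 1.0]}
masks_out = append_delta_power_mask(masks_in)

for source_id, mask in masks_out.items():
    print(source_id, len(mask), mask)
```

Each output vector is one element longer than its input, and the added element is 1.0 regardless of the input values, so the dynamic logarithmic power is always treated as reliable.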