Execute demo4.sh in the FeatureExtraction directory. An execution example is shown in Figure 14.24. After execution, a file named MFBANK28_0.spec is generated. This file stores a sequence of 28-dimensional vectors as little-endian 32-bit floating-point numbers. If the sample does not run correctly, check that the f101b001.wav file is in the data directory.
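To inspect the generated file, the format described above can be read with a few lines of Python. This is a minimal sketch, not part of the sample: the function name `read_spec` is hypothetical, and it assumes the file has no header and contains only consecutive 28-dimensional frames of little-endian 32-bit floats.

```python
import struct


def read_spec(path, dim=28):
    """Read a feature file into a list of frames.

    Assumption: the file is a headerless sequence of little-endian
    32-bit floats, grouped into vectors of `dim` values each.
    """
    with open(path, "rb") as f:
        raw = f.read()

    frame_bytes = 4 * dim  # 4 bytes per 32-bit float
    if len(raw) % frame_bytes != 0:
        raise ValueError(
            "file size %d is not a multiple of %d bytes" % (len(raw), frame_bytes)
        )

    fmt = "<%df" % dim  # "<" selects little-endian byte order
    n_frames = len(raw) // frame_bytes
    return [
        list(struct.unpack_from(fmt, raw, i * frame_bytes))
        for i in range(n_frames)
    ]
```

Calling `read_spec("MFBANK28_0.spec")` would return one 28-element list per frame. The size check gives an early warning if the file has a different dimension or a header that this sketch does not account for.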
Twelve modules are included in this sample. There are three modules in MAIN (subnet) and nine modules in MAIN_LOOP (iterator). MAIN (subnet) and MAIN_LOOP (iterator) are shown in Figures 14.25 and 14.26. In outline, this is a simple network configuration in which MSLSExtraction calculates acoustic features from the audio waveforms read by the AudioStreamFromWave module, and SaveFeatures writes them to a file.
Since MSLSExtraction requires both the outputs of the mel-scale filter bank and the power spectra to calculate MSLS, the audio waveforms are analyzed by MultiFFT, their data type is converted by MatrixToMap and PowerCalcForMap, and the mel-scale filter bank outputs are then obtained by MelFilterBank.
Here, the USE_POWER property of MSLSExtraction is set to true so that the power term is output at the same time. MSLSExtraction reserves storage for the delta MSLS coefficients in addition to the MSLS coefficients and outputs the whole vector as a feature (the storage for the delta MSLS coefficients is filled with zeros). Since the USE_POWER property is set to true, storage for the delta power term is reserved as well. The output feature vectors therefore have 2 × (FBANK_COUNT + 1) dimensions, where FBANK_COUNT is the value specified in the FBANK_COUNT property of MSLSExtraction. The delta MSLS coefficients and the delta power term are calculated and stored by Delta. SaveFeatures saves the input FEATURE. The localization result generated by ConstantLocalization, which indicates a source in front, is given to SOURCES.
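The dimension arithmetic described above can be checked with a short sketch. The function name `msls_feature_dimension` is hypothetical and is not part of the sample; the value 13 used in the example is an assumption chosen only because it reproduces the 28 dimensions of MFBANK28_0.spec.

```python
def msls_feature_dimension(fbank_count, use_power=True):
    """Return the length of the feature vector described in the text.

    The static part holds `fbank_count` MSLS coefficients, plus one
    power term when `use_power` is true. An equally sized region is
    reserved for the delta terms, so the total is twice the static part.
    """
    static_dim = fbank_count + (1 if use_power else 0)
    return 2 * static_dim


# Assumed FBANK_COUNT of 13 (not confirmed by the parameter table):
# 2 * (13 + 1) = 28, which matches the 28-dimensional output file.
print(msls_feature_dimension(13))
```

Comparing this result with the dimension of the vectors actually stored in the output file is a quick way to confirm that FBANK_COUNT and USE_POWER are set as intended.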
Table 14.16 summarizes the main parameters.
| Node name | Parameter name | Type | Value |
|---|---|---|---|
| MAIN_LOOP | LENGTH | | 2 |
| | ADVANCE | | 3 |
| | SAMPLING_RATE | | 4 |
| | FBANK_COUNT | | 5 |
| | FBANK_COUNT1 | | 6 |
| | DOWHILE | | (empty) |
| | FBANK_COUNT | | FBANK_COUNT |
| | NORMALIZE_MODE | | Cepstral |
| | USE_POWER | | true |
| | FBANK_COUNT1 | | FBANK_COUNT1 |