Takes multi-channel speech waveform data from multi microphone arrays. This node is an enhanced version of AudioStreamFromMic to correspond to multiple devices. MultiAudioStreamFromMic corrects the difference in the number of frames between microphone devices when data is received over a long period of time.
No files are required.
When to use
This node is used to deal with multi-channel speech waveform data from multi microphone arrays as the input for HARK system. Note that it requires that the all microphone arrays are the same model, or that they have the same specification.
Typical connection
Figure 6.7 shows a connection example of the MultiAudioStreamFromMic .
Parameter name |
Type |
Default value |
Unit |
Description |
LENGTH |
512 |
[pt] |
Frame length as a fundamental unit for processing. |
|
ADVANCE |
160 |
[pt] |
Frame shift length. |
|
CHANNEL_COUNT |
8 |
[ch] |
Microphone input channel number of a device to use. |
|
SAMPLING_RATE |
16000 |
[Hz] |
Sampling frequency of audio waveform data loaded. |
|
DEVICETYPE |
WS |
Type of device to be used. |
||
GAIN |
0dB |
Gain value used with RASP device. |
||
DEVICE |
/dev/null |
A list of identification names required to access the devices. |
||
FRAME_COUNT_SKEW_TOLERANCE |
5.0 |
[sec] |
The tolerance in frame number in seconds. |
Input
None.
Output
: bool type. This indicates whether there is still input from the waveform to be processed. Used as an ending flag when processing the waveforms in a loop. When it is true, waveforms are loaded, and when it is false, reading is complete. trueis output continuously.
: Matrix<float> type. Indexed, multichannel audio waveform data with rows as channels and columns as samples. Size of the column is equal to the parameter LENGTH. This output terminals is hidden by default. The output terminal corresponding to each device needs to be added manually.
Please see the figure 6.8 to add the output terminals to the node.
Parameter
: Refer the parameter of the AudioStreamFromMic node. These values are the same for each device.
: string type. A list of identification names required to access the device. Specify identification names of multiple devices separated by space characters. Each separated identification name of a device is the same as DEVICE of AudioStreamFromMic .
: float The tolerance in frame number in seconds.
This node is an enhanced version of AudioStreamFromMic to deal with multiple devices. Each device specified in the list of devices separated by blank space of the DEVICE parameter corresponds to each output terminal AUDIO arranged in ascending order of the serial number. In the FRAME_COUNT_SKEW_TOLERANCE parameter, specify the timing to correct the difference in the number of frames. When the difference between the maximum value and the minimum value in the speech waveform data from multiple devices reaches the value specified by FRAME_COUNT_SKEW_TOLERANCE, the data that is the maximum will be deleted by the amount equivalent to the value specified in the FRAME_COUNT_SKEW_TOLERANCE parameter.