Research Objective
To apply the turbo principle from digital communications to automatic speech recognition (ASR), improving recognition performance by integrating further sources of information such as additional modalities, acoustic channels, or acoustic models.
Research Results
The paper concludes that the turbo ASR approach significantly outperforms conventional information-fusion methods, with an average relative word error rate (WER) reduction of 22.4% for audio-visual tasks and 18.2% for audio-only tasks across all SNR conditions and investigated noise types.
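For readers unfamiliar with the metric, the following is a minimal sketch of how a relative WER reduction is commonly computed from a baseline WER and an improved WER. The function name `relative_wer_reduction` and the numeric inputs are hypothetical illustrations, not values or code from the paper.

```python
def relative_wer_reduction(baseline_wer: float, improved_wer: float) -> float:
    """Return the relative WER reduction in percent.

    Both arguments are word error rates expressed in the same unit
    (for example, percent). The baseline must be strictly positive.
    """
    if baseline_wer <= 0:
        raise ValueError("baseline_wer must be positive")
    return 100.0 * (baseline_wer - improved_wer) / baseline_wer


# Hypothetical numbers for illustration only (not taken from the paper):
# a baseline WER of 25.0% lowered to 20.0%.
example = relative_wer_reduction(25.0, 20.0)
print(f"relative WER reduction: {example:.1f}%")
```

Note that a relative reduction is measured against the baseline error rate, so it differs from the absolute difference in WER points (5.0 points in the hypothetical case above).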
Research Limitations
The paper does not explicitly state any limitations. However, the complexity and computational requirements of the turbo ASR approach, especially for large vocabulary continuous speech recognition (LVCSR), could be considered potential limitations.