Improving Domain-independent Cloud-based Speech Recognition with Domain-dependent Phonetic Post-processing
Proceedings of the 28th AAAI Conference on Artificial Intelligence (AAAI-14),
Editors: Carla E. Brodley and Peter Stone (Program Cochairs),
pages 1529--1535,
Jul 2014
Automatic speech recognition (ASR) technology has
been developed to such a level that off-the-shelf distributed speech recognition services are available (free
of cost), which allow researchers to integrate speech
into their applications with little development effort or
expert knowledge, leading to better results compared
with previously used open-source tools.
Often, however, such services do not accept language
models or grammars but process free speech from any
domain. While results are very good given the enormous size of the search space, results frequently contain
out-of-domain words or constructs that cannot be understood by subsequent domain-dependent natural language understanding (NLU) components. We present a
versatile post-processing technique based on phonetic
distance that integrates domain knowledge with open-domain ASR results, leading to improved ASR performance. Notably, our technique is able to make use of
domain restrictions using various degrees of domain
knowledge, ranging from pure vocabulary restrictions
via grammars or N-Grams to restrictions of the acceptable utterances. We present results for a variety of corpora (mainly from human-robot interaction) where our
combined approach significantly outperforms Google
ASR as well as a plain open-source ASR solution.
@InProceedings{TBHW14,
  author    = {Twiefel, Johannes and Baumann, Timo and Heinrich, Stefan and Wermter, Stefan},
  title     = {Improving Domain-independent Cloud-based Speech Recognition with Domain-dependent Phonetic Post-processing},
  booktitle = {Proceedings of the 28th AAAI Conference on Artificial Intelligence (AAAI-14)},
  editor    = {Carla E. Brodley and Peter Stone},
  pages     = {1529--1535},
  year      = {2014},
  month     = {Jul},
  publisher = {AAAI Press, Palo Alto, US},
}