TechNews logo

New Features for VoiceXML and Smarter Speech Recognition

Published Thu, 2007-06-21 15:28

W3C has published standards that enhance the capabilities and interoperability of voice browsers and speech recognition systems. The W3C Voice Browser Working Group has completed work on both VoiceXML 2.1 and Semantic Interpretation for Speech Recognition (SISR) 1.0.

VoiceXML 2.1 extends the widely used VoiceXML 2.0 dialog language to include commonly implemented features. These features include dynamic references to grammars and scripts, detecting when during a prompt the user barges-in, and processing multiple sets of data from the server in a single access. All VoiceXML 2.0 applications will work under VoiceXML 2.1 without modification, allowing for significant interoperability.

Semantic Interpretation for Speech Recognition (SISR 1.0) enables developers to extract and translate textual representations of words recognized by a speech recognition system and structure the results into a format convenient for processing by the speech application. For example, with SISR, one could specify the instructions for converting the spoken sentence "I want to fly from Los Angeles to Seattle." to a data structure containing "departure: LAX" and "destination: SEA" .
Speech Interface Framework Built by International Leaders in Research and Industry

The W3C Speech Interface Framework includes the completed standards VoiceXML 2.1, SISR 1.0, Speech Recognition Grammar Specification (SRGS 1.0), and Speech Synthesis Markup Language (SSML 1.0). W3C anticipates that the next pieces of the Speech Interface Framework to become W3C Recommendations are CCXML, Pronunciation Lexicon Specification (PLS 1.0) and SSML 1.1. The Voice Browser Working Group is also developing State Chart XML (SCXML) and VoiceXML 3.0.


Post new comment

The content of this field is kept private and will not be shown publicly.