Tux Droid VoiceXML Web Browser

Synopsis

A VoiceXML web browser using a "Tux Droid" as the human-computer interface.

Codename

Vux

Rationale

A proof-of-concept for the multimodal web and a bit of fun.

Features

  • Audio output via audio playback and TTS (text to speech)
  • Input via the XML form of DTMF grammars described in the W3C Speech Recognition Grammar Specification, triggered by buttons on the Tux Droid remote.
  • Input via speech recognition grammar data in the XML Form of the W3C Speech Recognition Grammar Specification.
  • Record audio received from the user

Extra input and output

Input

  • Push button on head - home
  • Menu button - home
  • Buttons 1-9, *, #, red, green, blue and yellow - DTMF tones
  • Left direction button - back
  • Right direction button - forward
  • Vol+ & Vol- - volume control

Output

  • Beak movements during TTS audio output
  • Error, status and notification messages using flashing eyes, moving wings and spinning

Use Cases

  • Voice webmail - reading email messages
  • Voice interface for home automation
  • Voice interface for music collection

Implementation

Possibly a front end to an existing VoiceXML interpreter such as Public VoiceXML which uses OpenVXI.

Developer Resources