The combination of XHTML and VoiceXML to give websites voice capabilities. This multimodal approach lets handheld devices that browse the Web accept spoken input and produce spoken output in addition to, or instead of, using the screen. Also known as "X+V," XHTML+Voice lets VoiceXML event handlers be attached to page elements through the event handling capability of the Document Object Model (DOM). See XHTML, VoiceXML and DOM.
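
As a rough illustration of how a VoiceXML handler is tied to a page element, the sketch below embeds a small X+V-style page in a Python string and lists which elements trigger which voice forms. The page itself is hypothetical: the id `ask_city`, the prompt text, the field name and the helper `voice_bindings` are invented for this example, and the DOCTYPE and any speech grammar are left out for brevity. Only the three namespace URIs (XHTML, VoiceXML, XML Events) are taken from the published specifications.

```python
import xml.etree.ElementTree as ET

# Namespace URIs for VoiceXML 2.0 and XML Events.
VXML = "http://www.w3.org/2001/vxml"
EV = "http://www.w3.org/2001/xml-events"

# Hypothetical page: ids, prompt text and field name are invented.
# The voice form lives in <head>; the XHTML <input> names it as the
# handler for its "focus" event using XML Events attributes.
PAGE = """<html xmlns="http://www.w3.org/1999/xhtml"
      xmlns:vxml="http://www.w3.org/2001/vxml"
      xmlns:ev="http://www.w3.org/2001/xml-events">
  <head>
    <title>City lookup</title>
    <vxml:form id="ask_city">
      <vxml:block>Which city?</vxml:block>
    </vxml:form>
  </head>
  <body>
    <input type="text" name="city" ev:event="focus" ev:handler="#ask_city"/>
  </body>
</html>"""


def voice_bindings(markup):
    """Return (element name, event, VoiceXML form id) for each voice handler."""
    root = ET.fromstring(markup)
    # Collect the ids of every VoiceXML form declared in the page.
    forms = {form.get("id") for form in root.iter(f"{{{VXML}}}form")}
    bindings = []
    for element in root.iter():
        event = element.get(f"{{{EV}}}event")
        handler = element.get(f"{{{EV}}}handler", "")
        target = handler.lstrip("#")  # "#ask_city" -> "ask_city"
        if event and target in forms:
            tag = element.tag.split("}")[-1]  # drop the "{namespace}" part
            bindings.append((tag, event, target))
    return bindings


print(voice_bindings(PAGE))  # [('input', 'focus', 'ask_city')]
```

In an X+V browser, giving the text field focus would run the `ask_city` form and speak its prompt. The Python code only inspects the markup; it does not perform any speech processing.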