Vocapia Research develops speech processing technologies for multilingual, large-vocabulary speech recognition (speech-to-text), automatic audio segmentation, language identification, and speaker recognition.
The Vocapia Research VoxSigma software suite delivers state-of-the-art performance on broadcast data and conversational speech in multiple languages.
This core technology can serve as the basis for a variety of applications ranging from interactive conversational systems to the automatic indexing of audio data.
For the latter class of applications, large-vocabulary continuous speech recognition is the key technology for enabling content-based information access in audio and video documents. Most of the linguistic information in audiovisual data is carried by the audio channel; once that channel is transcribed, the content can be searched and retrieved with ordinary text-based tools.
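
As a rough illustration of what text-based access to transcribed audio can look like, the sketch below builds a small inverted index from time-stamped words and looks up where a term was spoken. It does not use the VoxSigma API: the transcript data, the (start time, word) layout, and the function names build_index and search are all hypothetical, chosen only for this example.

```python
from collections import defaultdict

# Hypothetical recognizer output: (start_time_in_seconds, word) pairs.
# The layout is an assumption for illustration, not an actual VoxSigma format.
transcript = [
    (0.0, "the"), (0.4, "election"), (1.1, "results"),
    (12.5, "weather"), (13.0, "forecast"),
    (47.2, "election"), (47.9, "coverage"),
]


def build_index(words):
    """Map each lowercased word to the list of times at which it was spoken."""
    index = defaultdict(list)
    for start, word in words:
        index[word.lower()].append(start)
    return index


def search(index, term):
    """Return the start times for a term, or an empty list if it never occurs."""
    return index.get(term.lower(), [])


index = build_index(transcript)
print(search(index, "election"))  # time offsets at which "election" was spoken
```

Each hit is a time offset into the recording, so a player could jump directly to the passage where the term occurs. A production system would also have to deal with recognition errors, word confidence scores, and multi-word queries, none of which this sketch attempts.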