Telecom Server

Get a high-performance TTS server for multi-threaded environments

Optimized for large-scale telephony applications

IVONA Telecom Server is a stand-alone software product based on the IVONA TTS Engine. It can be used in a variety of applications that require multi-threaded TTS services. Customers can integrate text-to-speech using a client-server architecture.

Benefits for you

  • Built-in text normalization (numbers, currency and more)
  • Prosody control (volume, speed, pitch)
  • User-level pronunciation lexicon
  • Dynamic voice and language switching
  • Support for phonetic alphabets
  • Support for MRCPv1/v2

Recommended Telecom Server Uses

Telecommunications

Improve caller experience in IVR systems and telephony solutions.

Licensing Model

IVONA Telecom’s pricing is based on the maximum number of concurrent users streaming speech (ports). Minimum order: 5 ports.
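
A client application might keep itself within a licensed port count by gating synthesis requests behind a counting semaphore. The sketch below is only an illustration: `PortPool` and `LICENSED_PORTS` are hypothetical names, not part of any IVONA API, and how the server itself enforces the limit is not described on this page.

```python
import threading
from contextlib import contextmanager
from typing import Iterator, Optional

LICENSED_PORTS = 5  # the minimum order stated above; adjust to your license


class PortPool:
    """Caps the number of simultaneous speech streams (hypothetical helper)."""

    def __init__(self, ports: int) -> None:
        self._slots = threading.BoundedSemaphore(ports)

    @contextmanager
    def port(self, timeout: Optional[float] = None) -> Iterator[None]:
        # Wait for a free port; give up after `timeout` seconds if one is set.
        if not self._slots.acquire(timeout=timeout):
            raise TimeoutError("all licensed ports are busy")
        try:
            yield
        finally:
            # Always hand the port back, even if synthesis failed.
            self._slots.release()


pool = PortPool(LICENSED_PORTS)
with pool.port(timeout=2.0):
    pass  # send one synthesis request here
```

Each worker thread wraps its request in `pool.port()`, so a sixth simultaneous request waits, or times out, instead of exceeding the licensed count.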

Hosted Service Providers

We have prepared a special offer for Hosted IVR Service Providers and Hosted Virtual Assistant Providers who want to integrate high-quality TTS functionality into their platform. Please contact us for more information.

IVONA Telecom Server Specifications

Specifications cover two variants: IVONA Telecom MRCP and IVONA Telecom SAPI.

Technology

BrightVoice

Natural, lifelike voices resulting from an innovative approach to unit selection technology. Reduced unnatural discontinuities, electronic noise, and audible glitches. High accuracy through sophisticated NLP algorithms built into the TTS engine. Support for natural reading of short and long texts.

Languages and voices

See the voice list at http://www.ivona.com/en/voices-list/

Prosody control

Ability to adjust volume, speech rate and pitch at runtime.
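
Because the server lists W3C SSML compliance, one plausible way to request these adjustments is the standard SSML `<prosody>` element. The following is a minimal sketch under that assumption: the helper name `prosody_ssml` is hypothetical, and which attribute values the IVONA engine honors is not stated on this page.

```python
import xml.etree.ElementTree as ET

SSML_NS = "http://www.w3.org/2001/10/synthesis"
XML_NS = "http://www.w3.org/XML/1998/namespace"


def prosody_ssml(text: str, *, volume: str = "medium", rate: str = "medium",
                 pitch: str = "medium", lang: str = "en-US") -> str:
    """Wrap `text` in an SSML <prosody> element and return the document."""
    ET.register_namespace("", SSML_NS)  # serialize SSML as the default namespace
    speak = ET.Element(f"{{{SSML_NS}}}speak",
                       {"version": "1.0", f"{{{XML_NS}}}lang": lang})
    prosody = ET.SubElement(speak, f"{{{SSML_NS}}}prosody",
                            {"volume": volume, "rate": rate, "pitch": pitch})
    prosody.text = text  # ElementTree escapes <, > and & on output
    return ET.tostring(speak, encoding="unicode")


print(prosody_ssml("Your balance is 42 dollars.",
                   volume="loud", rate="slow", pitch="+10%"))
```

Building the document with an XML library rather than string formatting keeps caller-supplied text from breaking the markup.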

Built-in domains support

IVONA TTS has built-in mechanisms to correctly pronounce texts from specific communicative contexts such as social text, acronyms, abbreviations and numbers.

Mixing static expressive prompts

Mechanism to mix static audio prompts with dynamically generated TTS output.
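
The page does not describe how this mechanism works. As a rough client-side illustration only, the sketch below joins a pre-recorded prompt and a synthesized segment into one 8 kHz, 16-bit mono WAV stream. It assumes both inputs are raw PCM in that format; `join_segments` and `silence` are hypothetical helpers, not IVONA functions.

```python
import io
import wave
from typing import List

SAMPLE_RATE = 8000  # Hz, the sampling rate listed in the specifications
SAMPLE_WIDTH = 2    # bytes per sample for 16-bit PCM


def silence(ms: int) -> bytes:
    """Return `ms` milliseconds of digital silence as raw PCM."""
    return b"\x00" * (SAMPLE_RATE * ms // 1000 * SAMPLE_WIDTH)


def join_segments(segments: List[bytes], gap_ms: int = 100) -> bytes:
    """Concatenate raw PCM segments, separated by a pause, into WAV bytes."""
    pcm = silence(gap_ms).join(segments)
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(1)             # mono
        wav.setsampwidth(SAMPLE_WIDTH)
        wav.setframerate(SAMPLE_RATE)
        wav.writeframes(pcm)
    return buf.getvalue()
```

A caller would pass the static prompt first and the TTS output second, for example `join_segments([prompt_pcm, tts_pcm])`.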

Support for phonetic alphabets

IPA, X-SAMPA, TeleAtlas®, Navteq™
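
A user lexicon with IPA pronunciations can be expressed in the W3C PLS 1.0 format listed under standards compliance. The sketch below builds such a document. `build_lexicon` is a hypothetical helper, the sample entry is illustrative, and the IVONA-specific PLS extensions mentioned on this page are not covered.

```python
import xml.etree.ElementTree as ET
from typing import Mapping

PLS_NS = "http://www.w3.org/2005/01/pronunciation-lexicon"
XML_NS = "http://www.w3.org/XML/1998/namespace"


def build_lexicon(entries: Mapping[str, str], lang: str = "en-US") -> str:
    """Return a PLS 1.0 document mapping graphemes to IPA pronunciations."""
    ET.register_namespace("", PLS_NS)  # serialize PLS as the default namespace
    lexicon = ET.Element(f"{{{PLS_NS}}}lexicon", {
        "version": "1.0",
        "alphabet": "ipa",
        f"{{{XML_NS}}}lang": lang,
    })
    for grapheme, ipa in entries.items():
        lexeme = ET.SubElement(lexicon, f"{{{PLS_NS}}}lexeme")
        ET.SubElement(lexeme, f"{{{PLS_NS}}}grapheme").text = grapheme
        ET.SubElement(lexeme, f"{{{PLS_NS}}}phoneme").text = ipa
    return ET.tostring(lexicon, encoding="unicode")


print(build_lexicon({"tomato": "təˈmeɪtoʊ"}))
```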

Requirements

CPU requirements

  • IVONA Telecom MRCP: x86 (32/64 bit)
  • IVONA Telecom SAPI: x86 (32/64 bit)

RAM

  • IVONA Telecom MRCP: recommended minimum of 128 MB for each voice
  • IVONA Telecom SAPI: recommended minimum of 128 MB for each voice
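
For capacity planning, the per-voice figure scales linearly with the number of installed voices. A small sketch of that arithmetic follows. `min_voice_ram_mb` is a hypothetical helper, and the result covers only the recommended voice memory, not the operating system or other processes.

```python
MB_PER_VOICE = 128  # recommended minimum per voice, as listed above


def min_voice_ram_mb(voice_count: int) -> int:
    """Return the recommended minimum RAM in MB for `voice_count` voices."""
    if voice_count < 0:
        raise ValueError("voice_count must be non-negative")
    return voice_count * MB_PER_VOICE


# Four voices: 4 * 128 = 512 MB
print(min_voice_ram_mb(4))
```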

OS

  • IVONA Telecom MRCP: Linux
  • IVONA Telecom SAPI: Windows, Windows Server

Interfaces

  • IVONA Telecom MRCP: MRCP v1 (RFC 4463), MRCP v2, command line, TCP/IP, Unix socket, Asterisk plugin (versions 1.2, 1.4, 1.6, 1.8)
  • IVONA Telecom SAPI: SAPI 5, command line
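
To give a sense of what an MRCP v2 client sends, the sketch below frames a SPEAK request following the message layout in RFC 6787. It covers message framing only: a real session also needs SIP/SDP setup of the control channel, which is omitted. `build_speak_request` is a hypothetical helper and the channel identifier is a made-up value.

```python
def build_speak_request(request_id: int, channel_id: str, ssml: str) -> bytes:
    """Frame an MRCPv2 SPEAK request carrying an SSML body."""
    body = ssml.encode("utf-8")
    headers = (
        f"Channel-Identifier:{channel_id}\r\n"
        "Content-Type:application/ssml+xml\r\n"
        f"Content-Length:{len(body)}\r\n"
        "\r\n"
    ).encode("ascii")
    # The message-length field counts every byte of the message, including
    # its own digits, so recompute until the value is self-consistent.
    length = 0
    while True:
        start_line = f"MRCP/2.0 {length} SPEAK {request_id}\r\n".encode("ascii")
        total = len(start_line) + len(headers) + len(body)
        if total == length:
            return start_line + headers + body
        length = total


request = build_speak_request(
    1,
    "32AECB23433802@speechsynth",  # illustrative channel identifier
    '<speak version="1.0" xmlns="http://www.w3.org/2001/10/synthesis" '
    'xml:lang="en-US">Hello</speak>',
)
print(request.decode("utf-8"))
```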

Standards compliance

  • IVONA Telecom MRCP: W3C SSML 1.0/1.1, W3C PLS 1.0 (with IVONA extensions)
  • IVONA Telecom SAPI: W3C SSML 1.0/1.1, W3C PLS 1.0 (with IVONA extensions), SAPI markup (with support for mixing with SSML tags)

Product features

Sampling rate

  • IVONA Telecom MRCP: 8 kHz
  • IVONA Telecom SAPI: 8 kHz

Audio formats

  • IVONA Telecom MRCP: A-law, μ-law, PCM 16-bit mono
  • IVONA Telecom SAPI: A-law, μ-law, PCM 16-bit mono
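
The sampling rate and audio formats determine the payload bandwidth of each stream: A-law and μ-law use one byte per sample, and 16-bit PCM uses two. A short sketch of the arithmetic follows. `stream_kbps` and the format keys are hypothetical names, and network packet overhead is not included.

```python
SAMPLE_RATE_HZ = 8000  # sampling rate listed above

# Bytes per sample for each listed format (keys are illustrative labels).
BYTES_PER_SAMPLE = {"alaw": 1, "ulaw": 1, "pcm16": 2}


def stream_kbps(fmt: str, ports: int = 1) -> float:
    """Return the raw audio payload rate in kbit/s for `ports` mono streams."""
    return SAMPLE_RATE_HZ * BYTES_PER_SAMPLE[fmt] * 8 * ports / 1000


print(stream_kbps("ulaw"))           # one μ-law stream
print(stream_kbps("pcm16"))          # one 16-bit PCM stream
print(stream_kbps("alaw", ports=5))  # five concurrent A-law streams
```

One A-law or μ-law stream therefore carries 64 kbit/s of audio and one 16-bit PCM stream carries 128 kbit/s.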

Components

  • IVONA Telecom MRCP: Media Server (MRCP), Speech Server (daemon), tools, documentation
  • IVONA Telecom SAPI: SAPI component, tools, documentation

Scalability by multiplying speech servers

  • IVONA Telecom SAPI: N/A