IVONA SDK

Make your devices more user-friendly and productive.

IVONA SDK - Technical specification

PDF
IVONA SDK

IVONA Speech Cloud
(SaaS)
Requirements
Storage memory (ROM) per voice
- 100‐250 MB
- < 50 MB**
- < 20 MB**
0 MB
Runtime memory (RAM)
- 5 - 13 MB
- < 5MB**
0 MB
CPU
Scalable
Under 50 MIPS
Chipset
X86 (32/64 bit); ARM 7,8,9,11; Strong ARM; X-Scale; Sparc (32/64 bit)*; PowerPC*; MIPS*
OS
Linux, Windows, Android, Windows Mobile, Windows CE, iOS, MeeGo, Mac OS X, Solaris*, FreeBSD*
API
IVONA C/C++ API, IVONA Java API, TCP/IP, Unix socket, SAPI 4*, SAPI 5
Web Services (SOAP), IVONA C/C++ API**, IVONA Java API**
Key features
BrightVoice™: superior speech output quality
Languages and voices
US English (2 male, 3 female, child); British English (male, 2 female); US Spanish (male, female, child**); Spanish (male, female, child**); German (male, female); French (male, female); Polish (2 male, 3 female); Romanian (female); Welsh; Welsh English; Australian English; Italian; Dutch**; Canadian French**; Brazilian Portuguese**; Portuguese**; Icelandic**; Russian**; Korean**; Danish**; Swedish**; Japanese**; Special Effects (SFX)/Character voices*
Sampling rate
8 kHz, 22 kHz (up to 48 kHz*)
8 kHz, 22 kHz (up to 48 kHz*)
Audio formats
PCM 16 bit mono/stereo*, A-law, μ-law, mp3*, vorbis (ogg) *
mp3, vorbis (ogg), PCM 16 bit mono*/stereo** A-law, μ-law
Low response (start/stop) time
SDK components
Libraries, documentation, examples (C/C++, PHP), support; Supported events: word-highlighting (alignment of text with audio), visemes (lip-sync), bookmarks, SSML events
Prosody control: volume, speed, pitch
User level pronunciation lexicon

(with regular expression rules support)
Language detection
**
**
Phoneme mapping for mixed languages input
**
**
Text preprocessing rules for specific domains
*
*
Dynamic voice and language switching
Mixing static expressive prompts
Custom voices: voice branding or custom accent and/or style
Support for natural reading of long texts
Support for phonetic alphabets
IPA, X-SAMPA, TeleAtlas®, Navteq™
IPA, X-SAMPA, TeleAtlas®, Navteq™
Expressive TTS effects
**
**
Standards compliance
W3C SSML 1.0/1.1, W3C PLS 1.0 (with IVONA extensions)
W3C SSML 1.0/1.1 (with IVONA extensions)
Built-in audio effects
**
Support for text highlighting
REST support
N/A
* Available on request
** Feature in development

Examples of implementations

The quality of the IVONA brand is not only evidenced by our numerous awards and honors. We are especially proud of the large number of business partners who have placed their trust in us. More

Professional support

ISMB ISMB - provides expert technical assistance for businesses and includes free software updates.
UPGRADE Upgrade - a special upgrade offer of business products for customers that do not have an ISMB package.

IVONA SDK is used by: