Detailed configuration for each TTS provider. For basic setup, see Voice & Speech.Documentation Index
Fetch the complete documentation index at: https://docs.blackbox.dasha.ai/llms.txt
Use this file to discover all available pages before exploring further.
ElevenLabs
Industry-leading voice quality with extensive customization. Browse voices → Models:eleven_multilingual_v2— Best quality, 29+ languageseleven_turbo_v2_5— Balanced speed and qualityeleven_flash_v2_5— Fastest inference
Configuration options
| Option | Range | Default | Effect |
|---|---|---|---|
similarity_boost | 0.0–1.0 | 0.75 | Voice consistency with original |
stability | 0.0–1.0 | 0.5 | Output consistency across generations |
style | 0.0–1.0 | 0.3 | Speaker style exaggeration |
use_speaker_boost | boolean | true | Enhanced clarity |
optimize_streaming_latency | 0–4 | 4 | Trade quality for speed |
Example
ElevenLabs configuration example
ElevenLabs configuration example
Cartesia
Granular emotion control with wide speed range. Browse voices → Models:sonic-3— Latest generation (recommended)sonic-2— Previous generationsonic— Original model
Emotion system
Combine emotions for nuanced delivery:| Dimension | Intensities |
|---|---|
| anger | lowest, low, high, highest |
| positivity | lowest, low, high, highest |
| surprise | lowest, low, high, highest |
| sadness | lowest, low, high, highest |
| curiosity | lowest, low, high, highest |
Example
Cartesia configuration example
Cartesia configuration example
Inworld
Character-focused voices for gaming and interactive media. Browse voices → Models:inworld-tts-1.5-max— Highest qualityinworld-tts-1.5-mini— Balanced speed and quality (recommended)inworld-tts-1— Original model
Configuration options
| Option | Default | Effect |
|---|---|---|
temperature | 0.8 | Voice expressiveness |
pitch | 0.0 | Pitch adjustment (+/-) |
Example
Inworld configuration example
Inworld configuration example
LMNT
Consistent, lightweight synthesis. Browse voices → Models:blizzard— Standard model
Example
LMNT configuration example
LMNT configuration example
LMNT does not support speed adjustment. Voice always plays at 1.0x.
Advanced settings
Responsiveness
Controls how quickly the agent begins speaking after the user finishes.| Value | Behavior |
|---|---|
| 1.0 | Most responsive — minimal delay (recommended) |
| 0.7 | Slight delay added |
| 0.5 | Moderate delay |
| 0.0 | Maximum delay |
Responsiveness configuration example
Responsiveness configuration example
Dynamic speed adjustment
Allow agents to adapt speech pace when users request it (“Can you speak more slowly?”).Speed adjustment configuration example
Speed adjustment configuration example
| Strategy | Behavior |
|---|---|
OnRequest | Agent adjusts speed when user requests (default) |
Disabled | Speed remains fixed |
Speed ranges by provider
| Provider | Min | Max | Default | Recommended |
|---|---|---|---|---|
| ElevenLabs | 0.70x | 1.20x | 1.0x | 0.9x–1.2x |
| Cartesia | 0x (sent as 0.25x) | 2.0x | 1.0x | 0.8x–1.3x |
| Inworld | 0.80x | 1.50x | 1.0x | 0.9x–1.1x |
| LMNT | 1.0x | 1.0x | 1.0x | 1.0x (fixed) |
Provider decision guide
Choose ElevenLabs if:- You need highest voice quality
- Brand-specific voice cloning is important
- Advanced customization is required
- Emotional expression is important
- You need wide speed range (0x-2.0x)
- You’re building character-driven experiences
- Gaming or interactive media is your use case
- You need consistent, predictable output
- Minimal configuration is desired
Pronunciation dictionary
Customize how your agent pronounces brand names, acronyms, and technical terms.Provider support
| Provider | Alias rules | Phoneme rules |
|---|---|---|
| ElevenLabs | Yes | No |
| Cartesia | Yes | Yes |
| Inworld | No | No |
| LMNT | No | No |
Create a dictionary
API example: Create pronunciation dictionary
API example: Create pronunciation dictionary
Reference in agent config
Pronunciation dictionary reference example
Pronunciation dictionary reference example
Multilingual configuration
For agents that switch languages mid-conversation:Multilingual configuration example
Multilingual configuration example
Related
Voice & Speech
Basic voice configuration
Dashboard Testing
Test voice quality in the browser