Skip to main content
POST
/
api
/
v1
/
voice
/
clone
Create a custom cloned voice from audio samples
const form = new FormData();
form.append('Name', '<string>');
form.append('Description', '<string>');
form.append('Language', '<string>');
form.append('Provider', 'ElevenLabs');
form.append('ProviderSpecific.ElevenLabs.RemoveBackgroundNoise', 'true');
form.append('ProviderSpecific.Cartesia.Mode', '<string>');
form.append('ProviderSpecific.Cartesia.Enhance', 'true');
form.append('ProviderSpecific.Cartesia.Transcript', '<string>');
form.append('Labels', '{}');
form.append('audioFiles', '<string>');
form.append('audioFiles.items', '{
  "fileName": "example-file"
}');

const options = {method: 'POST', headers: {Authorization: 'Bearer <token>'}};

options.body = form;

fetch('https://blackbox.dasha.ai/api/v1/voice/clone', options)
  .then(res => res.json())
  .then(res => console.log(res))
  .catch(err => console.error(err));
{
  "id": "<string>",
  "provider": "ElevenLabs",
  "category": "Public",
  "name": "<string>",
  "voiceId": "<string>",
  "description": "<string>",
  "language": "<string>",
  "labels": {},
  "previewUrl": "<string>",
  "createdTime": "2023-11-07T05:31:56Z",
  "lastUpdatedTime": "2023-11-07T05:31:56Z"
}

Body

multipart/form-data
Name
string
required

Name identifier for the cloned voice

Required string length: 1 - 100
Description
string
required

Description of voice characteristics

Required string length: 1 - 1000
Language
string
required

Primary language for the voice model

Provider
enum<string>
required

Voice cloning service provider

Available options:
ElevenLabs,
Cartesia,
Dasha,
Inworld,
Lmnt
ProviderSpecific.ElevenLabs.RemoveBackgroundNoise
boolean

Whether to remove background noise from audio

ProviderSpecific.Cartesia.Mode
string

Cloning mode: Stability or Similarity

ProviderSpecific.Cartesia.Enhance
boolean

Whether to enhance audio quality

ProviderSpecific.Cartesia.Transcript
string

Transcript for voice cloning

Labels
object

Custom metadata labels

audioFiles
file[]

Audio files for voice cloning

Response

Returns the created cloned voice details

Response DTO for TTS voice cloning operations

id
string
required

Unique identifier for the voice

Minimum string length: 1
provider
enum<string>
required

TTS provider (ElevenLabs, Cartesia, Dasha, Inworld, Lmnt)

Available options:
ElevenLabs,
Cartesia,
Dasha,
Inworld,
Lmnt
category
enum<string>
required

Voice category (Public or Cloned)

Available options:
Public,
Cloned
name
string | null

Display name of the voice

voiceId
string | null

Voice ID used for synthesis

description
string | null

Description of voice characteristics

language
string | null

Primary language for the voice

labels
object

Custom metadata labels

previewUrl
string | null

URL for voice preview audio

createdTime
string<date-time> | null

Timestamp when voice was created

lastUpdatedTime
string<date-time> | null

Timestamp when voice was last updated