Clone
Create a custom voice clone using Bland’s BTTS v3 engine
POST
Authentication
Your API key for authentication.
Request Body
The request must be sent as multipart/form-data with the following fields:
The name for your voice clone. Must be 30 characters or fewer.
Validation:
- Required field
- Maximum 30 characters
- Will be used as the display name in your voice library
Audio file(s) containing voice samples for cloning.
Requirements:
- At least one audio file is required
- The default BTTS v3 engine only consumes a single sample; additional files are ignored
- Supported formats: WAV, MP3, and other common audio formats
- Maximum file size: 10MB per file
- Maximum total upload size: 25MB
- Audio duration: 1-60 seconds per sample
- Higher quality audio samples produce better voice clones
- Recommended: 10-15 seconds of clear speech
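The 1-60 second duration requirement above can be checked locally before uploading. The following is a minimal sketch for WAV samples only, using Python's standard `wave` module; the function names `wav_duration_seconds` and `is_duration_allowed` are illustrative and not part of Bland's API, and other formats such as MP3 would need a different decoder.

```python
import io
import wave

# Duration limits taken from the requirements listed above.
MIN_SECONDS = 1.0
MAX_SECONDS = 60.0


def wav_duration_seconds(data: bytes) -> float:
    """Return the duration, in seconds, of a WAV file held in memory."""
    with wave.open(io.BytesIO(data), "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def is_duration_allowed(seconds: float) -> bool:
    """Check a duration against the documented 1-60 second range."""
    return MIN_SECONDS <= seconds <= MAX_SECONDS
```

Running this check client-side avoids spending an upload on a sample the server would reject anyway.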
The gender of the voice being cloned.
Valid values: "male", "female"
- Optional but recommended for better voice modeling
- Helps the BTTS system optimize voice characteristics
Optional description for the voice clone.
Usage:
- Provides additional context to the underlying model for tone, style, and similar characteristics
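The name, gender, and description rules above can be enforced before a request is built. The sketch below assembles the text part of the multipart form. The form keys `"name"`, `"gender"`, and `"description"` are assumptions, since this page does not show the parameter names, and `build_clone_form_fields` is a hypothetical helper rather than part of any Bland SDK.

```python
from typing import Dict, Optional

MAX_NAME_LENGTH = 30                 # documented limit for the voice name
VALID_GENDERS = {"male", "female"}   # documented valid values


def build_clone_form_fields(
    name: str,
    gender: Optional[str] = None,
    description: Optional[str] = None,
) -> Dict[str, str]:
    """Validate the text fields and return them as multipart form fields.

    The dictionary keys are assumed names; check them against the
    official parameter list before sending a real request.
    """
    if not name:
        raise ValueError("name is required")
    if len(name) > MAX_NAME_LENGTH:
        raise ValueError(f"name must be at most {MAX_NAME_LENGTH} characters")

    fields = {"name": name}

    if gender is not None:
        if gender not in VALID_GENDERS:
            raise ValueError('gender must be "male" or "female"')
        fields["gender"] = gender

    if description:
        fields["description"] = description

    return fields
```

The returned dictionary can be passed as the non-file form data of whichever HTTP client is used, alongside the audio file parts.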
Opt into the production-grade BTTS v2 cloning engine instead of the default BTTS v3.
Usage:
- Pass "true" (as a string) in the multipart form to use BTTS v2
- V2 is the previous-generation stable engine; V3 is the default
Audio Validation Limits
The following limits are enforced for audio file uploads:
- Maximum file size: 10MB per individual file
- Maximum total upload size: 25MB per request
- Maximum files per upload: 5 files
- Maximum samples per voice: 5 samples total (across all uploads)
- Audio duration range: 1-60 seconds per sample
- Minimum duration: 1 second per sample
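The size and count limits above lend themselves to a pre-flight check. The following sketch takes the byte sizes of the files in one upload and returns a list of human-readable problems. It assumes "MB" means 1024 × 1024 bytes, which this page does not specify, and `check_upload_sizes` is an illustrative name. It does not cover the 5-samples-per-voice limit, which spans multiple uploads and cannot be verified from a single request.

```python
from typing import List

MB = 1024 * 1024                 # assumption: binary megabytes
MAX_FILE_BYTES = 10 * MB         # per individual file
MAX_TOTAL_BYTES = 25 * MB        # per request
MAX_FILES_PER_UPLOAD = 5


def check_upload_sizes(sizes: List[int]) -> List[str]:
    """Return a list of limit violations; an empty list means the upload fits."""
    problems = []

    if not sizes:
        problems.append("at least one audio file is required")
    if len(sizes) > MAX_FILES_PER_UPLOAD:
        problems.append(f"too many files: {len(sizes)} > {MAX_FILES_PER_UPLOAD}")

    for index, size in enumerate(sizes):
        if size > MAX_FILE_BYTES:
            problems.append(f"file {index} exceeds 10MB")

    if sum(sizes) > MAX_TOTAL_BYTES:
        problems.append("total upload exceeds 25MB")

    return problems
```

Collecting every violation, rather than stopping at the first, lets a caller report all problems with an upload in one pass.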
Response
HTTP status code (200 for success)
Array of error objects (null on success)
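A caller would typically treat a response as successful only when both conditions above hold. The sketch below assumes the parsed JSON body exposes these two values under the keys `"status"` and `"errors"`; those key names are guesses, since the field names are not shown on this page, and `clone_succeeded` is a hypothetical helper.

```python
from typing import Any, Dict


def clone_succeeded(response: Dict[str, Any]) -> bool:
    """Return True when the status is 200 and no errors are reported.

    The "status" and "errors" keys are assumed names for the two
    response fields described above.
    """
    return response.get("status") == 200 and not response.get("errors")
```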