POST /v1/voices
Create voice
curl --request POST \
  --url https://api.anam.ai/v1/voices \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: multipart/form-data' \
  --form 'name=<string>' \
  --form audioFile='@example-file' \
  --form 'description=<string>' \
  --form language=en \
  --form enhance=true
{
  "id": "de23e340-1416-4dd8-977d-065a7ca11697",
  "displayName": "Lucy - Fresh & Casual",
  "provider": "ELEVENLABS",
  "providerVoiceId": "lcMyyd2HUfFzxdCaC4Ta",
  "providerModelId": "eleven_flash_v2_5",
  "sampleUrl": "https://newgxnc1uqs0jnqm.public.blob.vercel-storage.com/voice-samples/de23e340-1416-4dd8-977d-065a7ca11697/1760617899390.mp3",
  "previewSampleUrl": "https://newgxnc1uqs0jnqm.public.blob.vercel-storage.com/voice-samples/de23e340-1416-4dd8-977d-065a7ca11697/1760617899390.mp3",
  "gender": "FEMALE",
  "country": "GB",
  "description": "Energetic and youthful British voice, perfect for narrations and conversational agents.",
  "displayTags": [
    "fast"
  ],
  "isZdr": true,
  "createdByOrganizationId": null,
  "createdAt": "2026-04-20T10:00:00.000Z",
  "updatedAt": "2026-04-20T10:00:00.000Z"
}

Authorizations

Authorization
string
header
required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
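As a minimal sketch of the header format described above, the helper below builds the `Authorization` header from a token. The function names and the `ANAM_API_KEY` environment variable are assumptions made for illustration; they are not defined by the API.

```python
import os
from typing import Dict


def token_from_env(var_name: str = "ANAM_API_KEY") -> str:
    """Read the auth token from an environment variable.

    The variable name is a hypothetical choice, not part of the API.
    """
    return os.environ.get(var_name, "")


def auth_headers(token: str) -> Dict[str, str]:
    """Return the header in the documented form: Bearer <token>."""
    if not token or token.strip() != token:
        raise ValueError("token must be non-empty with no surrounding whitespace")
    return {"Authorization": f"Bearer {token}"}
```

Reading the token from the environment keeps it out of source control. Any other secret store would work equally well.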

Body

multipart/form-data

Audio sample and metadata used to clone a new voice.

name
string
required

Display name for the cloned voice

audioFile
file
required

Audio file to clone the voice from

description
string

Optional description of the voice

language
string
default: en

Language code for the voice

enhance
boolean
default: true

Whether to enhance the voice quality
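The body fields above can be assembled into a `multipart/form-data` request using only the Python standard library. This is a sketch under stated assumptions: `encode_multipart` and `build_create_voice_request` are hypothetical helper names, the booleans are sent as lowercase `true`/`false` to mirror the curl sample, and the accepted audio formats are not stated on this page, so the file's content type is guessed from its name.

```python
import mimetypes
import urllib.request
import uuid
from typing import Dict, Optional, Tuple

API_URL = "https://api.anam.ai/v1/voices"


def encode_multipart(fields: Dict[str, str], filename: str, file_bytes: bytes,
                     boundary: Optional[str] = None) -> Tuple[bytes, str]:
    """Encode text fields plus one audioFile part; return (body, content_type)."""
    boundary = boundary or uuid.uuid4().hex
    chunks = []
    for key, value in fields.items():
        chunks.append(
            (f"--{boundary}\r\n"
             f'Content-Disposition: form-data; name="{key}"\r\n\r\n'
             f"{value}\r\n").encode("utf-8")
        )
    # Guess the audio content type from the file name; fall back to a generic type.
    file_type = mimetypes.guess_type(filename)[0] or "application/octet-stream"
    chunks.append(
        (f"--{boundary}\r\n"
         f'Content-Disposition: form-data; name="audioFile"; filename="{filename}"\r\n'
         f"Content-Type: {file_type}\r\n\r\n").encode("utf-8")
    )
    chunks.append(file_bytes)
    chunks.append(f"\r\n--{boundary}--\r\n".encode("utf-8"))
    return b"".join(chunks), f"multipart/form-data; boundary={boundary}"


def build_create_voice_request(token: str, name: str, filename: str,
                               audio_bytes: bytes,
                               description: Optional[str] = None,
                               language: str = "en",
                               enhance: bool = True) -> urllib.request.Request:
    """Build (but do not send) the POST request for creating a voice."""
    fields = {
        "name": name,
        "language": language,
        "enhance": "true" if enhance else "false",
    }
    if description is not None:  # description is optional, so omit it when unset
        fields["description"] = description
    body, content_type = encode_multipart(fields, filename, audio_bytes)
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={"Authorization": f"Bearer {token}", "Content-Type": content_type},
    )
```

To send the request, pass the returned object to `urllib.request.urlopen` and decode the JSON response body. The `Content-Type` header carries the generated boundary, which the server needs to split the parts.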

Response

Successfully created voice clone

A voice preset a persona can use for text-to-speech.

id
string<uuid>

Unique identifier for the voice.

displayName
string

Human-readable name shown in the Lab.

provider
enum<string>

Upstream TTS provider for this voice.

Available options: CARTESIA, ELEVENLABS, OPENAI_ADVANCED_VOICE

providerVoiceId
string | null

The upstream provider's identifier for the voice.

providerModelId
string | null

The upstream provider's model identifier used to generate speech.

sampleUrl
string<uri> | null

URL of a short audio preview of the voice.

previewSampleUrl
string<uri> | null

Alias for sampleUrl, kept for backwards compatibility.

gender
enum<string> | null

Perceived gender of the voice, if categorised.

Available options: MALE, FEMALE, NEUTRAL, null

country
string | null

ISO 3166-1 alpha-2 country code representing the voice's accent.

description
string | null

Free-form description of the voice's character.

displayTags
string[]

Tags used to categorise the voice in the Lab UI.

isZdr
boolean

Whether this voice meets the Zero Data Retention requirements.

createdByOrganizationId
string | null

ID of the organization that created the voice, or null for stock voices. IDs may be either UUIDs or nanoid-style strings depending on when the organization was created.

createdAt
string<date-time>

Timestamp when the voice was created.

updatedAt
string<date-time>

Timestamp when the voice was last updated.
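The response fields above map naturally onto a small typed record. The sketch below is illustrative only: the `Voice` dataclass and `parse_voice` function are hypothetical names, and only a subset of fields is modelled. It treats the nullable fields as optional and falls back to `previewSampleUrl` when `sampleUrl` is absent, since the former is documented as an alias.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import Any, Dict, List, Optional

KNOWN_PROVIDERS = {"CARTESIA", "ELEVENLABS", "OPENAI_ADVANCED_VOICE"}


@dataclass
class Voice:
    id: str
    display_name: str
    provider: str
    provider_voice_id: Optional[str]
    sample_url: Optional[str]
    gender: Optional[str]
    country: Optional[str]
    display_tags: List[str]
    is_zdr: bool
    created_by_organization_id: Optional[str]
    created_at: datetime
    updated_at: datetime


def parse_timestamp(value: str) -> datetime:
    """Parse a date-time string; older Python versions reject a trailing 'Z'."""
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


def parse_voice(payload: Dict[str, Any]) -> Voice:
    """Convert the decoded JSON response into a Voice record."""
    return Voice(
        id=payload["id"],
        display_name=payload["displayName"],
        provider=payload["provider"],
        provider_voice_id=payload.get("providerVoiceId"),
        # previewSampleUrl is an alias of sampleUrl, so use it as a fallback.
        sample_url=payload.get("sampleUrl") or payload.get("previewSampleUrl"),
        gender=payload.get("gender"),
        country=payload.get("country"),
        display_tags=list(payload.get("displayTags") or []),
        is_zdr=bool(payload["isZdr"]),
        # Kept as a plain string: IDs may be UUIDs or nanoid-style strings.
        created_by_organization_id=payload.get("createdByOrganizationId"),
        created_at=parse_timestamp(payload["createdAt"]),
        updated_at=parse_timestamp(payload["updatedAt"]),
    )
```

Checking `voice.provider in KNOWN_PROVIDERS` lets a client detect a provider value it does not recognise without failing the whole parse.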

Last modified on April 22, 2026