Create voice

curl --request POST \
  --url https://api.anam.ai/v1/voices \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: multipart/form-data' \
  --form 'name=<string>' \
  --form audioFile='@example-file' \
  --form 'description=<string>' \
  --form language=en \
  --form enhance=true

{
  "id": "de23e340-1416-4dd8-977d-065a7ca11697",
  "displayName": "Lucy - Fresh & Casual",
  "provider": "ELEVENLABS",
  "providerVoiceId": "lcMyyd2HUfFzxdCaC4Ta",
  "providerModelId": "eleven_flash_v2_5",
  "sampleUrl": "https://newgxnc1uqs0jnqm.public.blob.vercel-storage.com/voice-samples/de23e340-1416-4dd8-977d-065a7ca11697/1760617899390.mp3",
  "previewSampleUrl": "https://newgxnc1uqs0jnqm.public.blob.vercel-storage.com/voice-samples/de23e340-1416-4dd8-977d-065a7ca11697/1760617899390.mp3",
  "gender": "FEMALE",
  "country": "GB",
  "description": "Energetic and youthful British voice, perfect for narrations and conversational agents.",
  "displayTags": [
    "fast"
  ],
  "isZdr": true,
  "createdByOrganizationId": null,
  "createdAt": "2026-04-20T10:00:00.000Z",
  "updatedAt": "2026-04-20T10:00:00.000Z"
}

Authorizations

Authorization

string

header

required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Body

multipart/form-data

Audio sample and metadata used to clone a new voice.

name

string

required

Display name for the cloned voice

audioFile

file

required

Audio file to clone the voice from

description

string

Optional description of the voice

language

string

default: en

Language code for the voice

enhance

boolean

default: true

Whether to enhance the voice quality
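
The body fields above can also be assembled without curl. The following is a minimal Python sketch using only the standard library; it builds the request but does not send it. The helper names `encode_multipart` and `build_create_voice_request` are hypothetical and not part of any Anam SDK, and sending booleans as the strings `true`/`false` is an assumption based on the curl example.

```python
import mimetypes
import urllib.request
import uuid

API_URL = "https://api.anam.ai/v1/voices"  # endpoint taken from the curl example


def encode_multipart(fields, file_field, filename, file_bytes):
    """Encode text fields plus one file as a multipart/form-data body."""
    boundary = uuid.uuid4().hex
    content_type = mimetypes.guess_type(filename)[0] or "application/octet-stream"
    parts = []
    for key, value in fields.items():
        parts.append(
            f'--{boundary}\r\n'
            f'Content-Disposition: form-data; name="{key}"\r\n\r\n'
            f'{value}\r\n'.encode("utf-8")
        )
    # The file part carries a filename and its own Content-Type.
    parts.append(
        f'--{boundary}\r\n'
        f'Content-Disposition: form-data; name="{file_field}"; filename="{filename}"\r\n'
        f'Content-Type: {content_type}\r\n\r\n'.encode("utf-8")
    )
    parts.append(file_bytes)
    parts.append(f'\r\n--{boundary}--\r\n'.encode("utf-8"))
    return b"".join(parts), boundary


def build_create_voice_request(token, name, filename, audio_bytes,
                               description=None, language="en", enhance=True):
    """Build (but do not send) the POST request for creating a voice."""
    fields = {
        "name": name,
        "language": language,
        # Assumption: booleans are sent as "true"/"false", as in the curl example.
        "enhance": "true" if enhance else "false",
    }
    if description is not None:  # description is optional, so omit it when unset
        fields["description"] = description
    body, boundary = encode_multipart(fields, "audioFile", filename, audio_bytes)
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
    )
```

To send it, pass the returned object to `urllib.request.urlopen` and decode the JSON response. The boundary is generated per request and must appear in the `Content-Type` header, which is why the header is built here rather than hard-coded.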

Response

Successfully created voice clone

A voice preset a persona can use for text-to-speech.

id

string<uuid>

Unique identifier for the voice.

displayName

string

Human-readable name shown in the Lab.

provider

enum<string>

Upstream TTS provider for this voice.

Available options: CARTESIA, ELEVENLABS, OPENAI_ADVANCED_VOICE, FISH_AUDIO

providerVoiceId

string | null

The upstream provider's identifier for the voice.

providerModelId

string | null

The upstream provider's model identifier used to generate speech.

sampleUrl

string<uri> | null

URL of a short audio preview of the voice.

previewSampleUrl

string<uri> | null

Alias for sampleUrl, kept for backwards compatibility.

gender

enum<string> | null

Perceived gender of the voice, if categorised.

Available options: MALE, FEMALE, NEUTRAL, null

country

string | null

ISO 3166-1 alpha-2 country code representing the voice's accent.

description

string | null

Free-form description of the voice's character.

displayTags

string[]

Tags used to categorise the voice in the Lab UI.

isZdr

boolean

Whether this voice meets the Zero Data Retention requirements.

createdByOrganizationId

string | null

ID of the organization that created the voice, or null for stock voices. IDs may be either UUIDs or nanoid-style strings depending on when the organization was created.

createdAt

string<date-time>

Timestamp when the voice was created.

updatedAt

string<date-time>

Timestamp when the voice was last updated.
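
The response fields above could be mapped onto a typed object on the client side. The following Python sketch shows one way to do that; the `Voice` dataclass, `parse_voice`, and `parse_timestamp` are hypothetical names, not part of any official client. It treats `createdByOrganizationId` as an opaque string, since IDs may be UUIDs or nanoid-style, and prefers `sampleUrl` over its alias `previewSampleUrl`.

```python
from dataclasses import dataclass
from datetime import datetime
from typing import List, Optional

# Enum values copied from the field descriptions above.
PROVIDERS = {"CARTESIA", "ELEVENLABS", "OPENAI_ADVANCED_VOICE", "FISH_AUDIO"}
GENDERS = {"MALE", "FEMALE", "NEUTRAL", None}


def parse_timestamp(value: str) -> datetime:
    # Replace the trailing "Z" so fromisoformat also works on Python < 3.11.
    return datetime.fromisoformat(value.replace("Z", "+00:00"))


@dataclass
class Voice:
    id: str
    display_name: str
    provider: str
    provider_voice_id: Optional[str]
    provider_model_id: Optional[str]
    sample_url: Optional[str]
    gender: Optional[str]
    country: Optional[str]
    description: Optional[str]
    display_tags: List[str]
    is_zdr: bool
    created_by_organization_id: Optional[str]
    created_at: datetime
    updated_at: datetime

    @property
    def is_stock(self) -> bool:
        # Stock voices have no creating organization.
        return self.created_by_organization_id is None


def parse_voice(payload: dict) -> Voice:
    """Convert a decoded JSON response into a Voice, validating enum fields."""
    if payload["provider"] not in PROVIDERS:
        raise ValueError(f"unknown provider: {payload['provider']!r}")
    gender = payload.get("gender")
    if gender not in GENDERS:
        raise ValueError(f"unknown gender: {gender!r}")
    return Voice(
        id=payload["id"],
        display_name=payload["displayName"],
        provider=payload["provider"],
        provider_voice_id=payload.get("providerVoiceId"),
        provider_model_id=payload.get("providerModelId"),
        # previewSampleUrl is an alias, so fall back to it only if needed.
        sample_url=payload.get("sampleUrl") or payload.get("previewSampleUrl"),
        gender=gender,
        country=payload.get("country"),
        description=payload.get("description"),
        display_tags=list(payload.get("displayTags", [])),
        is_zdr=payload["isZdr"],
        created_by_organization_id=payload.get("createdByOrganizationId"),
        created_at=parse_timestamp(payload["createdAt"]),
        updated_at=parse_timestamp(payload["updatedAt"]),
    )
```

Rejecting unknown enum values is a deliberate choice in this sketch; a client that must tolerate providers added later may prefer to pass unrecognised values through unchanged.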