
Embedding Custom Phonemes in Async Voice API

1 Why provide custom phonemes?

Default grapheme‑to‑phoneme (G2P) conversion works well for most words, but brand names, acronyms, foreign words, and creative spellings can trip it up. The <phonemes> tag lets you override the pronunciation inline, so you keep the surrounding text natural while guaranteeing a correct reading of tricky words.

2 Tag syntax

| Element | Required | Description |
|---|---|---|
| <phonemes> | ✓ | Opening tag marking the start of the phonetic override. |
| …IPA string… | ✓ | A sequence of IPA symbols. Separate words with spaces. Use ˈ (primary stress) and ˌ (secondary stress) as needed. |
| </phonemes> | ✓ | Closing tag. |

Example
Welcome to <phonemes>ˈeɪ.sɪŋk</phonemes>.
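
When transcripts are assembled in code, a small helper keeps the opening and closing tags balanced. The following is a minimal sketch; the function name `with_phonemes` is hypothetical and not part of the Async API.

```python
def with_phonemes(ipa: str) -> str:
    """Wrap an IPA string in the <phonemes> tag described above."""
    ipa = ipa.strip()
    if not ipa:
        raise ValueError("IPA string must not be empty")
    return f"<phonemes>{ipa}</phonemes>"


# Rebuild the example sentence from this section.
transcript = f"Welcome to {with_phonemes('ˈeɪ.sɪŋk')}."
print(transcript)
```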


3 Supported symbol set

Supported:
• Standard IPA characters (e.g., ɹ, ʒ, ð, ŋ).
• Stress marks: ˈ (primary) and ˌ (secondary).
• Word separator: space ( ). Do not wrap the IPA string in slashes (/…/) or brackets ([…]).
Unsupported at present:
• SAMPA, ARPABET, or other non‑IPA alphabets.
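
The rules above can be checked before a request is sent. The sketch below is a client-side heuristic only, not the validation the service performs. It assumes that uppercase ASCII letters, digits, `@`, and braces indicate ARPABET or SAMPA input, since none of them are IPA symbols. The name `check_ipa` is hypothetical.

```python
import re

# Heuristic (assumption): ARPABET looks like "EY1 S IH0 NG K" and SAMPA uses
# characters such as "@" and "{"; IPA uses none of these.
_NON_IPA_HINT = re.compile(r"[A-Z0-9@{}]")


def check_ipa(ipa: str) -> list[str]:
    """Return a list of likely problems; an empty list means none were found."""
    problems = []
    if any(ch in "/[]" for ch in ipa):
        problems.append("remove slashes or brackets around the IPA string")
    if _NON_IPA_HINT.search(ipa):
        problems.append("looks like ARPABET or SAMPA rather than IPA")
    if "  " in ipa or ipa != ipa.strip():
        problems.append("separate words with single spaces only")
    return problems


print(check_ipa("ˈeɪ.sɪŋk"))        # no problems reported
print(check_ipa("/ˈeɪ.sɪŋk/"))      # flags the slashes
print(check_ipa("EY1 S IH0 NG K"))  # flags the ARPABET-style input
```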

4 Request format

POST /text_to_speech/streaming HTTP/1.1
Host: api.async.ai
Content-Type: application/json
X-Api-Key: <YOUR_API_KEY>

{
  "model_id": "asyncflow_v2.0",
  "transcript": "Welcome to <phonemes>ˈeɪ.sɪŋk</phonemes>. Here you can test our text-to-speech capabilities.",
  "voice": { "mode": "id", "id": "e0f39dc4-f691-4e78-bba5-5c636692cc04" },
  "output_format": {
    "container": "raw",
    "encoding": "pcm_s16le",
    "sample_rate": 44100
  }
}
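
The same request can be built from Python with the standard library. This is a sketch under stated assumptions: the `https` scheme is assumed (the request above only gives the host and path), the response body is assumed to be the raw PCM audio described by `output_format`, and `build_tts_request` is a hypothetical helper name.

```python
import json
import urllib.request

# Host and path come from the request above; the https scheme is an assumption.
API_URL = "https://api.async.ai/text_to_speech/streaming"


def build_tts_request(transcript: str, voice_id: str, api_key: str) -> urllib.request.Request:
    """Build (but do not send) the POST request shown above."""
    payload = {
        "model_id": "asyncflow_v2.0",
        "transcript": transcript,
        "voice": {"mode": "id", "id": voice_id},
        "output_format": {
            "container": "raw",
            "encoding": "pcm_s16le",
            "sample_rate": 44100,
        },
    }
    return urllib.request.Request(
        API_URL,
        # ensure_ascii=False keeps the IPA characters readable in the body.
        data=json.dumps(payload, ensure_ascii=False).encode("utf-8"),
        headers={"Content-Type": "application/json", "X-Api-Key": api_key},
        method="POST",
    )


request = build_tts_request(
    "Welcome to <phonemes>ˈeɪ.sɪŋk</phonemes>.",
    voice_id="e0f39dc4-f691-4e78-bba5-5c636692cc04",
    api_key="<YOUR_API_KEY>",
)

# Sending is commented out so the sketch runs without a network or a real key.
# with urllib.request.urlopen(request) as response, open("out.pcm", "wb") as f:
#     f.write(response.read())  # assumed: the body is raw PCM audio
```

Serializing with UTF-8 matters here because the transcript carries non-ASCII IPA symbols.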

Curl shortcut


5 Best practices

1. Keep tags short: one or two words. Large phoneme blocks reduce prosody quality.
2. Stress wisely: adding primary stress (ˈ) usually suffices.
3. Test incrementally: start with small examples and listen before batch‑generating.
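
The first recommendation is easy to check automatically before batch generation. The sketch below flags phoneme blocks with more than two words. The helper name `long_phoneme_blocks` and the IPA in the second block of the sample are illustrative only.

```python
import re

# Non-greedy match so each <phonemes>…</phonemes> block is captured separately.
PHONEMES_BLOCK = re.compile(r"<phonemes>(.*?)</phonemes>", re.DOTALL)


def long_phoneme_blocks(transcript: str, max_words: int = 2) -> list[str]:
    """Return the IPA strings whose word count exceeds max_words."""
    return [
        ipa.strip()
        for ipa in PHONEMES_BLOCK.findall(transcript)
        if len(ipa.split()) > max_words
    ]


sample = (
    "Welcome to <phonemes>ˈeɪ.sɪŋk</phonemes>. "
    "<phonemes>hɪɹ ju kæn tɛst</phonemes> our voices."
)
print(long_phoneme_blocks(sample))  # only the four-word block is reported
```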

6 IPA Symbol Reference (English)

IPA overrides also work with multilingual voices; the reference tables here use English as an example.
The tables below are a quick‑lookup reference for the standard IPA symbols you can embed inside <phonemes> when working with English voices in Async. Examples use General American pronunciations unless marked otherwise.

6.1 Consonants

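| IPA | Sample word | Full IPA |
|---|---|---|
| p | pin | pɪn |
| b | bin | bɪn |
| t | tin | tɪn |
| d | din | dɪn |
| k | cat | kæt |
| ɡ | go | ɡoʊ |
| m | man | mæn |
| n | no | noʊ |
| ŋ | sing | sɪŋ |
| f | fan | fæn |
| v | van | væn |
| θ | thin | θɪn |
| ð | this | ðɪs |
| s | see | siː |
| z | zoo | zuː |
| ʃ | she | ʃiː |
| ʒ | measure | ˈmɛʒɚ |
| h | he | hiː |
| tʃ | check | tʃɛk |
| dʒ | jump | dʒʌmp |
| ɹ | red | ɹɛd |
| l | let | lɛt |
| j | yes | jɛs |
| w | we | wiː |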

6.2 Vowels

6.2.1 Monophthongs

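| IPA | Sample word | Full IPA |
|---|---|---|
| i | see | siː |
| ɪ | sit | sɪt |
| e | say (Canadian/British) | seɪ |
| ɛ | set | sɛt |
| æ | cat | kæt |
| ʌ | cut | kʌt |
| ə | about | əˈbaʊt |
| ɜː | bird (British) | bɜːd |
| ɑ | father | ˈfɑðɚ |
| ɔ | talk | tɔk |
| ʊ | foot | fʊt |
| u | food | fuːd |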

6.2.2 Diphthongs

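| IPA | Sample word | Full IPA |
|---|---|---|
| eɪ | say | seɪ |
| aɪ | sigh | saɪ |
| ɔɪ | joy | dʒɔɪ |
| aʊ | sow | saʊ |
| oʊ | so | soʊ |
| ɪər | here (British) | hɪər |
| eər | bear (British) | beər |
| ʊər | tour (British) | tʊər |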

6.3 Suprasegmentals & Diacritics

| IPA | Meaning | Example | Notes |
|---|---|---|---|
| ˈ | Primary stress | banana → bəˈnænə | Place immediately before the stressed syllable. |
| ˌ | Secondary stress | transportation → ˌtɹænspɚˈteɪʃən | Optional. |
| ː | Length mark | see → siː | Indicates vowel length; appears in many vowels above. |

7 FAQ

Q  Can I use SAMPA or ARPABET?
A  Not yet. We currently accept only IPA symbols.
Q  Do I have to tag the whole sentence?
A  No. Tag only the parts you want to override; everything else is phonemized automatically.
Q  Do you support other languages?
A  Yes. Phoneme overrides are supported for all currently available languages (English, Spanish, French, German, Italian, Portuguese, Arabic, Russian, Romanian, Japanese, Hebrew, Armenian, Turkish, Hindi, Chinese). Use standard IPA symbols for the target language.
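
Because only the tagged parts are overridden, a pronunciation lexicon can be applied to plain text before it is sent. The following is a minimal sketch; `LEXICON` and `apply_lexicon` are hypothetical names, and the single entry reuses the IPA from this guide.

```python
import re

# Hypothetical lexicon mapping a written word to its IPA override.
LEXICON = {
    "Async": "ˈeɪ.sɪŋk",
}


def apply_lexicon(text: str, lexicon: dict[str, str]) -> str:
    """Wrap each whole-word lexicon match in <phonemes>; leave other text untouched."""
    if not lexicon:
        return text
    # Longest keys first so a longer entry wins over a shorter one it contains.
    keys = sorted(lexicon, key=len, reverse=True)
    pattern = re.compile(r"\b(" + "|".join(re.escape(k) for k in keys) + r")\b")
    return pattern.sub(lambda m: f"<phonemes>{lexicon[m.group(1)]}</phonemes>", text)


print(apply_lexicon("Welcome to Async. Asynchronous calls are unaffected.", LEXICON))
```

The word-boundary match keeps "Asynchronous" untouched, so it is still phonemized automatically.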

Modified at 2025-12-08 10:11:46