Complete Guide to Hokkien TTS - Bridging Voices Across the Strait

Master Hokkien/Taiwanese text-to-speech technology, from linguistic features to API integration.

XiangYinGe Team

1/20/2025 · 5 min read

What is Hokkien?

Hokkien (also known as Min Nan, Fujian dialect, or Taiwanese) belongs to the Min group, one of the seven major Chinese dialect groups. It is spoken primarily in southern Fujian province, Taiwan, and eastern Guangdong, and throughout Chinese communities in Southeast Asia. With approximately 50 million native speakers worldwide, Hokkien is among the most widely spoken Chinese varieties in overseas Chinese communities.

Cultural Significance of Hokkien

Hokkien carries a rich cultural heritage:

  • Gezai Opera: Taiwan's most representative traditional opera form
  • Nanyin Music: Ancient musical tradition known as the "living fossil of Chinese music"
  • Hokkien Pop Songs: A vibrant music culture spanning both sides of the Taiwan Strait
  • Ancestral Hall Culture: Essential for ancestor worship and preserving family heritage

Phonetic Features of Hokkien

Tone System

Hokkien has a complex tonal system. Most regions are described as having 8 tones, while some have 7. The table below lists seven tone categories with typical pitch patterns:

Tone Category    Pitch Pattern    Description
Yin Ping         55               High level
Yin Shang        51               High falling
Yin Qu           21               Low falling
Yin Ru           3                Short low
Yang Ping        24               Mid rising
Yang Qu          33               Mid level
Yang Ru          5                Short high

Literary vs Colloquial Readings

One of Hokkien's most distinctive features is the split between literary and colloquial readings: the same character can be pronounced differently depending on context.

  • Literary reading: Derived from ancient written language, used in formal contexts
  • Colloquial reading: Everyday spoken pronunciation, more natural for daily conversation

For example, the character "學" (learn):

  • Literary: ha̍k (as in school, study)
  • Colloquial: o̍h (to learn, to imitate)

Tone Sandhi

Hokkien has complex but regular tone sandhi rules. Within a phrase, every syllable except the last one changes tone; the last syllable keeps its original (citation) tone.

XiangYinGe's AI model has built-in tone sandhi rules. Simply input your text, and the system handles the tone changes automatically.
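To make the "all but the last syllable" rule concrete, here is a minimal sketch using the conventional tone numbers (1 Yin Ping, 2 Yin Shang, 3 Yin Qu, 4 Yin Ru, 5 Yang Ping, 7 Yang Qu, 8 Yang Ru). The mapping is a commonly taught simplification for Taiwanese Hokkien and is an assumption here, not XiangYinGe's internal rule set. Real accents differ, for example in how Yang Ping and the checked tones change.

```python
# Simplified citation-tone -> sandhi-tone mapping (an assumption for illustration;
# regional accents and syllables ending in a glottal stop follow other rules).
SANDHI = {1: 7, 2: 1, 3: 2, 4: 8, 5: 7, 7: 3, 8: 4}


def apply_sandhi(tones):
    """Change every tone in a phrase except the last one."""
    if not tones:
        return []
    # All non-final syllables take their sandhi tone; the final one is kept.
    return [SANDHI[t] for t in tones[:-1]] + [tones[-1]]


print(apply_sandhi([1, 2, 5]))  # the final tone 5 is left unchanged
```

You do not need to run anything like this before calling the API; it only illustrates what the model does for you.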

Nasalized Vowels

Hokkien preserves many nasalized vowels, a distinctive feature that sets it apart from Mandarin. These nasal sounds give Hokkien its characteristic soft, melodic quality.

Supported Hokkien Varieties

XiangYinGe supports multiple Hokkien varieties to serve users from different regions:

Xiamen Dialect

  • The standard reference for Hokkien phonology
  • Clear pronunciation with systematic tone patterns
  • Ideal for formal content and educational materials

Quanzhou Dialect

  • Preserves more archaic phonetic features
  • Rich tonal variations
  • The standard pronunciation for Nanyin music

Zhangzhou Dialect

  • Smooth, gentle intonation
  • Strong colloquial characteristics
  • Suitable for casual conversational content

Taiwanese Hokkien

  • A blend of Quanzhou and Zhangzhou accents
  • Incorporates Japanese loanwords
  • The most widely spoken local language in Taiwan besides Mandarin

Quick Start

API Example

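import requests

API_KEY = "your_api_key_here"
API_URL = "https://api.xiangyinge.com/v1/tts"

# Request body: the text to synthesize plus dialect, voice, and speech rate
data = {
    "text": "你好,歡迎來到乡音阁!",
    "dialect": "nan",
    "voice": "nan_female_xiamen",
    "speed": 1.0
}

headers = {
    "Authorization": f"Bearer {API_KEY}",
    "Content-Type": "application/json"
}

# A timeout keeps the script from hanging if the service is unreachable
response = requests.post(API_URL, json=data, headers=headers, timeout=30)

if response.status_code == 200:
    # The response body is written out as the audio file
    with open("output.mp3", "wb") as f:
        f.write(response.content)
    print("Hokkien audio generated successfully!")
else:
    # Report the failure instead of exiting silently
    print(f"Request failed ({response.status_code}): {response.text}")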

Parameter Reference

Parameter    Description         Options
dialect      Dialect type        nan (Hokkien)
voice        Voice selection     nan_female_xiamen, nan_male_taiwan, nan_female_quanzhou
speed        Speech rate         0.5 - 2.0 (default 1.0)
pitch        Pitch adjustment    0.5 - 2.0 (default 1.0)
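Checking options on the client side can catch mistakes before a request is sent. The sketch below builds a request body from the parameters in the table and rejects out-of-range values. The helper name `build_tts_payload` and the empty-text check are assumptions for illustration; how the service itself responds to invalid values is not documented here.

```python
# Voices and ranges copied from the parameter table above.
VOICES = {"nan_female_xiamen", "nan_male_taiwan", "nan_female_quanzhou"}


def build_tts_payload(text, voice="nan_female_xiamen", speed=1.0, pitch=1.0):
    """Return a request body, raising ValueError for invalid options.

    Hypothetical helper: it is not part of any XiangYinGe SDK.
    """
    if not text.strip():
        raise ValueError("text must not be empty")
    if voice not in VOICES:
        raise ValueError(f"unknown voice: {voice}")
    for name, value in (("speed", speed), ("pitch", pitch)):
        if not 0.5 <= value <= 2.0:
            raise ValueError(f"{name} must be between 0.5 and 2.0, got {value}")
    return {
        "text": text,
        "dialect": "nan",
        "voice": voice,
        "speed": speed,
        "pitch": pitch,
    }


payload = build_tts_payload("你好", voice="nan_male_taiwan", speed=1.2)
```

The resulting dictionary can be passed as the `json=` argument of `requests.post`, as in the API example.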

Voice Selection Tips

  • News/Formal content: Choose Xiamen dialect for clear, standard pronunciation
  • Entertainment content: Choose Taiwanese for a lively, natural feel
  • Cultural documentaries: Choose Quanzhou dialect for an authentic, classical tone
  • Daily life content: Match the voice to your target audience's region
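If you generate audio for several kinds of content, the tips above can be encoded as a small lookup. The content-type keys and the `pick_voice` helper are hypothetical names chosen for this sketch; only the voice identifiers come from the parameter reference.

```python
# Content type -> voice, following the selection tips above.
VOICE_BY_CONTENT = {
    "news": "nan_female_xiamen",            # clear, standard pronunciation
    "entertainment": "nan_male_taiwan",     # lively, natural feel
    "documentary": "nan_female_quanzhou",   # classical tone
}


def pick_voice(content_type, default="nan_female_xiamen"):
    """Return the suggested voice, or `default` for unlisted content types."""
    return VOICE_BY_CONTENT.get(content_type.lower(), default)


print(pick_voice("documentary"))
```

For daily life content, pass the voice that matches your audience's region as `default`.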

Typical Use Cases

Taiwanese Variety Show Dubbing

Taiwanese is the soul of Taiwan's entertainment industry. Use Hokkien TTS to:

  • Generate show narrations quickly
  • Create comedic dubbing videos
  • Develop Taiwanese language learning content

Hokkien Music Production

Support your music creation workflow:

  • Pronunciation reference for lyrics
  • Demo production assistance
  • Singing tutorial materials

Ancestral Hall Cultural Videos

Help document and preserve family heritage:

  • Genealogy narration
  • Ceremonial explanations
  • Family story recordings

Overseas Chinese Nostalgic Content

Connect diaspora communities with their roots:

  • Hometown dialect podcasts
  • Regional introduction videos
  • Festival greeting content

Best Practices

Literary vs Colloquial Reading Selection

The system automatically favors literary readings for written or technical content and colloquial readings for casual speech. Add pronunciation hints in your text if you need specific control.

Number and Date Pronunciation

Numbers in Hokkien follow specific patterns:

  • Phone numbers: Read digit by digit
  • Prices: Use colloquial expressions
  • Dates: Each component (year/month/day) has specific conventions
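If you want to be explicit about digit-by-digit reading for phone numbers, one option is to spell out long digit runs before sending the text. This is a preprocessing sketch under two assumptions: that the service reads numeral characters one at a time, and that a run of seven or more digits is a phone number. The service may already handle this on its own.

```python
import re

# One numeral character per digit; swap the first character if your
# target variety uses a different word for zero.
DIGITS = "零一二三四五六七八九"


def spell_out_digits(text, min_len=7):
    """Replace runs of at least `min_len` ASCII digits with numeral characters."""
    pattern = re.compile(r"[0-9]{%d,}" % min_len)
    return pattern.sub(
        lambda m: "".join(DIGITS[int(d)] for d in m.group()), text
    )


print(spell_out_digits("電話0912345678"))
```

Shorter numbers such as prices and quantities are left untouched, so the system can still apply its own conventions to them.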

Handling Loanwords

Hokkien has absorbed many loanwords, especially Japanese terms in Taiwanese:

  • Common loanwords are built into the system
  • For unfamiliar terms, use phonetic notation or transliteration

Frequently Asked Questions

Should I choose Xiamen dialect or Taiwanese?

  • Mainland China audience: Choose Xiamen or Quanzhou dialect
  • Taiwan audience: Choose Taiwanese
  • Global Chinese audience: Taiwanese has broader recognition

Do I need to manually mark tone sandhi?

No. XiangYinGe's AI model has learned Hokkien's tone sandhi rules and handles them automatically. If you need to preserve original tones in specific cases, add punctuation between words.

How do I handle mixed Mandarin and Hokkien content?

We recommend processing pure Mandarin and Hokkien segments separately, then combining the generated audio. This ensures optimal pronunciation quality for each language.
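The split-then-combine workflow can be sketched as follows. Segments are labeled by hand, adjacent segments in the same language are merged, and each run is synthesized separately. The `synthesize` callable stands in for a request like the one in the API example, and the `"cmn"` code for Mandarin is a placeholder assumption; only `nan` appears in the parameter reference.

```python
from itertools import groupby


def synthesize_mixed(segments, synthesize):
    """Synthesize each run of same-language text separately.

    segments:   list of (dialect, text) pairs in reading order
    synthesize: callable (dialect, text) -> audio bytes (supplied by you)
    Returns a list of audio clips, one per run, in reading order.
    """
    clips = []
    # groupby merges only adjacent segments, so the reading order is preserved.
    for dialect, run in groupby(segments, key=lambda seg: seg[0]):
        text = "".join(part for _, part in run)
        clips.append(synthesize(dialect, text))
    return clips


# Stand-in synthesizer so the sketch runs without network access.
def fake_synthesize(dialect, text):
    return f"{dialect}:{text}".encode("utf-8")


segments = [("nan", "食飽未?"), ("nan", "來坐。"), ("cmn", "谢谢大家。")]
clips = synthesize_mixed(segments, fake_synthesize)
print(len(clips))
```

Joining the resulting clips into one file is best done with an audio tool, so that encoding and pauses between clips are handled properly.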

Next Steps

Ready to start creating Hokkien content?

For questions, contact us at: hello@xiangyinge.com