Create an asynchronous lip sync video generation task. The facial movements in the source video are re-animated to match the provided speech.
Prerequisite: call the identify-face endpoint first to obtain a session_id.
Two input modes are supported:
text, voice_id, and voice_language. The platform converts the text to speech using the specified voice and drives the lip movements.audio_url. The lip movements are driven directly by the supplied audio file.After submission, poll GET /kling/v1/videos/advanced-lip-sync/{task_id} until status is succeeded.
voice_id parameter.Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Lip sync task created
Task ID
"task_abc123"
Task ID (same as id)
"task_abc123"
"video"
"kling-lip-sync"
Initially empty; poll the query endpoint for updates.
""
Progress 0–100
0
Unix timestamp
1773812605