session_id you must pass to the Advanced Lip Sync endpoint.
Workflow overview
session_id binds the video and its face analysis together. Without it, lip sync generation cannot proceed.
Video requirements
For best results:- The video must contain at least one clearly visible human face facing the camera
- Use a video where the face is well-lit and not heavily occluded
- The video must be accessible via a public URL, or use a
video_idfrom a previous Kling generation result
Quick example
Parameters
| Parameter | Type | Required | Description |
|---|---|---|---|
video_url | string | One of | Public URL of the source video |
video_id | string | One of | ID of a video generated by Kling |
Response
This endpoint is synchronous — the result is returned immediately without polling.| Field | Description |
|---|---|
data.session_id | Pass this to the lip sync endpoint |
data.face_list[] | All detected faces in the video |
face_list[].face_id | Identifier for each individual face |
face_list[].face_rect | Bounding box {x, y, width, height} in pixels |
face_list will include one entry per detected face.
Next step: Advanced Lip Sync
Use the session_id to generate a lip-synced video.
API Reference
View the interactive API playground for Kling Identify Face.