Beginner3 min read
Lipsync to New Audio
Replace what someone is saying in a video — keep their face, swap the dialogue.
What you’ll build
Replace what someone is saying in a video — keep their face, swap the dialogue. Every step runs locally on your own GPU. No subscription. No per-render meter. The chain is yours, the file is yours, the render lives on your drive forever.
The chain
One pipeline node:
- 1LatentSyncLocal~20s – 45s
Syncs the on-screen mouth to the spoken audio.
Prerequisites
- An NVIDIA RTX GPU with 12 GB+ VRAM. 24 GB recommended for the full pipeline. Check system requirements →
- Hybrig desktop installed and your worker connected. Get the desktop app →
- Source video on hand. MP4, MOV, or WebM. Anything your camera shoots works.
Try it
The fastest way to learn this chain is to drop it onto the Studio canvas and run it on your own rig. The whole graph is pre-wired.
Watch
Walkthrough video coming
Video walkthrough coming. For now, run the workflow and watch the pipeline timeline — every stage exposes its intermediate artifact and a plain-language description of what that node did.