LTX-2 is an open-source AI video model with a 14B-parameter video stream and a 5B-parameter audio stream, giving you synchronized audio-video clips and local control.
To match lip movements with speech, they designed a "learning pipeline" that collects visual data from lip movements. An AI model is trained on this data, then generates reference points for ...