LTX-2 is an open source AI video model with 14B video and 15B audio parameters, giving you synced clips and local control.
To match the lip movements with speech, they designed a "learning pipeline" to collect visual data from lip movements. An AI model uses this data for training, then generates reference points for ...