VisualSpeechCode VIBE: predicting 3D human model parameters from images

Wav2Lip: generate lip motion from voice

LipGAN is a technology that generates the shape of the lips of a face image using a voice signal, and when it is actually applied to a video, it was somewhat disappointing in terms of visual artifacts and the naturalness of movement. To improve this, the discriminator is not a single frame, but a plurality of consecutive…