New GAN Can Lipread and Synthesize Speech

Originally published at: New GAN Can Lipread and Synthesize Speech | NVIDIA Technical Blog

Audio-based speech recognition models often perform poorly in noisy environments. To help solve the problem, researchers from Samsung and Imperial College London developed a deep learning solution that uses computer vision for visual speech recognition. The model is capable of lipreading, as well as synthesizing audio from the lip movements it sees in the video. …