MLP Singer

[Priority Research Team Hee-Jo Yoo] TTS (text-to-speech) is a technology that converts text into a voice of a specific voice when inputting arbitrary text. After Google announced the Tacotron series, it quickly switched from HMM (hidden Markov model)-based to deep learning-based, and is now commercially…

SpeechTrend MLP Singer

Can Machines Think? Emotionally

[Service Development Team, Eunji Kwon] When I was a child, when I was drawing imagination, robots in outer space were a favorite material. Looking back, from cartoons (Galaxy Railroad 999) to captains of artificial intelligence computers that move trains to recently released Humanoid movies, artificial intelligence is an important part of the media...

SpeechTrend MLP Singer

LaMDA-Google's Conversational Language Model

[Service Development Team Kim Byung-in] At Google I/O 2021, an event that showcases the latest Google technologies, Android, Web, artificial intelligence, Chrome, and other technologies, services, and platform services were released. Among the many technologies, the hottest topic is LaMDA (Google's language…

VisualSpeechCode MLP Singer

Wav2Lip: generate lip motion from voice

LipGAN is a technology that generates the shape of the lips of a face image using a voice signal, and when it is actually applied to a video, it was somewhat disappointing in terms of visual artifacts and the naturalness of movement. To improve this, the discriminator is not a single frame, but a plurality of consecutive…