Code

Nov62020

VIBE: predicting 3D human model parameters from images

In order to implement the visualization part of Human-Like AI, it is necessary to think about how to create and move a 3D human model. There are various existing approaches, but one of them is from Max Planck ETH Center to CVPR 2020...

Nov42020

Multilingual translation model and language model trained with data from over 100 countries

Interaction, Code

Many attempts are being made to expand the language model and translation model, which were previously studied mainly in English, into multiple languages. Google's mT5 is a study that extends the existing T5 (text-to-text transfer transformer) into a multilingual corpus, including a total of 101 languages...

Oct142020

Visual Code

MEAL v2: Achieving ImageNet Top-1 80% with ResNet-50

Visual, Code

Combining multiple network models into an ensemble increases performance, but it is a reality that there are many difficulties when applied in practice because the total network size and inference time also increase. Multi-model Ensemble via Adversarial Learning (MEAL) solves this...

Oct82020

Visual Code

Scalable character animation based on reinforcement learning

Visual, Code

It's natural to see virtual characters and moves reasonably in terms of the laws of physics, that is, human-like, which has been a subject of long-standing research in the field of gaming as well as computer graphics. Facebook Jungdam Won's project as the first author, “A Scalable…

Oct72020

Visual Speech Code

Wav2Lip: generate lip motion from voice

Visual, Speech, Code

LipGAN is a technology that generates the shape of the lips of a face image using a voice signal, and when it is actually applied to a video, it was somewhat disappointing in terms of visual artifacts and the naturalness of movement. To improve this, the discriminator is not a single frame, but a plurality of consecutive…

Oct52020

Interaction Code

NLP acceleration with HuggingFace and ONNX Runtime

Interaction, Code

The performance improvement shown by Transformer-based language models is surprising, but as the model size increases exponentially, concerns about service costs are also becoming important. Bert-base or GPT-2 has about 100 million parameters, so the model size, memory bandwidth,...

Sep232020

Interaction Trend Code

GPT-3 examples and minGPT project

Interaction, Trend, Code

Scatterlab (https://scatterlab.co.kr/), which stands out in everyday conversational research, is an article on the Ping-Pong team blog. I still see GPT-3 as a'eye of doubt', but it's curious when I see it again...

Sep212020

Visual Code

Generation code of Lee Malnyun webtoon style faces

Visual, Code

bryandlee's github has the results of image translation application using deep generative model and related research made into a webcomic in the late years of calm man. The title of the study is also “Chilled Generative Model Learner”. I like this wit! Looking at the process, webtoon…

Sep182020

Interaction Code

Facebook TransCoder: Unsupervised Programming Language Translator

Interaction, Code

There have been many attempts to convert code written in one programming language into another, and there are many types of commercial tools. The main purpose of use is to ensure compatibility, for example FORTRAN or BASIC, or...