August 5, 2020

Aug52020

Wav2Vec 2.0 Revealed-Create ASR with 10 Minute Voice

After performing representation training with 53,000 hours of label-free data, a pre-trained model for Facebook's wav2vec 2.0, which became a hot topic because it created a speech recognizer with only 10 minutes of labeled data, was released. No fine-tuning in the representation model,...

Aug52020

MIT DriveSeg-data for road situation awareness research

Visual, Interaction, Data

It is a dataset DriveSeg created for research on road situation awareness (used for self-driving cars, etc.). For each frame of the video, the entire image is pixel-by-pixel semantic labeling. Label is “vehicle, pedestrian, road, sidewalk, bicycle, motorcycle, building,...

Aug52020

Trend

Introduction of autonomous vehicle technology and social consensus

Trend

Although it is a little leap forward, if I see that the addition of physical devices to the AI algorithm is an intelligent robot, I thought that the intelligent robot that will be most popular in the future may be an autonomous vehicle. I got a little curious, so I surveyed on self-driving cars...