CoVoST is a large-scale multilingual speech-to-text translation corpus based on the Common Voice project. It provides translations from English into 15 languages and from 21 languages into English, with a total of 78K speakers and 2,880 hours of speech. The data is available under a CC0 license.
DensePose, dense human pose estimation, is designed to map all human pixels of an RGB image to a 3D surface-based representation of the human body.
Detectron2 was built by Facebook AI Research (FAIR) to support rapid implementation and evaluation of novel computer vision research.
ELF OpenGo is an AI bot from Facebook AI Research (FAIR) that has defeated world champion professional Go players.
Fairseq is a sequence modeling toolkit for training custom models for translation, summarization, and other text generation tasks.
A lightweight library designed to help build scalable solutions for text representation and classification.
The Hateful Memes Challenge and Dataset is a competition and open source dataset designed to measure progress in multimodal vision-and-language classification.
House3D is a rich environment containing thousands of human-designed 3D scenes of visually realistic houses with fully labeled 3D objects, textures, and scene layouts. These virtual environments can be used to support novel research in deep reinforcement learning.
KILT is a resource for training, evaluating and analyzing NLP models on Knowledge Intensive Language Tasks.
Next
Last