Video QA – 3D Attention is All You Need
Typically, Q&A systems use text to answer questions. A task in this way is the Squad task, which gives you a paragraph explaining a fact and then asks a question and generates an appropriate answer. In contrast, Visual QA instead of text…