Visual Question Answering: From Theory to Application
Paperback
$109.99
Visual Question Answering (VQA) typically pairs a visual input, such as an image or video, with a natural language question about that input, and generates a natural language answer as the output. It is by nature a multidisciplinary research problem, involving computer vision (CV), natural language processing (NLP), and knowledge representation and reasoning (KR), among other fields.
Further, VQA is an ambitious undertaking, as it must overcome the challenges of general image understanding and the question-an...






















