Semantic Representation for Visual Reasoning

Wenfeng Zheng
Shan Liu
Bo Yang

Publisher:ISBN 13: 9781954318038ISBN 10: 1954318030

Paperback & Hardcover deals ―

Amazon IndiaGO Flipkart GOSnapdealGO SapnaOnlineGO Jain Book AgencyGO Books Wagon₹489 Book ChorGO CrosswordGO DC BooksGO

e-book & Audiobook deals ―

Amazon India GO Google Play Books GO Audible GO

* Price may vary from time to time.

* GO = We're not able to fetch the price (please check manually visiting the website).

Share this with your friends:

Facebook Twitter Reddit Pinterest WhatsApp Telegram

Know about the book -

Semantic Representation for Visual Reasoning is written by Wenfeng Zheng and published by . It's available with International Standard Book Number or ISBN identification 1954318030 (ISBN 10) and 9781954318038 (ISBN 13).

Visual Question Answering (VQA) combines natural language processing with digital image processing. The general process for solving a VQA problem is to take the image and the corresponding question as input and finally get the answer. The problems which are similar to VQA require more interdependent inference steps to solve.The research is mainly divided into the non-deep learning model and deep learning model. Most non-deep learning models are based on Bayesian theory. Those non-deep learning model performed poorly. In the field of visual reasoning, the achievement of deep learning significantly improved the accuracy of results. With the deepening of the research on deep learning, its research on the VQA field is becoming more and more mature. Image features are mostly used as the input to get answers. However, the image features are too redundant to learn accurate characterizations within a limited complexity and time. While in the process of human reasoning, abstract description of an image is usually to avoid irrelevant details. Inspired by this, a higher-level representation named semantic representation is introduced. In this book, the authors propose a dataset that weakens image processing and natural language processing. The idea of the Gram matrix is transferred here to build a relationship to represent the related information well. The model using semantic representation as input verifies that more accurate results can be obtained by introducing a high-level semantic representation.

Wenfeng Zheng

More books from the author

LISTENING IS THE NEW READING

Try Amazon Audible today for FREE. Cancel anytime

Start for free now