OpenVQA is a general platform for visual question answering (VQA) research. It implements state-of-the-art approaches (e.g., BUTD, MFH, BAN, and MCAN) on benchmark datasets such as VQA-v2, GQA, and CLEVR.
This project is released under the Apache 2.0 license.
Supported methods and benchmark datasets are shown in the table below. Results and models are available in MODEL_ZOO.md.
| | VQA-v2 | GQA | CLEVR |
|---|---|---|---|
| BUTD | | | |
| MLB | | | |
| MFB | | | |
| MFH | | | |
| BAN | ✓ | | |
| MCAN | ✓ | | |
Please refer to INSTALL.md for installation and dataset preparation.
Please see GETTING_STARTED.md for the basic usage of OpenVQA.
We appreciate all contributions to improve OpenVQA. Please refer to CONTRIBUTING.md for the contribution guidelines.
