Skip to content
@MILVLG

Vision and Language Group@ MIL

Hangzhou Dianzi University

Popular repositories Loading

  1. mcan-vqa mcan-vqa Public

    Deep Modular Co-Attention Networks for Visual Question Answering

    Python 458 89

  2. openvqa openvqa Public

    A lightweight, scalable, and general framework for visual question answering research

    Python 330 64

  3. bottom-up-attention.pytorch bottom-up-attention.pytorch Public

    A PyTorch reimplementation of bottom-up-attention models

    Jupyter Notebook 302 76

  4. prophet prophet Public

    Implementation of CVPR 2023 paper "Prompting Large Language Models with Answer Heuristics for Knowledge-based Visual Question Answering".

    Python 278 28

  5. imp imp Public

    a family of highly capabale yet efficient large multimodal models

    Python 191 15

  6. activitynet-qa activitynet-qa Public

    An VideoQA dataset based on the videos from ActivityNet

    Python 91 10

Repositories

Showing 10 of 17 repositories
  • engineering-practices-of-llms Public

    A course that systematically introduces the foundations and engineering practices of LLMs

    MILVLG/engineering-practices-of-llms’s past year of commit activity
    Python 4 2 0 0 Updated Jan 27, 2026
  • twigvlm Public

    Implementation of ICCV 2025 paper "Growing a Twig to Accelerate Large Vision-Language Models".

    MILVLG/twigvlm’s past year of commit activity
    Python 24 Apache-2.0 3 0 0 Updated Dec 29, 2025
  • smvqa Public

    A VQA benchmark with Street-map Images

    MILVLG/smvqa’s past year of commit activity
    Python 1 Apache-2.0 1 0 0 Updated Nov 17, 2025
  • prophet Public

    Implementation of CVPR 2023 paper "Prompting Large Language Models with Answer Heuristics for Knowledge-based Visual Question Answering".

    MILVLG/prophet’s past year of commit activity
    Python 278 Apache-2.0 28 4 0 Updated Jun 14, 2025
  • imp Public

    a family of highly capabale yet efficient large multimodal models

    MILVLG/imp’s past year of commit activity
    Python 191 Apache-2.0 15 3 3 Updated Aug 23, 2024
  • mlc-imp Public Forked from mlc-ai/mlc-llm

    Enable everyone to develop, optimize and deploy AI models natively on everyone's devices.

    MILVLG/mlc-imp’s past year of commit activity
    Python 13 Apache-2.0 1,995 0 0 Updated May 29, 2024
  • anetqa Public template
    MILVLG/anetqa’s past year of commit activity
    HTML 0 1 0 0 Updated Mar 15, 2024
  • anetqa-code Public
    MILVLG/anetqa-code’s past year of commit activity
    Python 9 Apache-2.0 2 1 0 Updated Mar 7, 2024
  • rosita Public

    ROSITA: Enhancing Vision-and-Language Semantic Alignments via Cross- and Intra-modal Knowledge Integration

    MILVLG/rosita’s past year of commit activity
    Python 56 Apache-2.0 13 1 0 Updated Jun 13, 2023
  • bst Public
    MILVLG/bst’s past year of commit activity
    Python 5 Apache-2.0 1 0 0 Updated May 12, 2023

Most used topics

Loading…