feat: add local Ollama VLM input plugin for multimodal visual reasoning #2467

Open

Wanbogang wants to merge 2 commits into OpenMind:main from
Conversation
Add VLM_Ollama_Local plugin that captures webcam frames and sends them to a locally running Ollama instance (e.g., llava, moondream) for offline visual reasoning without cloud dependency.

- Follows existing FuserInput pattern (vlm_local_yolo, vlm_coco_local)
- Uses aiohttp to POST base64-encoded frames to Ollama /api/chat
- Supports any Ollama multimodal model via config (default: llava)
- Gracefully handles camera failures, API errors, and timeouts
- 22 tests with 100% coverage

Addresses the llava multimodal gap noted in config/ollama.json5
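To make the flow concrete, here is a minimal sketch of the request path the description implies: grab one webcam frame with OpenCV, base64-encode it, and POST it to Ollama's /api/chat. This is illustrative only, not the plugin's actual code; the `describe_frame` helper, its defaults, and the 30 s timeout are assumptions.

```python
import asyncio
import base64

import aiohttp
import cv2

OLLAMA_URL = "http://localhost:11434/api/chat"  # Ollama's default local endpoint


async def describe_frame(model: str = "llava", prompt: str = "What do you see?") -> str:
    # Capture a single frame from the default webcam.
    cap = cv2.VideoCapture(0)
    ok, frame = cap.read()
    cap.release()
    if not ok:
        # The real plugin reportedly handles this gracefully; here we just raise.
        raise RuntimeError("camera capture failed")

    # Ollama's /api/chat accepts base64-encoded images attached to a message.
    _, jpeg = cv2.imencode(".jpg", frame)
    payload = {
        "model": model,
        "messages": [
            {
                "role": "user",
                "content": prompt,
                "images": [base64.b64encode(jpeg.tobytes()).decode("utf-8")],
            }
        ],
        "stream": False,  # single JSON response instead of a token stream
    }

    # Bounded timeout so a stalled local server cannot hang the input loop.
    timeout = aiohttp.ClientTimeout(total=30)
    async with aiohttp.ClientSession(timeout=timeout) as session:
        async with session.post(OLLAMA_URL, json=payload) as resp:
            resp.raise_for_status()
            data = await resp.json()
            return data["message"]["content"]


if __name__ == "__main__":
    print(asyncio.run(describe_frame()))
```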
Codecov Report ✅ All modified and coverable lines are covered by tests.
Summary
Adds VLM_Ollama_Local, a new input plugin for offline visual reasoning using a locally running Ollama multimodal model (e.g., llava, moondream).
This addresses the multimodal gap noted in config/ollama.json5, where llava was listed as a supported model but no visual input plugin existed.

Changes
- src/inputs/plugins/vlm_ollama_local.py — new plugin
- tests/inputs/plugins/test_vlm_ollama_local.py — 22 tests, 100% coverage

Usage
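As a rough sketch only, an input like this would presumably be enabled through the agent config alongside the other vision inputs; the field names below are illustrative guesses, not the plugin's actual schema. Any Ollama multimodal model can stand in for llava.

```json5
// Hypothetical config excerpt -- field names are guesses, not the real schema.
{
  agent_inputs: [
    {
      type: "VLM_Ollama_Local",
      config: {
        model: "llava",                      // or moondream, or any Ollama VLM
        base_url: "http://localhost:11434",  // local Ollama endpoint
      },
    },
  ],
}
```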