Implementing OCR with a local vision model run by Ollama.
Updated Nov 27, 2024 - TypeScript
An automated tool that uses AI vision models to detect smoking scenes in movies and automatically adds the appropriate disclaimers, eliminating manual frame-by-frame editing.
Awesome AI Research Work
A PyTorch image classifier using RegNetY and Albumentations on the Fashion MNIST dataset. Trains with tqdm progress bars, plots loss curves, and follows a clean modular design.
PDF Compliance Checker AI
Image Similarity Search Engine recreates Google Lens functionality using a ResNet model. It lets users find images similar to a query image by extracting features and running a similarity search over them.
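The similarity-search step described above can be sketched in a few lines. This is a minimal illustration, not the repository's code: it assumes each image has already been turned into a feature vector (for example by a ResNet), and the names `cosine_similarity`, `top_k_similar`, and the toy three-dimensional embeddings are hypothetical. Cosine similarity is one common choice of metric; the description does not say which one the project uses.

```python
import math
from typing import Dict, List, Tuple

Vector = List[float]


def cosine_similarity(a: Vector, b: Vector) -> float:
    """Cosine of the angle between two feature vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def top_k_similar(query: Vector, index: Dict[str, Vector], k: int = 3) -> List[Tuple[str, float]]:
    """Score every indexed image against the query and return the k best matches."""
    scored = [(name, cosine_similarity(query, vec)) for name, vec in index.items()]
    scored.sort(key=lambda item: item[1], reverse=True)
    return scored[:k]


# Hypothetical toy embeddings standing in for real model features.
index = {
    "shoe.jpg": [0.9, 0.1, 0.0],
    "boot.jpg": [0.8, 0.2, 0.1],
    "tree.jpg": [0.0, 0.1, 0.9],
}

results = top_k_similar([1.0, 0.0, 0.0], index, k=2)
for name, score in results:
    print(f"{name}: {score:.3f}")
```

A brute-force scan like this is fine for small collections; larger indexes usually swap in an approximate nearest-neighbour library.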
Robust Content-Based Image Retrieval that uses a Vision Transformer, colors, metadata, and user feedback on uploaded images to retrieve images matching a user query, combining the Boolean model with the vector space model (VSM).
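One way to combine a Boolean model with a VSM is to filter candidates with a Boolean AND over required terms, then rank the survivors by TF-IDF cosine similarity. The sketch below shows that idea on text metadata only. It is an assumption about how the two models might be combined, not the project's implementation; the function names, the smoothed IDF formula, and the sample tags are all hypothetical.

```python
import math
from collections import Counter
from typing import Dict, List, Set, Tuple

SparseVector = Dict[str, float]


def tokenize(text: str) -> List[str]:
    return text.lower().split()


def boolean_filter(docs: Dict[str, str], required: Set[str]) -> Dict[str, str]:
    """Boolean AND: keep only documents that contain every required term."""
    return {doc_id: text for doc_id, text in docs.items()
            if required <= set(tokenize(text))}


def cosine(a: SparseVector, b: SparseVector) -> float:
    dot = sum(weight * b.get(term, 0.0) for term, weight in a.items())
    norm_a = math.sqrt(sum(w * w for w in a.values()))
    norm_b = math.sqrt(sum(w * w for w in b.values()))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0
    return dot / (norm_a * norm_b)


def search(docs: Dict[str, str], query: str, required: Set[str]) -> List[Tuple[str, float]]:
    """Filter with the Boolean model, then rank the candidates with the VSM."""
    candidates = boolean_filter(docs, required)
    if not candidates:
        return []
    tokens = {doc_id: tokenize(text) for doc_id, text in candidates.items()}
    n = len(tokens)
    df = Counter(term for toks in tokens.values() for term in set(toks))
    # Smoothed IDF (an assumed weighting scheme, chosen to avoid zero weights).
    idf = {term: math.log((1 + n) / (1 + count)) + 1.0 for term, count in df.items()}
    query_vec = {term: tf * idf.get(term, 0.0)
                 for term, tf in Counter(tokenize(query)).items()}
    ranked = []
    for doc_id, toks in tokens.items():
        doc_vec = {term: tf * idf[term] for term, tf in Counter(toks).items()}
        ranked.append((doc_id, cosine(query_vec, doc_vec)))
    ranked.sort(key=lambda item: item[1], reverse=True)
    return ranked


# Hypothetical image metadata tags.
docs = {
    "img1": "red car street",
    "img2": "red apple table",
    "img3": "blue car garage",
}

results = search(docs, "red car", required={"car"})
print(results)
```

Here `img2` is dropped by the Boolean filter because it lacks "car", and `img1` ranks above `img3` because it also matches "red".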