Skip to content
View Nobi004's full-sized avatar

Organizations

@Joint-Venture-AI

Block or report Nobi004

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Nobi004/README.md

πŸ‘‹ About Me

I'm a Generative AI Engineer & Team Lead at Joint Venture AI (Betopia Group), where I turn brittle GenAI prototypes into production systems β€” schema-validated, observable, and tested, not vibes-checked. My work spans document AI (OCR β†’ transformer-based token classification β†’ graph neural networks for structured extraction), LLM inference optimization, and developer tooling that makes GenAI pipelines as rigorously testable as any other piece of software.

Originally trained as an Electrical & Electronic Engineer (Faridpur Engineering College, '25), now based in Dhaka, Bangladesh β€” building toward AI products for a global market.

πŸ† 2nd Runner-Up, SEC EEE FEST 2024

A few numbers from the production side of the work:

Revenue Generated App Installs Hallucination Extraction Errors Latency

⚑ Right Now

  • πŸ”­ Leading GenAI systems work at Joint Venture AI (Betopia Group)
  • βš™οΈ Building Driftgate β€” a Go-native regression-testing CLI for LLM prompts, solo, nights and weekends
  • πŸ“š Deepening: LLM training infrastructure Β· Go systems engineering Β· production MLOps Β· agentic systems at scale
  • 🎯 Open to Generative AI Engineer / AI Systems Tech Lead roles β€” remote-first, open to relocation

πŸš€ Flagship Work

πŸ›‘οΈ Driftgate

Regression testing for LLM prompts, shipped as a single static binary. Most prompt changes ship on vibes β€” "I read the output, looks fine." Driftgate turns that into an actual test suite: define expected behaviors as assertions, run them against every prompt change, catch regressions before they reach production. Written in Go specifically so it compiles to one dependency-free binary β€” the whole point is that it runs inside air-gapped and regulated enterprise environments where pip install isn't an option. Currently expanding assertion types and CI integrations ahead of a public launch via GoReleaser + Homebrew. Go CLI Regression Testing Air-Gapped Deployment

πŸ“„ Production Insurance-Document Extraction Pipeline

A 3-stage extraction system for ACORD 25/101 forms that survives messy real-world scans. PaddleOCR handles layout-aware text extraction, LayoutLMv3 runs multi-head BIO token classification, and a GAT + GraphSAGE combination reasons over the document spatially β€” because insurance forms aren't linear text, they're a grid of related fields. Outputs strict, null-safe JSON with hybrid confidence routing (graph β‰₯ 0.85 β†’ model β‰₯ 0.75 β†’ rule-based fallback). Cut extraction errors by 35%. PyTorch LayoutLMv3 PaddleOCR Graph Neural Networks

βš™οΈ Adaptive LLM Inference Gateway

A systems-design deep dive into adaptive LLM serving β€” under real constraints, not unlimited cloud budget. Built entirely on free-tier Kaggle T4 GPUs (16GB VRAM, zero budget), serving the Qwen2.5-Instruct family (0.5B–1.5B, AWQ int4). Development order was eval-first: a frozen eval set and a mock backend shipped before a single GPU cycle was spent β€” because you can't optimize what you haven't measured yet. LLM Serving Qwen2.5 Systems Design GPU-Constrained Inference

πŸ—‚οΈ More work β€” ALPVS, offline vision systems, earlier extraction pipelines
  • ALPVS (AI Policy Validation System) β€” Fully offline, air-gapped policy validation architecture, documented to enterprise standard with a full TDR and CTO-level architecture blueprint.
  • Offline OCR/Vision Toolkit β€” A fully offline Streamlit application built to SOLID principles, with a pluggable model registry for swapping vision backbones without touching the pipeline.
  • Early ACORD Extraction Engine β€” Gemma-based extraction with an image-preprocessing pipeline and section-by-section validation-retry loops, an earlier iteration of the graph-based system above.

πŸ› οΈ Tech Stack

Languages Python Go

AI / ML PyTorch TensorFlow scikit-learn

Document AI / Computer Vision OpenCV Pandas NumPy

Infra / Tools Docker Git Linux Streamlit


πŸ“Š GitHub Analytics

nobi004's GitHub stats Top languages GitHub streak stats

πŸ“« Let's Talk

If you're hiring for GenAI/LLM systems work, or just want to talk shop about making AI pipelines actually production-grade, my inbox is open.

Say hello

footer banner

Pinned Loading

  1. accord25_extractor accord25_extractor Public

    Python

  2. ai-engineering-assessments ai-engineering-assessments Public

    Python

  3. ai_content_studio ai_content_studio Public

    Python

  4. eeegpt1.0 eeegpt1.0 Public

    Python