Skip to content
View Jayanth-reflex's full-sized avatar

Block or report Jayanth-reflex

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don’t include any personal information such as legal names or email addresses. Markdown is supported. This note will only be visible to you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Jayanth-reflex/README.md
header

typing

profile views status email linkedin

About me

I'm a Generative AI Software Engineer with 4 years of experience building production-grade software at the intersection of LLM systems and backend infrastructure. I've shipped production Generative AI platforms using RAG, semantic and vector search, agentic AI workflows (LangGraph), and Responsible AI guardrails, serving millions of users.

Right now, I'm fine-tuning Qwen3.6-35B-A3B (a 35-billion-parameter large language model) on AMD MI300X GPUs. The model achieved zero refusals across 465 safety tests using a technique called heretic-abliteration, with safety policies enforced at the application layer.

I optimize for latency, cost, and correctness, without the shortcuts that bite you 6 months later.

Tech Stack

Languages Python TypeScript Java SQL Bash
AI / ML PyTorch HuggingFace vLLM LangChain PEFT / LoRA ROCm RAG LangGraph Weights & Biases
Backend FastAPI Spring Boot Node.js Django Flask
Frontend Next.js React TypeScript TailwindCSS
Databases PostgreSQL MongoDB Redis MySQL
Cloud / DevOps Docker Kubernetes AWS GCP Azure Terraform GitHub Actions
Tools Cursor Claude Code VS Code Postman Linux

Featured Projects

Domain-specialized AI fine-tuning project. Built for the AMD Hackathon 2026. Specialized fine-tune of Qwen3.6-35B-A3B (a 35-billion-parameter large language model) on AMD MI300X GPUs. Achieved zero refusals across 465 safety tests via a technique called heretic-abliteration, with safety policies enforced at the application layer.

Python ROCm vLLM PEFT LoRA

Open-source AI research project. Research on safe model abliteration and post-training quantization (GPTQ, AWQ, and GGUF formats). Includes tooling, evaluations, and reproducible recipes for the broader community.

Python transformers bitsandbytes llama.cpp

Production-grade agentic AI chatbot. Built with typed tool calls, persistent memory, and streaming responses. Provider-agnostic, so it works seamlessly with multiple AI providers (OpenRouter, Ollama, and Anthropic).

TypeScript Next.js Vercel AI SDK

Real-time disease tracking dashboard. Pulls live data from the World Health Organization (WHO) and Johns Hopkins University. Features map-based geospatial visualization and trend forecasting.

TypeScript React D3 Mapbox

Open to Hire

I'm currently open to senior or staff engineer roles (individual contributor positions, not management).

I'm looking for opportunities in:

  • 🤖 Generative AI / LLM Engineering · RAG, fine-tuning, inference, agentic platforms
  • ⚙️ Backend Platforms · high-throughput Python or Java services, distributed systems
  • 🛠 Developer Tooling · agentic systems, LLM-powered developer tools

I'm based in India and open to remote roles or relocation. I respond to all messages within 24 hours.

email cta   linkedin cta

footer

Pinned Loading

  1. global-disease-tracker global-disease-tracker Public

    TypeScript 1

  2. conscious-media-landing conscious-media-landing Public

    JavaScript

  3. llm-abliteration-quantization llm-abliteration-quantization Public

    Python 1

  4. newman newman Public

    JavaScript 1