puneeshkhanna/README.md

Hi 👋, I'm Puneesh Khanna

Principal AI Research Engineer · Building Falcon LLMs 🦅

Full LLM lifecycle: data, pre-training, post-training (SFT + RL), evaluation, and large-scale deployment.


🧠 About me

  • 🔬 Principal AI Research Engineer at Technology Innovation Institute (TII), Abu Dhabi
  • 🦅 Lead developer on the Falcon LLM team: Falcon 3, Falcon-H1, and Falcon-H1R model families (0.5B → 74B)
  • 🚀 Large-scale, multi-node GPU training & inference with NeMo-RL, Megatron-LM, veRL, vLLM, and SGLang
  • 🛠️ Previously AI Frameworks Architect at Intel, optimizing LLMs (LLaMA, BLOOM, GPT-class) on AI accelerators
  • 📈 20+ years of industry experience

🧰 Focus areas

Pre-training · SFT · RL (GRPO · DAPO · DPO) · Reasoning & test-time scaling · Synthetic data generation · Long-context training · TP / PP / DP / FSDP · Context & sequence parallelism

📚 Selected publications

| Year | Title | Link |
| --- | --- | --- |
| 2026 | Falcon-H1R: Pushing the Reasoning Frontiers with a Hybrid Model for Efficient Test-Time Scaling | arXiv |
| 2026 | Falcon-H1-Tiny: A Series of Extremely Small, Yet Powerful Language Models | HF Blog |
| 2025 | Falcon-H1: A Family of Hybrid-Head Language Models Redefining Efficiency and Performance | arXiv |
| 2024 | Welcome to the Falcon 3 Family of Open Models | HF Blog |
| 2023 | Defect Classification for Integrated Circuits Contamination on Land Grid Arrays | IEEE |
| 2022 | Screening Deep Learning Inference Accelerators at the Production Lines | VLSI-2022 |
| 2021 | Identify and Localize COVID-19 Abnormalities on Chest Radiographs (MSc thesis) | ResearchGate |

📫 Connect


Pinned

  1. DeepLearning (Public)

    Deep Learning Understanding Projects

    Jupyter Notebook

  2. MachineLearning (Public)

    Machine Learning understanding projects.

    Jupyter Notebook

  3. Tensor-Parallelism (Public)

    Jupyter Notebook

  4. huggingface/optimum-habana (Public)

    Easy and lightning fast training of 🤗 Transformers on Habana Gaudi processor (HPU)

    Python · 214 stars · 273 forks

  5. deepspeedai/DeepSpeed (Public)

    DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.

    Python · 42.6k stars · 4.9k forks