Gopi Krishna Tummala

Tag: fine-tuning

All the articles with the tag "fine-tuning".

Advanced MLOps & Production
45 MIN READ

Post-Training Playbook: SFT, LoRA, DPO, and GRPO from First Principles

Pre-training gives a model knowledge; post-training gives it behavior. A practitioner's breakdown of SFT, LoRA/QLoRA, DPO, and GRPO — with the memory math, concrete configs, and interview reasoning that separates candidates who've done this from candidates who've read about it.

January 15, 2026
Advanced GenAI Systems
45 MIN READ

The Training Lifecycle: From Noise to Nuance

How to train a world-class diffusion model, covering the complete lifecycle from large-scale pre-training on noisy web data to specialized post-training, alignment, and aesthetic fine-tuning.

January 25, 2025