Skip to content
Gopi Krishna Tummala

Tag: fine-tuning

All the articles with the tag "fine-tuning".

  • Advanced MLOps & Production
    45 MIN READ

    Post-Training Playbook: SFT, LoRA, DPO, and GRPO from First Principles

    Pre-training gives a model knowledge; post-training gives it behavior. A practitioner's breakdown of SFT, LoRA/QLoRA, DPO, and GRPO — with the memory math, concrete configs, and interview reasoning that separates candidates who've done this from candidates who've read about it.

  • Advanced GenAI Systems
    45 MIN READ

    The Training Lifecycle: From Noise to Nuance

    How to train a world-class diffusion model. Covers the complete lifecycle: from large-scale pre-training on noisy web data to specialized post-training, alignment, and aesthetic fine-tuning.