Gopi Krishna Tummala

Posts

All the articles I've posted.

Advanced MLOps & Production
40 MIN READ

The Hidden Engine of AI — Training Frameworks and Resilience

A guide to scaling AI models beyond the data pipeline—from training loops and distributed frameworks to 3D parallelism and fault tolerance.

February 1, 2025
Advanced MLOps & Production
25 MIN READ

The Infrastructure-First MLOps Roadmap: From Data DNA to Agentic AI

Standard MLOps advice tells you to learn Git and Docker. But for the next generation of AI Engineers, that's just the baseline. This roadmap focuses on the Infrastructure Round—deep-diving into how data is structured for speed, how it's fed into models, how those models scale across clusters, and how we squeeze every drop of performance out of the silicon.

December 18, 2025
Advanced MLOps & Production
40 MIN READ

Life of a Tensor: A Deep Dive into Production Inference

A comprehensive deep-dive into production inference optimization, tracing the path of a request through LLM and diffusion model serving systems. Understanding the bottlenecks from gateway to GPU kernel execution.

January 28, 2025
Advanced MLOps & Production
45 MIN READ

The Post-Training Playbook: Mastering PEFT, Alignment, and Adaptation

Pre-training gives models capability; post-training gives them value. A deep dive into LoRA, DoRA, DPO, and how we sculpt intelligence after the initial birth.

January 15, 2026
Advanced MLOps & Production
45 MIN READ

The DNA of Data: Parquet, Arrow, and the Quest for Analytic Speed

The unsung hero of modern data processing is how we structure data itself. Learn how Apache Parquet and Apache Arrow solve the fundamental trade-off between storage efficiency and compute speed.

December 3, 2025
Advanced MLOps & Production
35 MIN READ

vLLM and the Trilogy of Modern LLM Scaling

How PagedAttention, Continuous Batching, Speculative Decoding, and Quantization unlock lightning-fast, reliable large language model serving.

November 10, 2025