Reinforcement Learning — From Intuition to Algorithms
A narrative-first walkthrough of reinforcement learning, starting with everyday intuition and ending with the math behind Q-learning and DQN.
I'm Gopi Krishna Tummala. I bridge the gap between research papers and production systems. Here is my blueprint for modern AI engineering.
Structured learning paths organized by domain—from generative AI and production systems to autonomous vehicles and agentic intelligence.
From Transformers to Diffusion Models. Understanding the architectures and algorithms powering modern generative AI.
Scaling, serving, and optimizing AI systems. Custom kernels, inference engines, and production infrastructure.
How self-driving cars actually work. Prediction, calibration, sensing, and closed-loop reasoning.
From ReAct loops to multi-agent systems. Building intelligent agents that reason, plan, and act autonomously.
A quick tour through the roles, research labs, and collaborations that shaped my path in AI and autonomous systems.
Adobe · Creative Cloud & Firefly · San Jose, CA
Leading large-scale data pipelines, training infrastructure, and responsible generative AI initiatives that power Firefly and Creative Cloud surfaces.
Autonomous Vehicle Systems · Bay Area, CA
Shipped multi-agent prediction models for L3/L4/L5 autonomous vehicle fleets and co-designed the training framework and dataloaders that kept the stack fed with fresh data.
Qualcomm Research · San Diego, CA
Led prediction for Qualcomm's L3 highway autonomous driving stack—owning forecasting models, simulation harnesses, and post-drive analytics. Earlier built integration and test automation for the stack.
Ph.D. Computer Science & Engineering
Dissertation on collaborative perception and behavior prediction for intelligent transportation systems.
Microsoft Research · Bangalore, India
Designed AutoCalib—a large-scale traffic camera calibration system with <10% speed error—for Microsoft's video analytics platform.
The Ohio State University · Columbus, OH
Built SmartDashCam, Soft-Swipe, RoadView, and RoadMap; taught introductory programming; collaborated with Honda on live calibration and lane-level localization.
Standard Chartered Bank · Chennai, India
Developed reporting systems and automation scripts for global private banking infrastructure.
Tata Elxsi · Chennai, India
Optimized LTE PDCCH blind decoding algorithms and explored DSP-based radio prototyping.
Indian Institute of Technology Madras · Chennai, India
Graduated with honors; led hostel council committees.
A curated selection of recent publications and projects that explore robust perception, generative modeling, and multi-agent systems at scale.
Why modern AI teams are handcrafting GPU kernels—from FlashAttention to TPU Pallas code—and how smarter tooling is making silicon-level tuning accessible.
A high-level view of how modern vision-language models connect pixels and prose, from CLIP and BLIP to Flamingo, MiniGPT-4, Kosmos, and Gemini.
How PagedAttention, Continuous Batching, Speculative Decoding, and Quantization unlock lightning-fast, reliable large language model serving.
A clear introduction to diffusion and guided diffusion — how a simple physical process became a foundation for modern generative AI, from Stable Diffusion to robotics and protein design.
A reader-friendly guide to scaling AI models beyond the data pipeline—from training loops and distributed frameworks to checkpoints, mixed precision, and fault tolerance.
From photons to decisions: How machines reconstruct 3D reality from 2D data. Covers cameras, IPM, radar, LiDAR, and sensor fusion, taking an intuitive, first-principles approach.
If you don't know where your eyes are relative to your feet, you trip. Covers intrinsics, extrinsics, SE(3) transforms, online vs. offline calibration, and time synchronization.
Why L5 autonomy is harder than a moon landing. Understanding ODD, latency loops, compute constraints, and the probability of failure in autonomous systems.
From GPS to centimeter accuracy: How autonomous vehicles know their exact position. Covers GNSS, IMU, wheel odometry, scan matching, and the Kalman Filter fusion that creates the "Blue Line."
From perception to action: How autonomous vehicles make decisions. Covers cost functions, game-theoretic planning, and the modular vs. end-to-end debate.
A deep dive into how datasets and dataloaders power modern AI—from the quiet pipeline that feeds models to the sophisticated tools that make training efficient—the hidden engine that keeps AI systems running.
How diffusion models predict action sequences instead of pixels. Covers Diffusion Policy, world models for robotics, and connecting diffusion to reinforcement learning for autonomous systems.
The evolution of image diffusion models from U-Net architectures to Diffusion Transformers (DiT). Covers latent diffusion, the DiT revolution, and the complete image generation pipeline.
Deep dive into state-of-the-art video generation models: Sora, Veo 3, and Open-Sora. Plus motion modeling techniques using optical flow, geometry, and diffusion fields.
How video diffusion models are built through pre-training and aligned through post-training. Covers the billion-frame training problem, DPO, RLHF, and the complete training pipeline.
How to accelerate diffusion sampling and control output quality. Covers DDIM, DPM-Solver, Classifier-Free Guidance (CFG), negative prompting, and inference optimization techniques.
Why video is harder than images, the DiT revolution for video, and how diffusion models learn temporal consistency. Covers V-DiT, AsymmDiT, and the mathematical foundations of video generation.
A deep dive into XGBoost — how second-order Taylor approximations and sophisticated regularization make it the dominant algorithm for structured data, bridging mathematical rigor with system engineering excellence.
The hardest problem in AV: predicting human irrationality. Covers the evolution from physics-based prediction to Generative AI, tracking the journey through Waymo Open Dataset Challenges.
A deep dive into physics-aware video diffusion models: how researchers inject physical constraints into generative models, the three leading technical approaches, and their practical impact on robotics and scientific simulation.
Part 1 of a comprehensive guide to agentic AI design patterns. Covers the fundamentals: ReAct loops, planning, tool use, self-consistency, and graph-based reasoning.
Part 2 of a comprehensive guide to agentic AI design patterns. Covers production-ready patterns: memory management, supervisor/orchestrator, parallel tool execution, and hidden reasoning.
Part 3 of a comprehensive guide to agentic AI design patterns. Covers specialized patterns: embodied agents, 3D scene understanding, imagination loops, multi-agent societies, error recovery, and self-debugging.
Part 4 of a comprehensive guide to agentic AI design patterns. Covers common failure modes, safety mechanisms, verifiable pipelines, and how to build reliable production systems.
Part 5 of a comprehensive guide to agentic AI design patterns. Covers 2025 trends, cost optimization, case studies, production checklist, and the state of the field.
An exploration of modern agent systems, with math, analogies, and examples. From ReAct loops to multi-agent societies, discover the design patterns that make AI agents think, act, and fix themselves.
An intuitive introduction to the Transformer architecture — from the attention mechanism to self-attention and cross-attention, using language translation as a concrete example.
An intuitive introduction to Variational Autoencoders — how compressing data into probabilistic codes enables machines to generate realistic images, sounds, and structures.
Reflections on building production-grade behavior prediction systems for autonomous vehicles — and why closed-loop reasoning is the bridge between perception and planning.
How we used deep learning to automatically calibrate traffic cameras by observing vehicle motion—work that won Best Paper Award at ACM BuildSys 2017.
My research journey from wireless communication foundations to solving the camera calibration bottleneck that enables autonomous vehicle vision.
A structured articulation and pacing warm-up designed to help technologists speak with clarity and confidence in high-stakes meetings.
A collaborative 45-minute thinking algorithm tuned for Google-style coding interviews—classify the problem, co-design an optimal approach, code with confidence, and handle follow-ups with ease.