When AI Sees and Speaks — The Rise of Vision-Language Models
A high level view on how modern vision-language models connect pixels and prose, from CLIP and BLIP to Flamingo, MiniGPT-4, Kosmos, and Gemini.
The System Design Round
Production systems for generative AI: scaling, optimizing, and serving large language models.
9 articles covering GenAI systems
From naive vector search to industry-standard multimodal RAG. Master hybrid search, query rewriting, cross-encoder reranking, and the architecture of high-precision retrieval systems.
A deep dive into the physics and probability of diffusion models. Learn how reversing a stochastic process became the foundation for modern generative AI, from Stable Diffusion to robotics and protein design.
The evolution of image diffusion architectures. Learn how we moved from convolutional U-Nets to scalable Diffusion Transformers (DiT), and why treating images like language changed everything.
Exploring the state-of-the-art in video generation. Learn how Sora and Veo use Spatiotemporal Transformers to simulate the physical world, and the challenges of achieving perfect motion fidelity.
How to move from visual imitation to law-governed motion. Deep dive into injecting PDEs into neural networks, implicit physics extraction, and LLM-guided physical reasoning.
How to train a world-class diffusion model. Covers the complete lifecycle: from large-scale pre-training on noisy web data to specialized post-training, alignment, and aesthetic fine-tuning.
How to accelerate diffusion sampling and steer creativity. Learn the mechanics of DDIM, DPM-Solver, Classifier-Free Guidance (CFG), and the math of negative prompting.
The fundamentals of video diffusion models. Learn how we extend 2D diffusion to time, the mechanics of temporal attention, and the architectural shifts required for motion consistency.