Gopi Krishna Tummala

Posts

6 of 47 articles — browse by tag or search to filter.

Advanced Robotics
40 MIN READ

Diffusion for Action: Trajectories and Policy

How diffusion models predict action sequences instead of pixels. Covers Diffusion Policy, world models for robotics, and connecting diffusion to reinforcement learning for autonomous systems.

January 25, 2025
Advanced GenAI Systems
40 MIN READ

Image Diffusion Models: From U-Net to DiT

The evolution of image diffusion architectures. Learn how we moved from convolutional U-Nets to scalable Diffusion Transformers (DiT), and why treating images like language changed everything.

January 25, 2025
Advanced GenAI Systems
45 MIN READ

The Frontier: Sora, Veo, and the Future of Video

Exploring the state-of-the-art in video generation. Learn how Sora and Veo use Spatiotemporal Transformers to simulate the physical world, and the challenges of achieving perfect motion fidelity.

January 25, 2025
Advanced GenAI Systems
45 MIN READ

The Training Lifecycle: From Noise to Nuance

How to train a world-class diffusion model. Covers the complete lifecycle: from large-scale pre-training on noisy web data to specialized post-training, alignment, and aesthetic fine-tuning.

January 25, 2025
Advanced GenAI Systems
40 MIN READ

Physics-Aware Video Diffusion: From Pixels to Laws

How to move from visual imitation to law-governed motion. Deep dive into injecting PDEs into neural networks, implicit physics extraction, and LLM-guided physical reasoning.

January 20, 2025
Advanced GenAI Systems
40 MIN READ

Sampling & Guidance: The Dialects of Noise

How to accelerate diffusion sampling and steer creativity. Learn the mechanics of DDIM, DPM-Solver, Classifier-Free Guidance (CFG), and the math of negative prompting.

January 25, 2025