Skip to content
Gopi Krishna Tummala

Posts

6 of 47 articles — browse by tag or search to filter.

  • Advanced Robotics
    40 MIN READ

    Diffusion for Action: Trajectories and Policy

    How diffusion models predict action sequences instead of pixels. Covers Diffusion Policy, world models for robotics, and connecting diffusion to reinforcement learning for autonomous systems.

  • Advanced GenAI Systems
    40 MIN READ

    Image Diffusion Models: From U-Net to DiT

    The evolution of image diffusion architectures. Learn how we moved from convolutional U-Nets to scalable Diffusion Transformers (DiT), and why treating images like language changed everything.

  • Advanced GenAI Systems
    45 MIN READ

    The Frontier: Sora, Veo, and the Future of Video

    Exploring the state-of-the-art in video generation. Learn how Sora and Veo use Spatiotemporal Transformers to simulate the physical world, and the challenges of achieving perfect motion fidelity.

  • Advanced GenAI Systems
    45 MIN READ

    The Training Lifecycle: From Noise to Nuance

    How to train a world-class diffusion model. Covers the complete lifecycle: from large-scale pre-training on noisy web data to specialized post-training, alignment, and aesthetic fine-tuning.

  • Advanced GenAI Systems
    40 MIN READ

    Physics-Aware Video Diffusion: From Pixels to Laws

    How to move from visual imitation to law-governed motion. Deep dive into injecting PDEs into neural networks, implicit physics extraction, and LLM-guided physical reasoning.

  • Advanced GenAI Systems
    40 MIN READ

    Sampling & Guidance: The Dialects of Noise

    How to accelerate diffusion sampling and steer creativity. Learn the mechanics of DDIM, DPM-Solver, Classifier-Free Guidance (CFG), and the math of negative prompting.