Skip to content
Gopi Krishna Tummala

Tag: computer-vision

All the articles with the tag "computer-vision".

  • Advanced GenAI Systems
    45 MIN READ

    Diffusion — From Molecules to Machines

    A deep dive into the physics and probability of diffusion models. Learn how reversing a stochastic process became the foundation for modern generative AI, from Stable Diffusion to robotics and protein design.

  • Advanced GenAI Systems
    40 MIN READ

    Image Diffusion Models: From U-Net to DiT

    The evolution of image diffusion architectures. Learn how we moved from convolutional U-Nets to scalable Diffusion Transformers (DiT), and why treating images like language changed everything.

  • Advanced GenAI Systems
    45 MIN READ

    The Frontier: Sora, Veo, and the Future of Video

    Exploring the state-of-the-art in video generation. Learn how Sora and Veo use Spatiotemporal Transformers to simulate the physical world, and the challenges of achieving perfect motion fidelity.

  • Advanced GenAI Systems
    40 MIN READ

    Physics-Aware Video Diffusion: From Pixels to Laws

    How to move from visual imitation to law-governed motion. Deep dive into injecting PDEs into neural networks, implicit physics extraction, and LLM-guided physical reasoning.

  • Advanced GenAI Systems
    45 MIN READ

    The Training Lifecycle: From Noise to Nuance

    How to train a world-class diffusion model. Covers the complete lifecycle: from large-scale pre-training on noisy web data to specialized post-training, alignment, and aesthetic fine-tuning.