Sampling & Guidance: The Dialects of Noise
How to accelerate diffusion sampling and steer creativity. Learn the mechanics of DDIM, DPM-Solver, Classifier-Free Guidance (CFG), and the math of negative prompting.
All the articles with the tag "computer-vision".
How to accelerate diffusion sampling and steer creativity. Learn the mechanics of DDIM, DPM-Solver, Classifier-Free Guidance (CFG), and the math of negative prompting.
The fundamentals of video diffusion models. Learn how we extend 2D diffusion to time, the mechanics of temporal attention, and the architectural shifts required for motion consistency.
If you don't know where your eyes are relative to your feet, you trip. Covers intrinsics, extrinsics, SE(3) transforms, online vs. offline calibration, and time synchronization.
An intuitive introduction to Variational Autoencoders — how compressing data into probabilistic codes enables machines to generate realistic images, sounds, and structures.
My research journey from wireless communication foundations to solving the camera calibration bottleneck that enables autonomous vehicle vision.
How we used deep learning to automatically calibrate traffic cameras by observing vehicle motion—work that won Best Paper Award at ACM BuildSys 2017.