Gopi Krishna Tummala

Tag: optimization

All the articles with the tag "optimization".

  • Intermediate Fundamentals
    25 MIN READ

    Backpropagation — The Math Behind Learning

    A complete derivation of backpropagation for MLPs — from chain rule intuition to delta propagation, with a worked numerical example showing exactly how errors flow backward through a network.

  • Advanced GenAI Systems
    40 MIN READ

    Sampling & Guidance: The Dialects of Noise

    How to accelerate diffusion sampling and steer creativity. Learn the mechanics of DDIM, DPM-Solver, Classifier-Free Guidance (CFG), and the math of negative prompting.

  • Advanced MLOps & Production
    40 MIN READ

    Life of a Tensor: A Deep Dive into Production Inference

    A deep dive into production inference optimization that traces a request through LLM and diffusion model serving systems and examines the bottlenecks at each stage, from gateway to GPU kernel execution.

  • Advanced MLOps & Production
    35 MIN READ

    vLLM and the Trilogy of Modern LLM Scaling

    How PagedAttention, Continuous Batching, Speculative Decoding, and Quantization unlock lightning-fast, reliable large language model serving.

  • Advanced Fundamentals
    25 MIN READ

    XGBoost — The Art of Boosting Trees with Second-Order Gradients

    A deep dive into XGBoost: how second-order Taylor approximations and sophisticated regularization make it a dominant algorithm for structured data, combining mathematical rigor with careful system engineering.