Gopi Krishna Tummala

Tag: performance

All the articles with the tag "performance".

Advanced MLOps & Production
40 MIN READ

The Custom Kernel Craze — Handcrafting GPU Performance

Why modern AI teams are handcrafting GPU kernels—from FlashAttention to Triton code—and how silicon-level tuning is the new frontier of MLOps.

November 11, 2025
Advanced MLOps & Production
45 MIN READ

The DNA of Data: Parquet, Arrow, and the Quest for Analytic Speed

The unsung hero of modern data processing is how we structure data itself. Learn how Apache Parquet and Apache Arrow solve the fundamental trade-off between storage efficiency and compute speed.

December 3, 2025