Skip to content
Gopi Krishna Tummala
← Back to Home

MLOps & Production

The Infrastructure Round

Scaling, serving, and optimizing AI systems. Custom kernels, inference engines, and production infrastructure.

Learning Path

  • Start with **Datasets and Dataloaders** for efficient data pipelines
  • Move to **Training Frameworks** for distributed training and resilience
  • Explore **vLLM** for serving infrastructure (PagedAttention, Continuous Batching)
  • Finish with **Custom Kernels** for GPU optimization (FlashAttention)

All Posts in This Track

4 articles covering mlops & production