Gopi Krishna Tummala

Tag: multimodal

All the articles with the tag "multimodal".

Intermediate GenAI Systems
22 MIN READ

When AI Sees and Speaks — The Rise of Vision-Language Models

A high-level view of how modern vision-language models connect pixels and prose, from CLIP and BLIP to Flamingo, MiniGPT-4, Kosmos, and Gemini.

November 9, 2025

Advanced GenAI Systems
45 MIN READ

Building Production-Grade Multimodal RAG Systems

From naive vector search to industry-standard multimodal RAG. Master hybrid search, query rewriting, cross-encoder reranking, and the architecture of high-precision retrieval systems.

December 18, 2025