Automated RAG Evaluation without Human Labels (Ragas in Practice)
Learning and building automated evaluation system with Ragas. Can LLMs automatically evaluate RAG systems? Are human labels always needed?
Learning and building automated evaluation system with Ragas. Can LLMs automatically evaluate RAG systems? Are human labels always needed?
Exploring the practical benefits of the Matryoshka Representation Learning technique through my own experiment. Why some models can be shrunk without "losings" performance, how it works, and does it always work?
Explaining and building real use case for Hypothetical Document Embeddings (HyDE) to showcase that hallucinations are not always bad
Why LLMs are sensitive to their early tokens, and where prompting techniques like Chain-of-Thought and Tree-of-Thought get their power. This is my attempt to understand why early tokens matter, what ToT does behind the scenes, and whether it’s worth using outside of toy projects.
Building GEPA prompt optimizer from scractch to optimize compound AI system prompts using DSPy framework.
Real world use case for building RAG and learning from the building. Building fully functional travel itinerary planner with AI agents and RAG.
Using LLMs to catergorize 10,000 attractions efficiently. Learning blog post how to utilize LLMs to augment my travel app.
Guide on how to display thousands of markers on a leaflet map using supercluster and next.js
Deep dive into building a soft margin SVM in Python using a quadratic solver. Understanding the math behind the SVM and how to implement it from scratch.
Understanding logistic regression mathematically—exploring likelihood functions, gradient calculations, and how optimization works.