DEV Community

# rag

Retrieval augmented generation, or RAG, is an architectural approach that can improve the efficacy of large language model (LLM) applications by leveraging custom data.

Posts

👋 Sign in for the ability to sort posts by relevant, latest, or top.
I Built 'Chat With Your Docs' From Scratch — Supabase + pgvector + a Free Local Embedder

I Built 'Chat With Your Docs' From Scratch — Supabase + pgvector + a Free Local Embedder

Comments
2 min read
Two Pre-Registered Benchmarks for Audit-Native RAG: RAB (EU AI Act 10/12/19) + LRB (Time-Travel Retrieval)

Two Pre-Registered Benchmarks for Audit-Native RAG: RAB (EU AI Act 10/12/19) + LRB (Time-Travel Retrieval)

Comments
3 min read
Most RAG Problems Are Retrieval Problems. Here Are 8 Fixes That Worked for Me

Most RAG Problems Are Retrieval Problems. Here Are 8 Fixes That Worked for Me

Comments
4 min read
Building Modular AI Agent Features with Pydantic AI Capabilities

Building Modular AI Agent Features with Pydantic AI Capabilities

Comments
2 min read
A Chinese 8B model beat the Western 8B models at Japanese RAG. I still wouldn't put it in the default deployment — and that distinction is the point.

A Chinese 8B model beat the Western 8B models at Japanese RAG. I still wouldn't put it in the default deployment — and that distinction is the point.

Comments
4 min read
RAG should never be your default

RAG should never be your default

Comments
3 min read
Define the state of our agent

Define the state of our agent

Comments 1
6 min read
Query Rewriting Before Retrieval: The Cheap Recall Win Most Skip

Query Rewriting Before Retrieval: The Cheap Recall Win Most Skip

Comments
7 min read
AI Agents Level Up Workflows: Terraform MCP, WebMCP, Pinecone Integrations

AI Agents Level Up Workflows: Terraform MCP, WebMCP, Pinecone Integrations

Comments
4 min read
Context Compression Before the LLM: Cutting Tokens Without Cutting Recall

Context Compression Before the LLM: Cutting Tokens Without Cutting Recall

Comments
6 min read
# GraphRAG: The End-to-End Guide to Reducing Hallucination and Automating Complex Workflows

# GraphRAG: The End-to-End Guide to Reducing Hallucination and Automating Complex Workflows

Comments
15 min read
Metadata Filtering Before Vector Search: The Recall Win Nobody Measures

Metadata Filtering Before Vector Search: The Recall Win Nobody Measures

Comments
7 min read
Why my first RAG layer starts in Postgres, not in a standalone vector database

Why my first RAG layer starts in Postgres, not in a standalone vector database

Comments
3 min read
I Built 48 Production AI Systems in 60 Days — Here Is What Nobody Tells You About Real AI Engineering

I Built 48 Production AI Systems in 60 Days — Here Is What Nobody Tells You About Real AI Engineering

1
Comments 1
8 min read
I built a "boring" RAG demo over World Cup data — SQLite, sqlite-vec, and no framework

I built a "boring" RAG demo over World Cup data — SQLite, sqlite-vec, and no framework

Comments
4 min read
👋 Sign in for the ability to sort posts by relevant, latest, or top.