Designing reliable AI Systems for Production Environments
What I'm Learning About Enterprise AI Architecture.
Subscribe via Atom to get the latest articles and updates immediately
Copy link to your Atom reader
https://raulferrergarcia.com/atom.xml
What I'm Learning About Enterprise AI Architecture.
A prompt injection attack on an educational AI system is not just a security incident. Under EU AI Act Article 9, it is a risk your management system should have anticipated, documented, and mitigated before deployment.
Naive RAG retrieval relies on vector similarity alone. This fails silently when documents are semantically similar but contextually irrelevant. Re-ranking adds a second filtering gate that evaluates actual relevance. This improves accuracy 16%, costs 14x latency, and creates an auditable decision trail required by the EU AI Act.
RAG isn’t a single pattern, but a family of architectures. And choosing the wrong architecture isn’t a performance problem, but a design flaw.
The gap between the architecture diagram and the working system , documented from the inside.
Underneath all the layers of modern AI sits a single architectural breakthrough, the Transformer. Understanding it is not an academic curiosity; it's a requirement for designing reliable systems.
Atom (Atom Syndication Format) is an XML-based standard for describing feeds and their items. With Atom, you can: