Designing reliable AI Systems for Production Environments
What I'm Learning About Enterprise AI Architecture.
Subscribe via RSS to get the latest articles and updates imediately
Copy link to your RSS reader
https://raulferrergarcia.com/rss.xml
What I'm Learning About Enterprise AI Architecture.
A prompt injection attack on an educational AI system is not just a security incident. Under EU AI Act Article 9, it is a risk your management system should have anticipated, documented, and mitigated before deployment.
Naive RAG retrieval relies on vector similarity alone. This fails silently when documents are semantically similar but contextually irrelevant. Re-ranking adds a second filtering gate that evaluates actual relevance. This improves accuracy 16%, costs 14x latency, and creates an auditable decision trail required by the EU AI Act.
RAG isn’t a single pattern, but a family of architectures. And choosing the wrong architecture isn’t a performance problem, but a design flaw.
The gap between the architecture diagram and the working system , documented from the inside.
Underneath all the layers of modern AI sits a single architectural breakthrough, the Transformer. Understanding it is not an academic curiosity; it's a requirement for designing reliable systems.
RSS (Really Simple Syndication) is a standard format for publishing frequently updated content. With RSS, you can:
It is recommended to use Feedly, Inoreader or other RSS readers to subscribe to this site.