Hallucination
An LLM output that is fluent and confident but factually wrong or fabricated. Hallucinations cannot be eliminated; they are constrained via RAG grounding, source-citation requirements, low-confidence rejection, scope narrowing, and verification steps for high-stakes outputs.
Prompt injection
A class of attack where untrusted content (a document, an email, a web page) contains instructions that the LLM interprets as commands. Defenses include treating retrieved content as data not code, sandboxing tool access, structured outputs validated by a second pass, and never letting LLM-generated content trigger privileged operations without human review.
Model drift
Silent change in model behavior over time. Two flavors: vendor drift — the foundation model is silently updated by the provider and behaves differently on the same inputs; data drift — your input distribution has shifted but the model is unchanged. Detected by monitoring output distributions and a frozen evaluation set.
Low-confidence rejection
An architectural pattern where the AI declines to answer (or escalates to a human) when its confidence is below a threshold. The defining property of an AI assistant that survives audit: it knows what it does not know, and says so.
Guardrails
Pre- and post-processing layers around an LLM that block undesired inputs (prompt injection, PII leakage) and outputs (offensive content, policy violations, hallucinated entities). Guardrails are necessary but not sufficient — they catch the easy cases; architecture catches the hard ones.
Rollback / kill switch
A tested mechanism to instantly revert the AI system to a previous stable version, or to disable it entirely. Required as part of the six-control minimum baseline. "Tested" is the operative word: a rollback that has never been exercised is not a rollback, it is a wish.
AI observability
Instrumenting the AI system so you can answer "what happened in this conversation, why, and how often does this pattern occur?" Includes prompt logging, output logging, retrieval logging, latency metrics, cost metrics, error rates and feedback signals. Most production failures are obvious in observability data before they show up in user complaints.
Evaluation (LLM eval)
Measuring whether the AI does what it should. Production-grade eval has three layers: offline regression on a frozen test set (every release), online quality sampling (human-rated in production), and incident-rate tracking (escalations and rejections). Without all three you cannot tell whether a model update improved or regressed the system.
Slavin AI brand rule on certifications
Slavin AI / SLAtech LTD does not hold ISO 27001, SOC 2, ISO 42001 or any other formal AI / security certification. Engagements are advisory and architectural, not audit-house equivalents. Educational discussion of these frameworks on this site is content about the field, not a self-cert claim.