Tag
#ai-safety
3 posts tagged ai-safety.
- Tools
What Is Garak LLM Scanner? A Practitioner's Guide to NVIDIA's Open-Source LLM Vulnerability Tool
Garak is NVIDIA's open-source LLM vulnerability scanner that red-teams language models for jailbreaks, prompt injection, hallucination, data leakage, and
- guardrails
Choosing an LLM Guardrail: Llama Guard, NeMo Guardrails, Guardrails AI
A decision guide for picking an LLM guardrail in 2026 — Meta's Llama Guard 4, NVIDIA's NeMo Guardrails, and Guardrails AI.
- guardrails
Classifier-on-Output: Catching Misbehavior Post-Generation
How production teams use post-generation classifiers to catch what input filters and refusal training miss — architectures, tradeoffs, and where output