Tag

#ai-safety

3 posts tagged ai-safety.

Tools

What Is Garak LLM Scanner? A Practitioner's Guide to NVIDIA's Open-Source LLM Vulnerability Tool

Garak is NVIDIA's open-source LLM vulnerability scanner that red-teams language models for jailbreaks, prompt injection, hallucination, data leakage, and
June 20, 2026
guardrails

Choosing an LLM Guardrail: Llama Guard, NeMo Guardrails, Guardrails AI

A decision guide for picking an LLM guardrail in 2026 — Meta's Llama Guard 4, NVIDIA's NeMo Guardrails, and Guardrails AI.
May 19, 2026
guardrails

Classifier-on-Output: Catching Misbehavior Post-Generation

How production teams use post-generation classifiers to catch what input filters and refusal training miss — architectures, tradeoffs, and where output
May 6, 2026