All posts
-
What Is Garak LLM Scanner? A Practitioner's Guide to NVIDIA's Open-Source LLM Vulnerability Tool
Garak is NVIDIA's open-source LLM vulnerability scanner that red-teams language models for jailbreaks, prompt injection, hallucination, data leakage, and
-
Best LLM Vulnerability Scanners 2026: Garak, PyRIT, Promptfoo, and Mindgard Compared
A practitioner's guide to the best LLM vulnerability scanners in 2026 — Garak v0.15.0, PyRIT, Promptfoo (now OpenAI), and Mindgard.
-
Open Source LLM Red Teaming Tools: PyRIT, Garak, HarmBench, and What to Use When
A practitioner's guide to the main open source LLM red teaming tools — PyRIT, Garak, HarmBench, TextAttack — what each does, what it misses, and how to
-
Automated LLM Red-Teaming in CI: garak vs PyRIT vs Promptfoo
Three open-source tools can gate your pipeline on LLM security findings — garak, PyRIT, and Promptfoo. A practitioner comparison of how each fits CI/CD
-
Choosing an LLM Guardrail: Llama Guard, NeMo Guardrails, Guardrails AI
A decision guide for picking an LLM guardrail in 2026 — Meta's Llama Guard 4, NVIDIA's NeMo Guardrails, and Guardrails AI.
-
Prompt Injection Detectors: Rebuff vs Vigil vs LLM Guard
Compare open-source prompt injection detectors Rebuff, Vigil, and LLM Guard on detection architecture, maintenance status, and which one to actually
-
LLM Guard: Input and Output Scanning for Production LLM Apps
A practical breakdown of LLM Guard by Protect AI — its input and output scanners, how the sanitize/scan pipeline works, where it fits as a runtime
-
PyRIT: Microsoft's AI Red-Teaming Framework, Explained
A technical breakdown of PyRIT, Microsoft's Python Risk Identification Tool for generative AI — its target/dataset/orchestrator/converter/scorer
-
False Positive Cost in Refusal Systems: Measure and Tune
Practical methods for quantifying the cost of refusal false positives in LLM products — eval design, baseline rates, threshold tuning, and the regression
-
Best LLM Security Scanners: Open-Source and Enterprise Compared
A practitioner's comparison of the best LLM security scanners — Garak, PyRIT, LLM Guard, Promptfoo, Vigil, and enterprise options.
-
Garak LLM Vulnerability Scanner: How It Works and When to Use It
A technical breakdown of the garak LLM vulnerability scanner — its probe architecture, supported attack categories, CLI workflow, and how it fits into a
-
Classifier-on-Output: Catching Misbehavior Post-Generation
How production teams use post-generation classifiers to catch what input filters and refusal training miss — architectures, tradeoffs, and where output
-
Llama Guard vs NeMo vs OpenAI Moderation: Production Tradeoffs
A practitioner comparison of Llama Guard, NeMo Guardrails, and the OpenAI Moderation API — coverage, latency, customization, and where each one breaks in