Question 1

What types of threats does AIProxyGuard detect?

Accepted Answer

AIProxyGuard detects multiple threat categories: Prompt Injection (instruction hijacking), Jailbreak (safety bypass attempts), PII Detection (sensitive data leakage), Harmful Content, Social Engineering, Unicode Evasion (obfuscation attacks), and Child Safety filtering for both requests and responses. Each category has dedicated signatures and ML models.

Question 2

What are the different ways to use AIProxyGuard?

Accepted Answer

Two integration methods: (1) Docker Proxy — run locally as a reverse proxy with bundled signatures, or connect to our cloud for automatic signature updates and advanced ML models. (2) SDK — integrate our Python or Node.js SDK to call our hosted API and verify prompts before sending to your LLM. No Docker required, works with any architecture.

Question 3

Is AIProxyGuard free to use?

Accepted Answer

Yes. AIProxyGuard is open source under the Apache 2.0 license.

Docker Proxy: Self-host with bundled signatures at no cost. Create an account on aiproxyguard.com to get automatic signature updates and cloud features.

SDK / Hosted API: Generous free tier that you can integrate directly into your software using our SDKs. Sign up in your dashboard to get an API key. Check the quickstart guide to get running in 30 seconds.

Question 4

How do I protect my chatbot from jailbreak attacks?

Accepted Answer

Deploy AIProxyGuard as a proxy in front of your LLM provider, or use our SDK to check user messages before sending them. Both methods scan for jailbreak patterns like DAN mode, persona exploits, and restriction bypasses — blocking them before they reach your chatbot.

Question 5

Can I use AIProxyGuard with RAG applications?

Accepted Answer

Absolutely. RAG apps are vulnerable to indirect injection from retrieved documents. With SDK: check the combined prompt before sending. With Proxy: all requests are automatically scanned including retrieved context.

Question 6

Which LLM providers does AIProxyGuard support?

Accepted Answer

All of them. The Docker proxy works with OpenAI, Anthropic, Azure OpenAI, AWS Bedrock, Google Vertex AI, Grok, OpenRouter, Ollama, and any OpenAI-compatible API. The SDK works with any provider since you control when to call your LLM.

Question 7

How much latency does AIProxyGuard add?

Accepted Answer

Docker Proxy (self-hosted): sub-millisecond with bundled signatures. SDK/Hosted API: under 10ms average. Both use optimized signatures and lightweight ML models. The added latency is imperceptible compared to LLM response times.

Question 8

Is my data stored or logged?

Accepted Answer

Docker Proxy (self-hosted): you control everything, run fully offline with bundled signatures. SDK/Hosted API: prompts processed in memory, not stored. We log metadata for analytics but never prompt content.

Question 9

Can I customize detection policies and sensitivity?

Accepted Answer

Yes. All users can adjust sensitivity thresholds and configure how threats are handled (block, warn, or log). For custom policies assigned to your fleet of servers, you'll need a Pro/Enterprise plan.

Question 10

How do I secure my ChatGPT integration?

Accepted Answer

Point your OpenAI SDK at the AIProxyGuard proxy, or use our SDK to check prompts before calling the ChatGPT API. Both methods detect jailbreaks (DAN mode, developer mode exploits) and prompt injection attempts that try to override your system instructions.

Works with GPT-4, GPT-4o, GPT-3.5-turbo, and any OpenAI-compatible endpoint. See the quickstart to get protected in 30 seconds.

Question 11

Does AIProxyGuard work with Claude and Anthropic?

Accepted Answer

Yes. The Docker proxy supports Anthropic's Claude API natively. AIProxyGuard detects Claude-specific jailbreaks like human turn exploits and constitutional AI bypasses, in addition to universal prompt injection patterns.

Question 12

What is the difference between prompt injection and jailbreak?

Accepted Answer

Prompt injection tricks the model into following attacker instructions instead of yours — like SQL injection but for LLMs. Jailbreak bypasses the model's built-in safety guardrails to generate harmful content. AIProxyGuard detects both with dedicated signatures and ML models.

Frequently Asked Questions

Still have questions?