Question 1

What AI systems do you test?

Accepted Answer

We test LLM-based apps, chatbots, AI assistants, ML-driven features, and AI integrations including retrieval-augmented generation (RAG), tool-calling, and multi-agent workflows.

Question 2

Do you test AI agents, coding assistants, and MCP servers?

Accepted Answer

Yes. We test the AI agents and integrations your organisation runs internally, focused on what they can access and how that access could be abused: MCP server permissions, scoped keys versus prompt-based guards, hook and validation bypasses, and secret exposure in fast-built applications.

Question 3

Is a prompt instruction enough to control an AI agent?

Accepted Answer

No. A prompt is a suggestion, not a security boundary. An agent can route around an instruction, including by writing and running a script to do something it was told not to do directly. Real controls live in scoped permissions and restricted credentials, which is exactly what we test.

Question 4

Can you test AI features in a multi-tenant SaaS platform?

Accepted Answer

Yes. For SaaS products we focus on what matters most when an AI assistant acts on a user's behalf: tenant isolation, authorization and rights checks inside the assistant, privilege escalation through the AI layer, excessive agency, unauthorized or unintended tool calls, and resource-consumption abuse that could affect platform stability. The AI should only ever act within the permissions of the user invoking it, and we verify that this holds under attack.

Question 5

Do you test prompt injection and jailbreaks?

Accepted Answer

Yes. We assess direct and indirect prompt injection, instruction override, jailbreaks of system prompts, RAG poisoning, model confusion, and guardrail bypass paths, including tool-call abuse and data exfiltration through AI features. We work from the OWASP Top 10 for LLM Applications and the OWASP Top 10 for Agentic Applications (2026), extended with context-specific attack chains, and every finding is manually validated by our ethical hackers rather than AI-only scanning.

Question 6

Does your AI pentest satisfy EU AI Act Article 15 requirements?

Accepted Answer

Yes. High-risk AI systems under Annex III of the EU AI Act must comply with Article 15 cybersecurity requirements by 2 December 2027 (stand-alone systems) or 2 August 2028 (AI embedded in regulated products under Annex I), following the Digital Omnibus agreement of 7 May 2026. Our AI penetration test produces findings mapped to Article 15 obligations, giving your compliance team auditable evidence of conformity.

Question 7

Can you test for data leakage?

Accepted Answer

Yes. We test for unintended disclosure of sensitive data via model outputs, memory, retrieval sources, file and tool access, and other AI interfaces.

Question 8

Can this be combined with a classic application pentest?

Accepted Answer

Absolutely. Many AI risks live in the surrounding application, APIs, authentication, and integrations. We can combine both assessments for a complete view and a single unified evidence package.

Question 9

Do you provide an AI Cyber Security Risk Assessment?

Accepted Answer

Yes. Our AI Cyber Security Risk Assessment maps your AI models, agents, and integrations against ETSI EN 304 223, the first European standard defining baseline cybersecurity requirements for AI systems, built on the NCSC/CISA Guidelines for Secure AI System Development. We cover its 13 principles across the full lifecycle, from secure design to secure end of life, and combine the assessment with hands-on penetration testing so every identified risk is validated by our ethical hackers instead of remaining a paper exercise.

What is AI Systems Penetration Testing?

What is the testing scope?

How do we approach AI testing?

Prompt Analysis

Guardrail Testing

Agent and MCP permissions

Coding agents and agentic chains

Data Leakage

Practitioners who use AI themselves

What does EU AI Act Article 15 require?

What is a high-risk AI system?

Article 15: what it requires

Deadline: 2 December 2027

Audit evidence for conformity

How does an AI pentest differ from a classic pentest?

What's the same

What's unique to AI

Why you need both

Frequently Asked Questions

Further Reading

Assess your AI security posture