cyberdefence.wiki
MOHAMMAD SHADAB SHAIKH
I break AI Agents
so the bad guys can't.
Specialized Red Teaming and Prompt Injection research. I help companies find the gap between what their AI is told to do, and what it actually does in production.
π Gray Swan Arena PG Top 50
π‘οΈ NASA & Meta Acknowledged
00
Gray Swan Arena Proof of Work
ADVERSARIAL IMPACT
1,633
TOTAL AI MODEL BREAKS
CHALLENGE
BREAKS
RANK
Indirect Injection Q1 2026
340
#19
Safeguards
638
#21
Staged Attack
384
#26
Proving Ground
265
#28
01
Attack Specializations
π
Prompt Injection
Injecting malicious instructions into AI context windows to override system prompts and fully hijack agent behavior.
CRITICAL
πΈοΈ
Multi-Agent Exploitation
Poisoning parent-to-subagent delegation messages to bypass restrictions enforced only at the subagent level.
CRITICAL
π―
Goal Hijacking
Redirecting AI agents away from their intended purpose to perform unauthorized tasks through social engineering.
HIGH
π€
Session Spoofing
Manipulating AI agents into believing they serve a different authenticated user to expose private data across accounts.
CRITICAL
π
Fake Document Injection
Crafting counterfeit retrieved policy documents that AI agents treat as authoritative sources of truth.
HIGH
π
Capability Discovery
Extracting hidden tool names, function signatures, and internal architecture from AI agents through creative framing.
MEDIUM
02
Hacked & Secured
NASA
Β© 2026 MOHAMMAD SHADAB SHAIKH
OPEN TO AI SECURITY ROLES
BREAKING AI β ETHICALLY