Silmaril

The world's first self-healing prompt injection defense

Book a Demo

SILMARIL HACKED

MicrosoftOpenAIAnthropicGooglePerplexityDropbox

Problem

01 // PUBLIC INPUTS

Public inputs are now agent attack surfaces

ShareLeak and PipeLeak used public fields to hijack Copilot and Agentforce agents, leaking SharePoint and CRM data by email. CVE-2026-21520 · CVSS 7.5

02 // TRUSTED CONTEXT

Trusted context can carry hidden instructions

CurseChain used hidden README comments in Cursor to steal SSH keys and poison later projects with regenerated exfiltration code.

03 // OPEN WEB

The open web is becoming an attack delivery layer

Forcepoint and Google found live website payloads for API-key theft, financial fraud, data destruction, and agent denial of service.

Solution

Secure AI execution

Silmaril firewall

Silmaril wraps your inference calls to block harmful outcomes before they materialize.

Claude Code
Parallel coding agents
OpenClaw
Autonomous service fleets

Approach

01 // ATTACK

Finding vulnerabilities before attackers do

Autonomous agents probe your product through the UI, map trust boundaries, and chain prompt injection, tool abuse, and context poisoning into working exploits.

02 // PROTECT

Blocking attacks in real-time

A low-latency firewall classifier learns your application traces and blocks risky user intent, tool calls, context, and accumulated state in real time.

03 // RETRAIN

Turning every attack into a deployed defense

Every discovered attack becomes synthetic training data, updating defenses in under an hour and sharing anonymized protections across deployments.

Performance

Production attack data

131 attack techniques

Evaluated against production attack traces across prompt injection, tool misuse, and policy bypass chains.

Beyond input filtering

Silmaril reads intent, app context, and execution state as one signal, catching harmful outcomes before they complete.

Threat hunting closes the loop

Exploits found by agents in your environment become retraining signal, hardening defenses ahead of copycat probes.

Application-layer deployment

A small SDK wrapper works across major agent stacks, with managed or self-hosted controls and node-level blocking.

Threats Blocked

15 critical vulnerabilities disclosed to OpenAI, Anthropic, Google, and Microsoft in two weeks.

CASE STUDY
$68Mdamages prevented

Silmaril found the exploits, retrained the firewall, and blocked attack chains spanning:

  • Self-replicating worm propagation via document poisoning
  • Agent-to-agent supply chain compromise
  • Sandbox credential theft leading to cross-user remote code execution
  • Zero-click data exfiltration through calendar injection
  • Silent document and message harvesting via email injection
#1 AI-native productivity appCASE STUDY
CASE STUDY // HIGH
$20Mdamages prevented

Silmaril found the exploits, retrained the firewall, and blocked privilege abuse before production exposure.

  • Entity injection via feedback fields into agent context
  • Unauthorized workflow execution through tool-manipulation payloads
#1 AI-native analytics platformCASE STUDY
REPORT // CRITICAL
<5 minexploit execution

Silmaril hacked the ChatGPT agent by chaining prompt injection into escalated root access, lateral container movement, and source-code exposure.

OpenAIREPORT
REPORT // CRITICAL
Millionsof users patched

Critical prompt injection vulnerabilities using email as the entry vector, achieving data exfiltration through SSRF in Copilot. Microsoft patched the vulnerability for millions of users.

MicrosoftREPORT

FAQ

Guardrails pattern match against known attack signatures and fall behind as new techniques emerge. They evaluate inputs in isolation and cannot see attacks that emerge from the interaction between an agent, its tools, and its context. Silmaril's multihead classifier inspects user intent, application context, and execution states together, so it catches indirect injection, multi-turn chains, context poisoning, and confused deputy attacks that pattern-matching approaches miss entirely.

Win the Arms Race

Only adaptive defenses outpace AI augmented attackers.

Book a Demo