feat(evaluators): ATR regex-based threat detection evaluator

### Problem

Agent Control's evaluator ecosystem has Cisco AI Defense (cloud API) and Galileo Luna (LLM-based), but no **local, regex-based** evaluator for detecting known AI agent threat patterns without API keys or network calls.

### Proposed solution

A contrib evaluator built on [ATR (Agent Threat Rules)](https://github.com/anthropics/agent-threat-rules), a community-maintained set of regex rules for AI agent threats.

```python
# Usage
import asyncio

from agent_control_evaluator_atr.threat_rules import ATREvaluator, ATRConfig

evaluator = ATREvaluator(ATRConfig(
    min_severity="medium",
    categories=["prompt-injection", "tool-poisoning"],
))

async def main() -> None:
    # evaluate() is a coroutine, so it must be awaited inside an async function
    result = await evaluator.evaluate("Ignore all previous instructions...")
    print(result)
    # e.g. EvaluatorResult(matched=True, confidence=0.9, metadata={"findings": [...]})

asyncio.run(main())
```

**Key characteristics:**
- `atr.threat_rules` evaluator name, auto-discovered via entry points
- 20 rules, 306 patterns covering OWASP Agentic Top 10
- Configurable: `min_severity`, `categories` filter, `block_on_match`, `on_error` (fail-open/closed)
- Pure regex: no API keys, no network calls, <5ms per evaluation
- Returns all matching rules (not just first match) with metadata
- Follows the Cisco evaluator pattern exactly (pyproject.toml, Makefile, entry points)
- Rules maintained at [agentthreatrule.org](https://agentthreatrule.org) (MIT licensed)
- ATR is already used by [Cisco AI Defense](https://github.com/cisco-ai-defense/mcp-scanner/pull/79)
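
To make the filtering and multi-match behavior above concrete, here is a minimal standalone sketch of the idea. It is not the proposed implementation: the `Rule` and `SketchConfig` classes, the two demo rules, the severity ordering, and the `"open"`/`"closed"` values for `on_error` are all assumptions made for illustration, and the real ATR rules and `ATRConfig` may be shaped differently.

```python
import re
from dataclasses import dataclass
from typing import Optional

# Assumed severity ordering; the real rule set may use different levels.
SEVERITY_ORDER = {"low": 0, "medium": 1, "high": 2, "critical": 3}


@dataclass(frozen=True)
class Rule:
    """Hypothetical in-memory form of one rule with its compiled patterns."""
    rule_id: str
    category: str
    severity: str
    patterns: tuple


@dataclass
class SketchConfig:
    """Hypothetical stand-in for the config options listed above."""
    min_severity: str = "low"
    categories: Optional[list] = None  # None means "all categories"
    on_error: str = "open"             # "open" = fail-open, "closed" = fail-closed


# Two made-up rules for illustration only; these are NOT real ATR rules.
DEMO_RULES = [
    Rule("demo-001", "prompt-injection", "high",
         (re.compile(r"ignore\s+(all\s+)?previous\s+instructions", re.I),)),
    Rule("demo-002", "tool-poisoning", "low",
         (re.compile(r"<hidden>.*?</hidden>", re.I | re.S),)),
]


def evaluate(text: str, config: SketchConfig, rules=DEMO_RULES) -> dict:
    """Return every rule that matches, after severity and category filtering."""
    try:
        threshold = SEVERITY_ORDER[config.min_severity]
        findings = []
        for rule in rules:
            if SEVERITY_ORDER[rule.severity] < threshold:
                continue  # below the configured minimum severity
            if config.categories is not None and rule.category not in config.categories:
                continue  # category filtered out
            if any(p.search(text) for p in rule.patterns):
                findings.append({"rule_id": rule.rule_id,
                                 "category": rule.category,
                                 "severity": rule.severity})
        # Collect all matching rules rather than stopping at the first one.
        return {"matched": bool(findings), "findings": findings}
    except Exception as exc:
        # Fail-closed reports a match on error; fail-open lets the input through.
        return {"matched": config.on_error == "closed",
                "findings": [], "error": str(exc)}
```

Calling `evaluate(text, SketchConfig(min_severity="medium"))` in this sketch skips the low-severity demo rule while still reporting every remaining rule that matches, which mirrors the "returns all matching rules" characteristic listed above.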

### Willingness to contribute

Yes. A full implementation is ready, with 22 tests covering detection, false-positive safety, config options, error handling, and multi-match behavior. Happy to submit a PR.
