trpc-group · YAO-001 · Jul 4, 2026 · Jul 4, 2026 · Jul 4, 2026 · Jul 4, 2026
diff --git a/.github/PULL_REQUEST_TEMPLATE.md b/.github/PULL_REQUEST_TEMPLATE.md
@@ -0,0 +1,63 @@
+## Summary
+
+Implements Issue #90 Tool Script Safety Guard as an opt-in pre-execution guard
+for Python and Bash tool scripts.
+
+## Issue #90 Acceptance Checklist
+
+- [ ] Scans script/command text, command-line arguments, cwd, env, and tool metadata.
+- [ ] Produces `allow`, `deny`, and `needs_human_review` decisions.
+- [ ] Supports Python and Bash scanners.
+- [ ] Supports YAML policy configuration, including strict validation.
+- [ ] Emits structured reports with decision, risk type, rule, evidence, and recommendation.
+- [ ] Writes sanitized audit JSONL events and OpenTelemetry attributes.
+- [ ] Includes manifest-driven samples with high-risk detection >= 90%.
+- [ ] Covers secret-read, dangerous-delete, and non-whitelist-network samples with no allow decisions.
+- [ ] Keeps 500-line script scanning under 1 second in the safety test suite.
+- [ ] Documents that static scanning is not a sandbox.
+- [ ] Preserves default behavior for existing Tool and CodeExecutor paths.
+
+## Code Path Mapping
+
+- Scanner, rules, policy, reports: `trpc_agent_sdk/tools/safety/`
+- CLI: `scripts/tool_safety_check.py`
+- Manifest report generation: `scripts/tool_safety_manifest_report.py`
+- Samples and policy: `examples/tool_safety/`
+- Safety tests: `tests/tools/safety/`
+
+## Validation
+
+```bash
+python -m pytest tests/tools/safety -q
+python scripts/tool_safety_manifest_report.py --strict-policy
+python scripts/tool_safety_check.py \
+  examples/tool_safety/samples/safe_bash.sh \
+  --language bash \
+  --policy examples/tool_safety/policy.yaml
+python scripts/tool_safety_check.py \
+  examples/tool_safety/samples/bash_pipe_exfiltration.sh \
+  --language bash \
+  --policy examples/tool_safety/policy.yaml
+```
+
+## Sample Matrix
+
+- Sample count: 52
+- Decision matches: 52/52
+- Required rule matches: 52/52
+- Categories include safe, secret-read, dangerous-delete, non-whitelist-network,
+  secret-exfiltration, dynamic-code, resource-exhaustion, and process execution.
+
+## Compatibility
+
+- `BashTool` safety guard remains disabled by default.
+- `UnsafeLocalCodeExecutor` safety guard remains disabled by default.
+- `needs_human_review` is not blocked unless `block_on_review=True`.
+
+## Known Limitations
+
+This is a deterministic static pre-execution guard, not a sandbox. It cannot
+guarantee safety against obfuscation, generated code, external binary behavior,
+runtime-only data flow, or interpreter/runtime bugs. Production deployments
+still need filesystem isolation, network egress control, resource limits, and
+runtime audit monitoring.
diff --git a/examples/tool_safety/PR_DESCRIPTION.md b/examples/tool_safety/PR_DESCRIPTION.md
@@ -0,0 +1,103 @@
+# Tool Script Safety Guard - Issue #90
+
+## Acceptance Mapping
+
+- Scans script/command content, command-line args, cwd, env metadata, and tool metadata.
+- Returns `allow`, `deny`, or `needs_human_review`.
+- Supports Python AST/text checks and Bash token/text checks.
+- Loads policy from YAML and supports strict policy validation.
+- Emits structured reports with decision, risk type, rule, evidence, and recommendation.
+- Writes sanitized audit JSONL and records OpenTelemetry safety attributes.
+- Provides a manifest-driven sample corpus with at least 12 samples.
+- Maintains high-risk detection at or above 90%.
+- Keeps secret-read, dangerous-delete, and non-whitelisted-network samples from allowing execution.
+- Keeps 500-line Bash and Python scripts under 1 second in the safety test suite.
+- Documents that static scanning is not a sandbox.
+- Keeps existing Tool and CodeExecutor behavior unchanged unless explicitly enabled.
+
+## Code Path Mapping
+
+- Scanner: `trpc_agent_sdk/tools/safety/_scanner.py`, `trpc_agent_sdk/tools/safety/_rules.py`
+- Policy: `trpc_agent_sdk/tools/safety/_policy.py`
+- Input extraction: `trpc_agent_sdk/tools/safety/_extractors.py`
+- Filter/Wrapper: `trpc_agent_sdk/tools/safety/_filter.py`, `trpc_agent_sdk/tools/safety/_wrapper.py`
+- BashTool integration: `trpc_agent_sdk/tools/file_tools/_bash_tool.py`
+- UnsafeLocalCodeExecutor integration: `trpc_agent_sdk/code_executors/local/_unsafe_local_code_executor.py`
+- CLI: `scripts/tool_safety_check.py`
+- Manifest report: `scripts/tool_safety_manifest_report.py`
+- Manifest and samples: `examples/tool_safety/samples/manifest.yaml`, `examples/tool_safety/samples/`
+- Reports: `examples/tool_safety/all_reports.json`
+- Audit: `trpc_agent_sdk/tools/safety/_audit.py`
+- OTel: `trpc_agent_sdk/tools/safety/_telemetry.py`
+- Custom rules API: `trpc_agent_sdk/tools/safety/_custom_rules.py`
+- Tests: `tests/tools/safety/`
+
+## Sample Corpus
+
+Current manifest size: 52 samples.
+
+Category counts:
+
+- `dangerous_delete`: 5
+- `denied_path_write`: 1
+- `dependency_install`: 1
+- `dynamic_code`: 2
+- `dynamic_delete`: 1
+- `dynamic_network`: 1
+- `network_non_whitelist`: 7
+- `network_whitelist`: 2
+- `process_control`: 1
+- `process_execution`: 1
+- `resource_exhaustion`: 5
+- `safe_local`: 7
+- `secret_exfiltration`: 8
+- `secret_output`: 2
+- `secret_read`: 6
+- `shell_features`: 1
+- `shell_injection`: 1
+
+## Validation Commands
+
+```bash
+pytest tests/tools/safety
+python scripts/tool_safety_manifest_report.py --strict-policy
+python scripts/tool_safety_check.py \
+  examples/tool_safety/samples/dangerous_delete.sh \
+  --language bash \
+  --policy examples/tool_safety/tool_safety_policy.yaml \
+  --strict-policy
+python scripts/tool_safety_check.py \
+  examples/tool_safety/samples/safe_python.py \
+  --language python \
+  --policy examples/tool_safety/tool_safety_policy.yaml \
+  --strict-policy
+```
+
+`examples/tool_safety/all_reports.json` is generated by:
+
+```bash
+python scripts/tool_safety_manifest_report.py --strict-policy
+```
+
+It is a deterministic normalized artifact: report `scan_id` and telemetry
+scan id are pinned to `manifest:<file>`, `timestamp` is pinned to
+`1970-01-01T00:00:00+00:00`, and elapsed duration fields are pinned to `0.0`
+before writing the committed JSON.
+
+## Default Compatibility
+
+- `BashTool` does not enable the safety guard by default.
+- `UnsafeLocalCodeExecutor` does not enable the safety guard by default.
+- Filter, Wrapper, Skill-like callable, and MCP-like callable payload paths are opt-in.
+- `needs_human_review` is not blocked by default unless `block_on_review=true`.
+
+## Known Limitations
+
+This is a deterministic static pre-execution guard, not a sandbox.
+
+It does not replace process sandboxing, least-privilege filesystem permissions,
+network egress controls, resource limits, or runtime audit and monitoring.
+
+Obfuscation, generated code, dynamic imports, external binary behavior, and
+environment-dependent behavior are handled conservatively where possible and may
+require human review.