Fix prompt template path listing when CWD differs from project root by simonrosenberg · Pull Request #470 · OpenHands/benchmarks

simonrosenberg · 2026-03-02T20:24:03Z

Summary

Fix ValueError in prompt template choice listing when the current working directory is not the project root
Changed p.relative_to(Path.cwd()) → str(p) in all 6 run_infer.py files
The prompt directory is already resolved to an absolute path, so str(p) gives a reliable absolute path regardless of CWD

Files changed

File	Change
`benchmarks/commit0/run_infer.py`	`p.relative_to(Path.cwd())` → `str(p)`
`benchmarks/multiswebench/run_infer.py`	`p.relative_to(Path.cwd())` → `str(p)`
`benchmarks/swebench/run_infer.py`	`p.relative_to(Path.cwd())` → `str(p)`
`benchmarks/swebenchmultimodal/run_infer.py`	`p.relative_to(Path.cwd())` → `str(p)`
`benchmarks/swefficiency/run_infer.py`	`p.relative_to(Path.cwd())` → `str(p)`
`benchmarks/swtbench/run_infer.py`	`p.relative_to(Path.cwd())` → `str(p)`

Context

Extracted from #455. The old code used p.relative_to(Path.cwd()) to build the --prompt-template argparse choices list. This raises ValueError when CWD is not an ancestor of the prompt directory (e.g., when running benchmarks as an installed package or from a different directory).

Validation

All benchmarks pass with this change — see CI results on #455:

Benchmark	Status	Run
swebench	✅	run
swtbench	✅	run
commit0	✅	run
swebenchmultimodal	✅	run
gaia	✅	run

Test plan

Verify benchmarks can be launched from a directory other than the project root
Verify --prompt-template argument still works with the listed choices

🤖 Generated with Claude Code

`p.relative_to(Path.cwd())` raises ValueError when the current working directory is not an ancestor of the prompt directory. Use `str(p)` with the already-resolved absolute path instead. Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>

all-hands-bot

🟡 Acceptable - Fixes a real bug but there's a cleaner solution.

VERDICT: ✅ Worth merging - Core logic is sound, improvement suggested

KEY INSIGHT: You're solving the right problem (CWD fragility) but using absolute paths in help text degrades UX when a simpler solution exists.

benchmarks/swebench/run_infer.py

all-hands-bot · 2026-03-02T20:26:23Z

benchmarks/swebench/run_infer.py

 def main() -> None:
    prompt_dir = (Path(__file__).parent / "prompts").resolve()
-    choices = [str(p.relative_to(Path.cwd())) for p in prompt_dir.glob("*.j2")]
+    choices = [str(p) for p in prompt_dir.glob("*.j2")]


🟢 Code Smell: This identical change in 6 files suggests you could extract get_prompt_choices(prompt_dir) to a shared utility in benchmarks/utils/. Not critical, but reduces duplication and makes future fixes easier.

all-hands-bot

Additional observation on code structure:

all-hands-bot · 2026-03-02T20:26:38Z

benchmarks/commit0/run_infer.py

 def main() -> None:
    prompt_dir = (Path(__file__).parent / "prompts").resolve()
-    choices = [str(p.relative_to(Path.cwd())) for p in prompt_dir.glob("*.j2")]
+    choices = [str(p) for p in prompt_dir.glob("*.j2")]


🟢 Nit - Code Duplication: This exact pattern (prompt directory setup + choices list + argparse) appears identically in all 6 benchmark files. Consider extracting to benchmarks/utils/prompts.py:

def get_prompt_arg_parser(script_file: Path) -> tuple[list[str], Path]: """Get prompt choices and default path for argparse.""" prompt_dir = (script_file.parent / "prompts").resolve() choices = [p.name for p in prompt_dir.glob("*.j2")] default = prompt_dir / "default.j2" assert default.exists(), f"Default prompt {default} not found" return choices, default

Not critical for this PR, but would make the next fix easier (you'd update one place instead of six).

Co-authored-by: OpenHands Bot <contact@all-hands.dev>

all-hands-bot reviewed Mar 2, 2026

View reviewed changes

Update benchmarks/swebench/run_infer.py

f16ec4e

Co-authored-by: OpenHands Bot <contact@all-hands.dev>

simonrosenberg closed this Mar 3, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix prompt template path listing when CWD differs from project root#470

Fix prompt template path listing when CWD differs from project root#470
simonrosenberg wants to merge 2 commits intomainfrom
fix/prompt-path-relative-to-cwd

simonrosenberg commented Mar 2, 2026

Uh oh!

all-hands-bot left a comment

Uh oh!

Uh oh!

all-hands-bot Mar 2, 2026

Uh oh!

all-hands-bot left a comment

Uh oh!

all-hands-bot Mar 2, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

simonrosenberg commented Mar 2, 2026

Summary

Files changed

Context

Validation

Test plan

Uh oh!

all-hands-bot left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

all-hands-bot Mar 2, 2026

Choose a reason for hiding this comment

Uh oh!

all-hands-bot left a comment

Choose a reason for hiding this comment

Uh oh!

all-hands-bot Mar 2, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants