Add Prompt Duel Optimizer (PDO) - Label-Free Prompt Optimization #53

heyjustinai · 2025-10-23T18:15:48Z

What does this PR do?

Introduces Prompt Duel Optimizer (PDO) - a label-free prompt optimization strategy based on our research paper (arXiv:2510.13907). PDO uses dueling bandits and Thompson sampling to optimize prompts without requiring labeled validation data.

Key Features:

Label-free optimization using LLM judge for pairwise comparisons
Double Thompson Sampling for efficient prompt selection
Top-performer guided mutation for prompt evolution
Outperforms baselines on BIG-bench Hard and MS MARCO

What's Included:

Core PDO implementation in src/prompt_ops/core/pdo/
Example config: configs/pdo-example.yaml
Use case: use-cases/web-of-lies-pdo/ (logical reasoning benchmark)
Updated README and documentation

Test Plan

cd use-cases/web-of-lies-pdo
prompt-ops migrate --config config.yaml

Full test: Run with default 30 rounds for complete benchmark results.

…on engine

…ation

… bandit approach

…dits and Thompson sampling

… ranking systems, and meta prompts for enhanced prompt optimization

…installation and clarify PyPI naming transition

…DME, including paper link and use case demonstration

heyjustinai added 10 commits October 23, 2025 10:29

feat: add LiteLLM model adapter for inference

7d3bc92

feat: introduce centralized meta prompt templates for QPDO optimizati…

2aacb4d

…on engine

feat: implement ranking systems and Thompson sampling for PDO optimiz…

03676eb

…ation

feat: add PDO module and optimization engine with support for dueling…

372d57c

… bandit approach

feat: implement PDOStrategy for prompt optimization using dueling ban…

5c15352

…dits and Thompson sampling

feat: rename and PDO module components including optimization engine,…

53fa826

… ranking systems, and meta prompts for enhanced prompt optimization

usecase: add configuration and dataset for Web of Lies PDO optimization

dc339ec

docs: update installation instructions in README to recommend source …

654a411

…installation and clarify PyPI naming transition

docs: add announcement for the new Prompt Duel Optimizer (PDO) in REA…

0237a99

…DME, including paper link and use case demonstration

feat: added eval for Web of Lies PDO prompt optimization

72eb96c

meta-cla bot added the CLA Signed This label is managed by the Meta Open Source bot. label Oct 23, 2025

heyjustinai merged commit b937782 into main Oct 23, 2025
9 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Prompt Duel Optimizer (PDO) - Label-Free Prompt Optimization #53

Add Prompt Duel Optimizer (PDO) - Label-Free Prompt Optimization #53

Uh oh!

heyjustinai commented Oct 23, 2025

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add Prompt Duel Optimizer (PDO) - Label-Free Prompt Optimization #53

Add Prompt Duel Optimizer (PDO) - Label-Free Prompt Optimization #53

Uh oh!

Conversation

heyjustinai commented Oct 23, 2025

What does this PR do?

Test Plan

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants