Promote fraud-detection example (IEEE-CIS) to main by ZhengyaoJiang · Pull Request #154 · WecoAI/weco-cli

ZhengyaoJiang · 2026-06-08T14:52:23Z

Promotes the fraud-detection example (PR #140) from dev to main.

This is an example-only change — no pyproject.toml version bump, so the Release
workflow will detect no version change and skip the PyPI publish (release_needed=false).

End-to-end validated twice (two independent synthetic-data fixtures) on the merged
dev code: prepare_data.py → evaluate.py runs the full pipeline (time-split, V-corr
pruning, label-encode, stratified 100K/25K subsample, LightGBM) and emits a parseable
auc_roc: line for both the strict and loose variants. Lint (ruff) green.
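For readers wondering what "parseable" means here, the following is a minimal sketch of how a harness might pull the metric out of the evaluator's stdout. Only the `auc_roc:` prefix comes from this PR; the numeric format, the `parse_auc` helper, and the range check are assumptions made for illustration, not the code Weco or `evaluate.py` actually uses.

```python
import re

# Assumption: the evaluator prints a line like "auc_roc: 0.9134".
# Only the "auc_roc:" prefix is stated in the PR; the number format is a guess.
AUC_LINE = re.compile(
    r"^auc_roc:\s*([0-9]*\.?[0-9]+(?:[eE][-+]?[0-9]+)?)\s*$",
    re.MULTILINE,
)


def parse_auc(stdout: str) -> float:
    """Return the last auc_roc value found in the evaluator output (hypothetical helper)."""
    matches = AUC_LINE.findall(stdout)
    if not matches:
        raise ValueError("no 'auc_roc:' line found in evaluator output")
    value = float(matches[-1])
    # AUC is a probability-like quantity, so anything outside [0, 1] is malformed.
    if not 0.0 <= value <= 1.0:
        raise ValueError(f"auc_roc out of range: {value}")
    return value
```

Taking the last match rather than the first is a design choice: if training logs happen to echo intermediate scores, the final line is the one most likely to be the reported metric.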

Reproducible Weco example on the IEEE-CIS Fraud Detection Kaggle dataset (real Vesta payment transactions), mirroring the published case study.

- examples/fraud-detection/: strict fit/transform API (FeatureBuilder + train_and_evaluate) that makes train/val leakage impossible by construction.
- examples/fraud-detection-loose/: earlier single-file build_features(train_df, val_df) API, kept for comparison.

End-to-end validated: prepare_data -> evaluate emits a parseable `auc_roc:` line through the full pipeline (time-split, V-corr pruning, label-encode, stratified 100K/25K subsample, LightGBM). Lint (ruff) green.

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: c1a4588c10

chatgpt-codex-connector · 2026-06-08T14:55:05Z

+        )
+        return 1
+
+    auc = train_and_evaluate(X_train_t, y_train, X_val_t, y_val)


Keep validation labels out of editable model code

When users run the documented Model-only or Full-pipeline scopes, model.py is one of the files Weco rewrites, but this call gives that editable code y_val before it produces predictions/AUC. In that context the strict API does not actually prevent validation-label leakage: a candidate can train/tune on X_val, y_val or directly return an inflated score, so the reported optimization result can be invalid despite the evaluator being frozen.
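One way to address this, sketched under assumptions: the editable code only returns scores for the validation rows, and the frozen evaluator alone holds `y_val` and computes the metric. The names `fit_predict`, `evaluate`, and `first_feature_model` are hypothetical and are not the example's actual API. The rank-based AUC below is a pure-Python stand-in so the sketch has no dependencies; the real example presumably uses a library metric alongside LightGBM.

```python
from typing import Callable, List, Sequence

Row = Sequence[float]
FitPredict = Callable[[Sequence[Row], Sequence[int], Sequence[Row]], Sequence[float]]


def auc_roc(y_true: Sequence[int], scores: Sequence[float]) -> float:
    """Rank-based AUC (Mann-Whitney U); tied scores share their average rank."""
    n = len(y_true)
    if n != len(scores):
        raise ValueError("y_true and scores must have the same length")
    order = sorted(range(n), key=lambda i: scores[i])
    ranks: List[float] = [0.0] * n
    i = 0
    while i < n:
        j = i
        # Extend j over the run of scores tied with position i.
        while j + 1 < n and scores[order[j + 1]] == scores[order[i]]:
            j += 1
        avg_rank = (i + j) / 2 + 1  # 1-based average rank of the tied run
        for k in range(i, j + 1):
            ranks[order[k]] = avg_rank
        i = j + 1
    n_pos = sum(1 for y in y_true if y == 1)
    n_neg = n - n_pos
    if n_pos == 0 or n_neg == 0:
        raise ValueError("AUC needs at least one positive and one negative label")
    pos_rank_sum = sum(r for r, y in zip(ranks, y_true) if y == 1)
    return (pos_rank_sum - n_pos * (n_pos + 1) / 2) / (n_pos * n_neg)


def evaluate(fit_predict: FitPredict, X_train, y_train, X_val, y_val) -> float:
    """Frozen side: y_val is never passed to the editable fit_predict."""
    scores = fit_predict(X_train, y_train, X_val)
    if len(scores) != len(y_val):
        raise ValueError("model must return exactly one score per validation row")
    return auc_roc(y_val, scores)


def first_feature_model(X_train, y_train, X_val):
    """Toy editable model (hypothetical): score each row by its first feature."""
    return [row[0] for row in X_val]
```

With this split, a candidate rewrite of the model file can still overfit to `X_val` features, but it cannot read the labels or return a fabricated score, because the metric is computed outside the editable code.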


chatgpt-codex-connector · 2026-06-08T14:55:05Z

+- **Run**:
+```bash
+cd examples/fraud-detection
+weco run --source train.py \


Point the quickstart at the strict example sources

This quickstart changes into examples/fraud-detection, but that directory has no train.py (it has features.py and model.py), and the CLI validation rejects missing source files. Users following the top-level README will fail before any evaluation runs; this should mirror the example README’s --sources features.py model.py command or point at fraud-detection-loose if train.py is intended.
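A docs check along the following lines could catch this kind of drift before users hit it. This is only a sketch: `missing_sources` is a hypothetical helper, not part of the Weco CLI, and it does not reproduce the CLI's own validation. It simply reports which file names listed in a quickstart command are absent from the example directory.

```python
from pathlib import Path
from typing import Iterable, List, Union


def missing_sources(example_dir: Union[str, Path], sources: Iterable[str]) -> List[str]:
    """Return the source file names that do not exist under example_dir (hypothetical helper)."""
    base = Path(example_dir)
    return [name for name in sources if not (base / name).is_file()]
```

Run against examples/fraud-detection, a call with `["train.py"]` would be expected to report that file as missing, while `["features.py", "model.py"]` would be expected to come back empty, given the layout described in the comment above.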


ZhengyaoJiang merged commit 58b4b01 into main Jun 8, 2026
2 checks passed

ZhengyaoJiang merged 1 commit into main from dev
