Skip to content

Add Detection Guidance: false positives, human-writing signs, LLM idiolects#113

Open
philippdubach wants to merge 1 commit into
blader:mainfrom
philippdubach:detection-guidance
Open

Add Detection Guidance: false positives, human-writing signs, LLM idiolects#113
philippdubach wants to merge 1 commit into
blader:mainfrom
philippdubach:detection-guidance

Conversation

@philippdubach
Copy link
Copy Markdown

Summary

Most of this skill tells the editor what to remove. This adds the inverse — what to leave alone, and how to decide.

Sourced from Wikipedia: Signs of AI writing (revision fetched 2026-05-01), specifically the "Ineffective indicators", "Signs of human writing", and "Differences between LLMs" sections.

Three subsections, no new patterns and no version bump:

  • What NOT to flag (false positives) — indicators that look AI-coded but are actually neutral: perfect grammar, em dashes alone, curly quotes alone, formal vocabulary, common transition words. The over-editing risk is real — if the skill is applied too aggressively, it strips legitimate prose. Closes with the "clusters matter, isolated signs don't" rule.
  • Signs of human writing (preserve these) — positive markers that should be left untouched: specific detail, mixed feelings, era-bound references, sentence-length variation, parenthetical self-corrections, and the November 30, 2022 cutoff for ruling out AI involvement entirely.
  • LLM Idiolects — quick triage notes per model family (ChatGPT/Grok verbose with artifacts; Gemini/Claude concise, no curly quotes by default). Tendencies, not rules.

The new section sits between §29 (Fragmented Headers) and the Process section. README and WARP.md are unchanged — the README pattern table is unaffected since this is meta-guidance, not new patterns.

Why this matters

The current skill is a strong "what to fix" guide but doesn't push back against over-editing. In practice, applying any pattern-based rewriter too aggressively is the failure mode — humans get their voice flattened. A short "what NOT to flag" sidebar plus a positive-signals list keeps the editor honest.

This PR is independent of #111 (era vocab + gap-filling) and #112 (patterns 30–34), and can land on its own without rebasing.

Test plan

  • Single-file change to SKILL.md, +45 lines
  • Pattern numbering and existing sections unchanged
  • New section is purely additive (no modified pattern content)
  • Skill loads in Claude Code with no parse errors

Source: Wikipedia:Signs of AI writing.

…olects

Most of this skill tells the editor what to remove. This adds the
inverse — what to leave alone, and how to decide.

Sourced from Wikipedia: Signs of AI writing (revision fetched
2026-05-01), specifically the "Ineffective indicators", "Signs of
human writing", and "Differences between LLMs" sections.

Three subsections, no new patterns:

- "What NOT to flag (false positives)" — the indicators that look
  AI-coded but are actually neutral (perfect grammar, em dashes
  alone, curly quotes alone, formal vocabulary, common transition
  words). The over-editing risk is real: if the skill is applied
  too aggressively, it strips legitimate prose. Closes with the
  "clusters matter, isolated signs don't" rule.

- "Signs of human writing (preserve these)" — positive markers
  that should be left untouched: specific detail, mixed feelings,
  era-bound references, sentence-length variation, parenthetical
  self-corrections, and the November 30, 2022 cutoff for ruling
  out AI involvement entirely.

- "LLM Idiolects" — quick triage notes per model family
  (ChatGPT/Grok verbose with artifacts; Gemini/Claude concise, no
  curly quotes by default). Tendencies, not rules.

No pattern-count change. No README changes (the README's pattern
table is unaffected since this section is meta-guidance, not new
patterns). No version bump.
philippdubach added a commit to philippdubach/humanizer that referenced this pull request May 1, 2026
Brings the fork's main branch in line with the maintained local
v2.6.0, consolidating the changes that are also opened as focused
PRs against blader/humanizer (blader#111, blader#112, blader#113):

- §7 expanded with era-specific AI vocabulary clusters (GPT-4 /
  GPT-4o / GPT-5 eras), plus 'bolstered' and 'meticulous' added to
  the master list and a literal-vs-figurative caveat.
- §21 renamed to "Knowledge-Cutoff Disclaimers and Speculative
  Gap-Filling"; covers the retrieval-augmented "maintains a low
  profile" / "keeps personal details private" speculation pattern.
- New patterns §30-34: reference-markup artifacts (turn0search0,
  oaicite, utm_source=chatgpt.com, etc.), placeholder leftovers,
  Markdown/wikitext contamination, formal "Conclusion" closers,
  didactic disclaimers.
- New Detection Guidance group: what NOT to flag (false positives),
  signs of human writing to preserve, and per-model LLM idiolects.

Frontmatter version bumped to 2.6.0. README pattern table updated
(29 → 34 patterns) with a new Artifacts and Contamination section
and a pointer to Detection Guidance. WARP.md count corrected from
the stale "25 patterns" to 34.

Sourced from Wikipedia: Signs of AI writing (revision fetched
2026-05-01).
duathron added a commit to duathron/humanizer-ext that referenced this pull request May 22, 2026
- Add DETECTION GUIDANCE section (false positives, human-writing
  signs, LLM idiolects) so editors know what NOT to flag (PR blader#113)
- Add Tier-1 AI-iness density pre-flight in Full mode; auto-drops to
  Quick when density = 0 to protect human-first drafts (PR blader#115 adapted)
- Expand blader#7 with era-specific vocabulary clusters (GPT-4 / GPT-4o /
  GPT-5 eras) and figurative-vs-literal caveat (PR blader#111)
- Expand blader#9 with "rather than" dismissals + on-the-table test (PR blader#85)
- Expand blader#14 with paired em dash bracketing + 4 fix options (PR blader#85)
- Expand blader#21 with speculative gap-filling ("maintains a low profile"
  template detection) (PR blader#111)
- Expand blader#23 with three more didactic disclaimers (subsumes pattern
  34 from PR blader#112)
- Expand blader#25 with structural "## Conclusion" section note
- Add pattern blader#35 Debunking-Pose Headings -- heading-level AI tells
  that slip through prose-only passes (PR blader#116)
- Add patterns blader#36 Conditional Frame Stacking and blader#37 Miscalibrated
  Epistemic Confidence (PR blader#85)
- Add patterns blader#38 Reference-Markup Artifacts, blader#39 Phrasal Templates /
  Placeholder Text, and blader#40 Markdown / Wikitext Contamination --
  three chat-UI copy-paste tells that confirm AI involvement (PR blader#112)
- Extend domain overrides for blader#35-37; blader#38-40 are universal
- Extend final AI audit from 9 to 13 points
- README: pattern count 34 -> 40, three new section rows, updated
  fork-differentiator table, 3.2.0 version-history entry

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant