Skills review-ai-writing

Name: review-ai-writing
Author: openclaw

Detect AI-generated writing patterns in developer text — docs, docstrings, commit messages, PR descriptions, and code comments. Use when reviewing any text artifact for authenticity and clarity.

install

source · Clone the upstream repo

git clone https://github.com/openclaw/skills

Claude Code · Install into ~/.claude/skills/

T=$(mktemp -d) && git clone --depth=1 https://github.com/openclaw/skills "$T" && mkdir -p ~/.claude/skills && cp -r "$T/skills/anderskev/review-ai-writing" ~/.claude/skills/openclaw-skills-review-ai-writing && rm -rf "$T"

OpenClaw · Install into ~/.openclaw/skills/

T=$(mktemp -d) && git clone --depth=1 https://github.com/openclaw/skills "$T" && mkdir -p ~/.openclaw/skills && cp -r "$T/skills/anderskev/review-ai-writing" ~/.openclaw/skills/openclaw-skills-review-ai-writing && rm -rf "$T"

manifest: skills/anderskev/review-ai-writing/SKILL.md

Review AI Writing

Detect AI-generated writing patterns across developer text artifacts using parallel subagents.

Usage

/beagle-docs:review-ai-writing [--all] [--category <name>] [path]

Flags:

```
--all
```
- Scan entire codebase (default: changed files from main)

--category <name>

- Only check specific category:

content|vocabulary|formatting|communication|filler|code_docs

Path: Target directory (default: current working directory)

Instructions

1. Parse Arguments

Extract flags from

$ARGUMENTS

```
--all
```
- Full codebase scan
```
--category <name>
```
- Filter to specific category
Path - Target directory

2. Load Skills

Load required skills:

Skill(skill: "beagle-docs:review-ai-writing")
Skill(skill: "beagle-core:review-verification-protocol")

3. Determine Scope

# Default: changed files from main
git diff --name-only $(git merge-base HEAD main)..HEAD

# If --all flag: scan all text artifacts
find . -type f \( -name "*.md" -o -name "*.py" -o -name "*.ts" -o -name "*.tsx" -o -name "*.js" -o -name "*.jsx" -o -name "*.go" -o -name "*.rs" -o -name "*.java" -o -name "*.rb" -o -name "*.swift" -o -name "*.kt" -o -name "*.ex" -o -name "*.exs" \) ! -path "*/node_modules/*" ! -path "*/.git/*" ! -path "*/vendor/*" ! -path "*/__pycache__/*" ! -path "*/dist/*" ! -path "*/build/*"

If no files found, exit with: "No files to scan. Check your branch has changes or use --all."

4. Check for Existing LLM Artifacts Review

# Check if llm-artifacts review exists to avoid double-flagging
if [ -f .beagle/llm-artifacts-review.json ]; then
  echo "Found existing llm-artifacts review — will skip overlapping findings"
fi

Parse existing findings from

.beagle/llm-artifacts-review.json

if present. When consolidating, skip any finding where both the file:line and pattern type match an existing llm-artifacts finding (specifically

verbose_comment

and

over_documentation

types).

5. Classify Files by Type

Partition files into three groups:

Group File Types Patterns to Check

Prose

*.md

All 6 categories

Code Docs

*.py

*.ts

*.tsx

*.js

*.jsx

*.go

*.rs

*.java

*.rb

*.swift

*.kt

*.ex

*.exs

vocabulary, communication, filler, code_docs

Git Commit messages, PR descriptions content, vocabulary, communication, filler

For Git artifacts, collect recent commits:

# Commits on current branch not in main
git log --format="%H %s" $(git merge-base HEAD main)..HEAD

6. Spawn Parallel Subagents

If total items >= 4, spawn up to 3 subagents via

Task

tool. If

--category

is set, spawn a single agent for that category only.

Subagent 1: Prose Agent

Scope: Markdown files only Check: All 6 pattern categories Instructions:

Load
```
beagle-docs:review-ai-writing
```
skill
Read each markdown file
Scan for all pattern categories
Apply false positive checks from the skill
Return findings in the structured format

Subagent 2: Code Docs Agent

Scope: Source code files Check: vocabulary, communication, filler, code_docs categories Instructions:

Load
```
beagle-docs:review-ai-writing
```
skill
Extract docstrings and comments from each file
Scan for applicable pattern categories
Skip code itself — only check text in comments and docstrings
Return findings in the structured format

Subagent 3: Git Agent

Scope: Commit messages and PR descriptions Check: content, vocabulary, communication, filler categories Instructions:

Load
```
beagle-docs:review-ai-writing
```
skill
Read commit messages from the branch
If on a PR branch, read the PR description via
```
gh pr view --json body
```
Scan for applicable pattern categories
Use synthetic paths:
```
git:commit:<sha>
```
with line 0,
```
git:pr:<number>
```
with line 0
Return findings in the structured format

7. Consolidate Findings

Wait for all subagents to complete, then:

Merge all findings into a single list
Remove duplicates (same file:line and type)
Remove findings that overlap with
```
.beagle/llm-artifacts-review.json
```
Assign unique IDs (1, 2, 3...)
Group by category for display

8. Write JSON Report

Create

.beagle

directory if it doesn't exist:

mkdir -p .beagle

Write findings to

.beagle/ai-writing-review.json

{
  "version": "1.0.0",
  "created_at": "2025-01-15T10:30:00Z",
  "git_head": "abc1234",
  "scope": "changed",
  "files_scanned": 12,
  "commits_scanned": 5,
  "findings": [
    {
      "id": 1,
      "category": "vocabulary",
      "type": "ai_vocabulary_high",
      "file": "README.md",
      "line": 15,
      "original_text": "This library leverages cutting-edge algorithms to facilitate seamless data processing.",
      "description": "High-signal AI vocabulary: leverage, cutting-edge, facilitate, seamless",
      "suggestion": "This library uses streaming algorithms for fast data processing.",
      "risk": "Low",
      "fix_safety": "Safe",
      "fix_action": "rewrite"
    },
    {
      "id": 2,
      "category": "code_docs",
      "type": "tautological_docstring",
      "file": "src/auth.py",
      "line": 42,
      "original_text": "\"\"\"Get the user by ID.\"\"\"",
      "description": "Docstring restates function name get_user_by_id without adding value",
      "suggestion": "\"\"\"Raises UserNotFound if ID doesn't exist.\"\"\"",
      "risk": "Medium",
      "fix_safety": "Needs review",
      "fix_action": "rewrite"
    },
    {
      "id": 3,
      "category": "communication",
      "type": "chat_leak",
      "file": "git:commit:abc1234",
      "line": 0,
      "original_text": "Certainly! Here's the updated authentication flow",
      "description": "Chat leak in commit message: starts with 'Certainly! Here's'",
      "suggestion": "Update authentication flow",
      "risk": "Low",
      "fix_safety": "Safe",
      "fix_action": "rewrite"
    }
  ],
  "summary": {
    "total": 3,
    "by_category": {
      "vocabulary": 1,
      "code_docs": 1,
      "communication": 1
    },
    "by_risk": {
      "Low": 2,
      "Medium": 1
    },
    "by_fix_safety": {
      "Safe": 2,
      "Needs review": 1
    }
  }
}

9. Display Summary

## AI Writing Review

**Scope:** Changed files from main
**Files scanned:** 12 | **Commits scanned:** 5

### Findings by Category

#### Vocabulary (1 issue)

1. [README.md:15] **AI vocabulary** (Low, Safe)
   - High-signal AI vocabulary: leverage, cutting-edge, facilitate, seamless
   - Suggestion: Rewrite with simple words

#### Code Docs (1 issue)

2. [src/auth.py:42] **Tautological docstring** (Medium, Needs review)
   - Docstring restates function name without adding value
   - Suggestion: Add meaningful information or delete

#### Communication (1 issue)

3. [git:commit:abc1234:0] **Chat leak** (Low, Safe)
   - Commit message starts with "Certainly! Here's"
   - Suggestion: Rewrite as imperative commit message

### Summary Table

| Category | Safe | Needs Review | Total |
|----------|------|--------------|-------|
| Vocabulary | 1 | 0 | 1 |
| Code Docs | 0 | 1 | 1 |
| Communication | 1 | 0 | 1 |
| **Total** | **2** | **1** | **3** |

### Next Steps

- Run `/beagle-docs:humanize-beagle` to apply fixes
- Run `/beagle-docs:humanize-beagle --dry-run` to preview changes first
- Review the JSON report at `.beagle/ai-writing-review.json`

10. Verification

Before completing, verify:

JSON validity: Confirm
```
.beagle/ai-writing-review.json
```
exists and is parseable
Subagent success: All spawned subagents completed without errors
Git HEAD captured: The
```
git_head
```
field is non-empty
No double-flagging: Cross-check against
```
.beagle/llm-artifacts-review.json
```
if it exists

# Verify JSON is valid
python3 -c "import json; json.load(open('.beagle/ai-writing-review.json'))" 2>/dev/null && echo "Valid JSON" || echo "Invalid JSON"

If any verification fails, report the error and do not proceed.

Output Format for Each Finding

[FILE:LINE] ISSUE_TITLE
- Category: content | vocabulary | formatting | communication | filler | code_docs
- Type: specific_pattern_name
- Original: "the problematic text"
- Suggestion: "the improved text" or "delete"
- Risk: Low | Medium
- Fix Safety: Safe | Needs review

Rules

Always load

beagle-docs:review-ai-writing

and

beagle-core:review-verification-protocol

first

Use
```
Task
```
tool for parallel subagents when >= 4 items to scan
Every finding MUST have file:line reference (use synthetic paths for git artifacts)
Do not flag false positives listed in the skill
Do not duplicate findings from
```
.beagle/llm-artifacts-review.json
```
Create
```
.beagle
```
directory if needed
Write JSON report before displaying summary

Reference Material

AI Writing Detection for Developer Text

Detect patterns characteristic of AI-generated text in developer artifacts. These patterns reduce trust, add noise, and obscure meaning.

Pattern Categories

Category	Reference	Key Signals
Content	`references/content-patterns.md`	Promotional language, vague authority, formulaic structure, synthetic openers
Vocabulary	`references/vocabulary-patterns.md`	AI word tiers, copula avoidance, rhetorical devices, synonym cycling, commit inflation
Formatting	`references/formatting-patterns.md`	Boldface overuse, emoji decoration, heading restatement
Communication	`references/communication-patterns.md`	Chat leaks, cutoff disclaimers, sycophantic tone, apologetic errors
Filler	`references/filler-patterns.md`	Filler phrases, excessive hedging, generic conclusions
Code Docs	`references/code-docs-patterns.md`	Tautological docstrings, narrating obvious code, "This noun verbs", exhaustive enumeration

Scope

Scan these artifact types:

Artifact	File Patterns	Notes
Markdown docs	`*.md`	READMEs, guides, changelogs
Docstrings	`.py` , `.ts` , `.js` , `.go` , `.swift` , `.rs` , `.java` , `.kt` , `.rb` , `.ex`	Language-specific docstring formats
Code comments	Same as docstrings	Inline and block comments
Commit messages	`git log` output	Use synthetic path `git:commit:<sha>`
PR descriptions	GitHub PR body	Use synthetic path `git:pr:<number>`

What NOT to Scan

Generated code (lock files, compiled output, vendor directories)
Third-party content (copied license text, vendored docs)
Code itself (variable names, string literals used programmatically)
Test fixtures and mock data

Detection Rules

High-Confidence Signals (Always Flag)

These patterns are strong indicators of AI-generated text:

Chat leaks — "Certainly!", "I'd be happy to", "Great question!", "Here's" as sentence opener
Cutoff disclaimers — "As of my last update", "I cannot guarantee"
High-signal AI vocabulary — delve, utilize (as "use"), whilst, harnessing, paradigm, synergy
"This noun verbs" in docstrings — "This function calculates", "This method returns"
Synthetic openers — "In today's fast-paced", "In the world of"
Sycophantic code comments — "Excellent approach!", "Great implementation!"

Medium-Confidence Signals (Flag in Context)

Flag when 2+ appear together or pattern is repeated:

Low-signal AI vocabulary clusters — 3+ words from the low-signal list in one section
Formulaic structure — Rigid intro-body-conclusion in a README section
Heading restatement — First sentence after heading restates the heading
Excessive hedging — "might potentially", "could possibly", "it seems like it may"
Synonym cycling — Same concept called different names within one section
Boldface overuse — More than 30% of sentences contain bold text

Low-Confidence Signals (Note Only)

Mention but don't flag as issues:

Emoji in technical docs — May be intentional project style
Filler phrases — Some are common in human writing too
Generic conclusions — May be appropriate for summary sections
Commit inflation — Some teams prefer descriptive commits

False Positive Warnings

Do NOT flag these as AI-generated:

Pattern	Why It's Valid
"Ensure" in security docs	Standard term for security requirements
"Comprehensive" in test coverage discussion	Accurate technical descriptor
Formal tone in API reference docs	Expected register for reference material
"Leverage" in financial/business domain code	Domain-specific meaning, not AI filler
Bold formatting in CLI help text	Standard convention
Structured intro paragraphs in RFCs/ADRs	Expected format for these document types
"This module provides" in Python `__init__.py`	Idiomatic Python module docstring
Rhetorical questions in blog posts	Appropriate for informal content

Integration

With

beagle-core:review-verification-protocol

Before reporting any finding:

Read the surrounding context (full paragraph or function)
Confirm the pattern is AI-characteristic, not just formal writing
Check if the project has established conventions that match the pattern
Verify the suggestion improves clarity without changing meaning