
PR-07: Skill Reorg — Utilities (Teil 1): Aphorisms, Browser, Cloudflare, CreateCLI, Documents, Evals, PAIUpgrade, Parser, Pdf#74

Merged
Steffen025 merged 5 commits into main from release/v3.0-pr07-utilities-reorg-teil1
Mar 16, 2026

Conversation


@Steffen025 Steffen025 commented Mar 16, 2026

PR-07: Utilities Reorg Part 1 (130 files)

Summary

Moves 11 skill directories into the Utilities/ namespace structure as part of the PAI v3.0 skill reorganization.

Affected Skills

  • Aphorisms → Utilities/Aphorisms
  • Browser → Utilities/Browser
  • Cloudflare → Utilities/Cloudflare
  • CreateCLI → Utilities/CreateCLI
  • CreateSkill → Utilities/CreateSkill
  • Delegation → Utilities/Delegation
  • Documents (incl. Docx, Pdf, Pptx, Xlsx) → Utilities/Documents + subdirectories
  • Evals → Utilities/Evals
  • PAIUpgrade → Utilities/PAIUpgrade
  • Parser → Utilities/Parser

Scope

  • 130 files (all RENAME/ADD, i.e. new paths)
  • No content changes; pure path reorganization
  • Old paths will be deleted in PR-10

Review Note

Review for correctness, OpenCode conformance, and remaining Claude→OpenCode issues.

Jira

WARNEX-81

Summary by CodeRabbit

  • New Features

    • Aphorisms: curated thematic aphorism database plus add/find/search/research workflows and usage tracking.
    • Browser Automation v2.0: code-first Playwright browser, persistent session API, CLI tools (navigate/screenshot/verify/extract/interact), examples, and test suite.
    • Evals Framework: new graders (code- & model-based), suite management, failure→task converter, Algorithm bridge and CLI utilities.
    • Docx tooling: OOXML editors with tracked-changes/comments, pack/unpack/validate scripts.
  • Documentation

    • Extensive SKILL.md and workflow docs across Aphorisms, Browser, Evals, Docx, CreateCLI/CreateSkill, Cloudflare.

Move 130 skill files into Utilities/ subdirectory structure:
- Aphorisms → Utilities/Aphorisms
- Browser → Utilities/Browser
- Cloudflare → Utilities/Cloudflare
- CreateCLI → Utilities/CreateCLI
- CreateSkill → Utilities/CreateSkill
- Documents (Docx, Pdf, Pptx, Xlsx) → Utilities/Documents
- Evals → Utilities/Evals
- Prompting → Utilities/Prompting
- SECUpdates → Utilities/SECUpdates
- System → Utilities/System
- VoiceServer → Utilities/VoiceServer

Part of PAI v3.0 skill reorganization (PR-07 of 12)

coderabbitai bot commented Mar 16, 2026

Note

Reviews paused

It looks like this branch is under active development. To avoid overwhelming you with review comments due to an influx of new commits, CodeRabbit has automatically paused this review. You can configure this behavior by changing the reviews.auto_review.auto_pause_after_reviewed_commits setting.

Use the following commands to manage reviews:

  • @coderabbitai resume to resume automatic reviews.
  • @coderabbitai review to trigger a single review.

📝 Walkthrough

Adds many new Utilities skills and tooling: an Aphorisms database and workflows; a Playwright-based Browser v2 (library, persistent session server, CLI, examples, tests); OOXML/Docx Python editor and pack/unpack/validate tools; a full Evals framework (graders, suite management, tools); plus numerous SKILL docs and workflows for CreateCLI, CreateSkill, Cloudflare, Documents, Delegation, and related utilities.

Changes

Cohort / File(s) / Summary

  • Aphorisms Skill
    Files: .opencode/skills/Utilities/Aphorisms/Database/aphorisms.md, .opencode/skills/Utilities/Aphorisms/SKILL.md, .opencode/skills/Utilities/Aphorisms/Workflows/*
    Adds a static aphorism database plus comprehensive skill documentation and workflows (add/find/search/research), a theme taxonomy, and usage tracking.

  • Browser Skill v2.0
    Files: .opencode/skills/Utilities/Browser/index.ts, .opencode/skills/Utilities/Browser/Tools/Browse.ts, .opencode/skills/Utilities/Browser/Tools/BrowserSession.ts, .opencode/skills/Utilities/Browser/package.json, .opencode/skills/Utilities/Browser/tsconfig.json, .opencode/skills/Utilities/Browser/*Workflows/*, .opencode/skills/Utilities/Browser/examples/*, .opencode/skills/Utilities/Browser/README.md, .opencode/skills/Utilities/Browser/SKILL.md
    Implements a PlaywrightBrowser class with an exported singleton and a rich API, a persistent BrowserSession HTTP server, a Bun CLI, examples, and comprehensive tests; adds package/tsconfig and workflow docs. Public TS types and the instance are exported.

  • Docx / OOXML Tools
    Files: .opencode/skills/Utilities/Docx/Scripts/*, .opencode/skills/Utilities/Docx/Scripts/document.py, .opencode/skills/Utilities/Docx/Scripts/utilities.py, .opencode/skills/Utilities/Docx/Scripts/__init__.py, .opencode/skills/Utilities/Docx/SKILL.md, .opencode/skills/Utilities/Docx/docx-js.md, .opencode/skills/Utilities/Docx/ooxml.md, .opencode/skills/Utilities/Docx/LICENSE.txt
    Adds a Python OOXML library (XMLEditor, DocxXMLEditor, Document manager) with tracked-changes/comments APIs, pack/unpack/validate scripts, pretty-printing, CLI entry points, and extensive DOCX/OOXML docs. New public classes and functions added.

  • Evals Framework
    Files: .opencode/skills/Utilities/Evals/*, .opencode/skills/Utilities/Evals/Graders/*, .opencode/skills/Utilities/Evals/Tools/*, .opencode/skills/Utilities/Evals/Suites/*, .opencode/skills/Utilities/Evals/Data/*
    Introduces System-Evals: PROJECT/SKILL docs, a base grader registry, many code- and model-based graders (registered), suite-management utilities, AlgorithmBridge and FailureToTask tools, templates and domain patterns; new exported TS APIs and CLI utilities.

  • CreateCLI & CreateSkill
    Files: .opencode/skills/Utilities/CreateCLI/*, .opencode/skills/Utilities/CreateSkill/*
    Adds documentation, patterns, and workflows for CLI and skill generation, canonicalization, migration, validation, and TypeScript patterns.

  • Delegation & Documents Skills
    Files: .opencode/skills/Utilities/Delegation/SKILL.md, .opencode/skills/Utilities/Documents/SKILL.md, .opencode/skills/Utilities/Documents/Workflows/ProcessLargePdfGemini3.md
    New Delegation guidance for multi-agent orchestration and a Documents skill with a multimodal PDF-processing workflow.

  • Cloudflare Skill
    Files: .opencode/skills/Utilities/Cloudflare/SKILL.md, .opencode/skills/Utilities/Cloudflare/Workflows/*
    Adds Cloudflare deployment and troubleshooting workflow docs with verification and browser-check integration.

  • Misc / Config & Small Docs
    Files: .coderabbit.yaml, assorted SKILL/workflow docs across Utilities
    Minor locale config change and many new documentation files across Utilities (Evals best practices, templates, CreateCLI patterns, Browser workflows, etc.).

Sequence Diagram(s)

```mermaid
sequenceDiagram
  autonumber
  participant CLI as "CLI (Browse.ts)"
  participant Session as "BrowserSession (HTTP API)"
  participant Playwright as "PlaywrightBrowser (index.ts)"
  participant Page as "Browser Page / Process"
  participant FS as "Filesystem / State"

  CLI->>Session: POST /navigate or /screenshot (args)
  Session->>Playwright: invoke navigate/capture/evaluate APIs
  Playwright->>Page: perform goto/click/evaluate/wait
  Page-->>Playwright: emit console / network / dialog events
  Playwright->>FS: persist session state, logs, screenshot
  Playwright-->>Session: return result (JSON / buffer)
  Session-->>CLI: HTTP response with payload + diagnostics
```
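The round-trip shown in the diagram can be sketched as a tiny in-process dispatcher. The endpoint names come from the diagram; the handler and result shapes below are purely illustrative assumptions, not the actual BrowserSession API.

```typescript
// Illustrative stand-in for the BrowserSession HTTP API routing shown above.
// Real handlers would call into PlaywrightBrowser; these just echo arguments.
type SessionResult = { ok: boolean; endpoint: string; payload?: unknown };

const handlers: Record<string, (args: Record<string, unknown>) => SessionResult> = {
  '/navigate': (args) => ({ ok: true, endpoint: '/navigate', payload: { url: args.url } }),
  '/screenshot': (args) => ({ ok: true, endpoint: '/screenshot', payload: { path: args.path } }),
};

function dispatch(endpoint: string, args: Record<string, unknown> = {}): SessionResult {
  const handler = handlers[endpoint];
  // Unknown endpoints produce a failed result instead of throwing.
  if (!handler) return { ok: false, endpoint };
  return handler(args);
}
```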

Estimated code review effort

🎯 5 (Critical) | ⏱️ ~120 minutes

Poem

🐰
I hop through lines of prose and code,
Collected quotes in tidy mode,
Browsers hum while snapshots glow,
Docx edits and Evals in a row,
A rabbit cheers — behold the new node!



@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 11

Note

Due to the large number of review comments, Critical severity comments were prioritized as inline comments.

Caution

Some comments are outside the diff and can’t be posted inline due to platform limitations.

⚠️ Outside diff range comments (1)
.opencode/skills/Utilities/Browser/Workflows/Update.md (1)

81-87: ⚠️ Potential issue | 🟡 Minor

Fenced code block without a language tag

The block starting at line 81 should be given a language (text is enough) so Markdownlint stays clean.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Browser/Workflows/Update.md around lines 81 - 87,
The fenced code block containing the lines starting with "# Last sync:
2026-01-03" is missing a language tag; update that triple-backtick fence to
include a language identifier (e.g., ```text) so Markdownlint stops flagging it
— locate the fenced block in Update.md and add the language after the opening
``` while keeping the block contents unchanged.
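For reference, the MD040 fix amounts to adding a language token after the opening fence while leaving the block contents unchanged, e.g.:

````markdown
<!-- Before: flagged by markdownlint MD040 -->
```
# Last sync: 2026-01-03
```

<!-- After: language tag added, contents unchanged -->
```text
# Last sync: 2026-01-03
```
````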
🟡 Minor comments (38)
.opencode/skills/Utilities/Delegation/SKILL.md-60-80 (1)

60-80: ⚠️ Potential issue | 🟡 Minor

Heading numbering is inconsistent (duplicate "3.").

Lines 60 and 72 both read "### 3.". That makes cross-references within the skill doc harder; please renumber consistently.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Delegation/SKILL.md around lines 60 - 80, the heading numbering is inconsistent: change the second "### 3." (the "Foreground Agents" heading) to "### 4." and adjust all subsequent section numbers as needed, so that "Background Agents" remains "### 3." and "Foreground Agents" is unambiguously "### 4."; check the "Background Agents" and "Foreground Agents" sections in SKILL.md and correct only the visible numbers in the Markdown headings.
.opencode/skills/Utilities/Delegation/SKILL.md-50-52 (1)

50-52: ⚠️ Potential issue | 🟡 Minor

Fenced code blocks without language tags (markdownlint MD040).

The blocks starting at lines 50, 64, 103, 148, and 160 lack a language tag. Please add e.g. bash or text so rendering and linting stay consistent.

Suggested patch

````diff
-```
+```text
 Task(subagent_type="Engineer", isolation: "worktree", prompt="...")
````

````diff
-```
+```text
 Task(subagent_type="Engineer", run_in_background: true, prompt="...")
````

````diff
-```
+```text
 1. TeamCreate(team_name="my-project")           # Creates team + task list
 2. TaskCreate(subject="Implement auth module")   # Create team tasks
 3. Task(subagent_type="Engineer", team_name="my-project", name="auth-engineer")  # Spawn teammate
 4. TaskUpdate(taskId="1", owner="auth-engineer") # Assign task
 5. SendMessage(type="message", recipient="auth-engineer", content="...")  # Coordinate
````

````diff
-```
+```text
 Task(subagent_type="general-purpose", model="haiku", max_turns=3, prompt="...")
````

````diff
-```
+```text
 Task(subagent_type="general-purpose", prompt="...")  # or specialized agent type
````

Also applies to: 64-66, 103-109, 148-150, 160-162

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Delegation/SKILL.md around lines 50 - 52, several fenced code blocks containing Task(...) examples and the numbered TeamCreate/TaskCreate sequence lack a language tag; update each opening triple-backtick fence for the blocks that include Task(subagent_type="Engineer", isolation: "worktree", prompt="..."), Task(subagent_type="Engineer", run_in_background: true, prompt="..."), the numbered block starting with "1. TeamCreate(team_name="my-project")", Task(subagent_type="general-purpose", model="haiku", max_turns=3, prompt="...") and Task(subagent_type="general-purpose", prompt="...") to include an explicit language identifier such as "text" or "bash" (e.g., change ``` to ```text) so markdownlint MD040 is satisfied and rendering is consistent.
.opencode/skills/Utilities/Docx/Scripts/__init__.py-1-1 (1)

1-1: ⚠️ Potential issue | 🟡 Minor

Fix file permissions: the file should not be executable.

The file has the executable flag set, although __init__.py files are not executable scripts. Please remove the execute permission.

🔧 Suggested fix

Run the following command to correct the permission:

```shell
chmod -x .opencode/skills/Utilities/Docx/Scripts/__init__.py
```

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Docx/Scripts/__init__.py at line 1, the file __init__.py in the Scripts package has its executable bit set; remove the executable permission (unset the executable flag) so it is a plain Python package file rather than a script (e.g., update the file mode to remove execute permissions for owner/group/others).
.opencode/skills/Utilities/CreateSkill/Workflows/CreateSkill.md-86-88 (1)

86-88: ⚠️ Potential issue | 🟡 Minor

Markdownlint MD040 in embedded example blocks

The inner fences carry no language tag. Please add text/markdown so the file passes lint.

Also applies to: 102-107, 110-114, 119-119

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/CreateSkill/Workflows/CreateSkill.md around lines 86 - 88, update the nested fenced code blocks in the markdown examples so the inner fences include a language tag (e.g., text or markdown) to satisfy markdownlint MD040; specifically edit the example that contains the text "Running WorkflowName in SkillName..." and the other similar embedded blocks in the same document so each inner fence starts with a language identifier instead of an empty fence.
.opencode/skills/Utilities/CreateSkill/Workflows/UpdateSkill.md-22-24 (1)

22-24: ⚠️ Potential issue | 🟡 Minor

Markdownlint MD040 on the path block

The code block has no language tag. Please use e.g. text.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/CreateSkill/Workflows/UpdateSkill.md around lines 22 - 24, the existing triple-backtick code block containing "~/.opencode/skills/CORE/SkillSystem.md" is missing a language tag, which triggers MD040; fix it by changing the opening fence from ``` to e.g. ```text (or a fitting language) so the block carries a language identifier and the markdownlint warning is resolved.
.opencode/skills/Utilities/CreateSkill/Workflows/CanonicalizeSkill.md-289-289 (1)

289-289: ⚠️ Potential issue | 🟡 Minor

Standardize the directory name on line 289 to Tools/

The checklist uses tools/, while the rest of the doc works with Tools/. Please unify.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/CreateSkill/Workflows/CanonicalizeSkill.md at line 289, the checklist line with the entry "`tools/` directory exists (even if empty)" is inconsistent with the rest of the documentation, which uses "`Tools/`"; update this entry in the document (CanonicalizeSkill.md) by changing `tools/` to `Tools/` so the directory naming is consistent throughout.
.opencode/skills/Utilities/CreateSkill/SKILL.md-239-240 (1)

239-240: ⚠️ Potential issue | 🟡 Minor

Directory name inconsistent: tools/ vs Tools/

The examples use tools/ on line 239, while the rest of the document calls for Tools/ (TitleCase). On case-sensitive systems this produces wrong paths.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/CreateSkill/SKILL.md around lines 239 - 240, the directory name in the example is inconsistent: in the SKILL.md entry that currently shows "tools/", replace it with the TitleCase variant "Tools/" so all occurrences match the rest of the document; in particular check and update the line with "→ Creates SKILL.md, Workflows/, tools/" and all similar examples so they use "Tools/" (search for "tools/" and adjust every match).
.opencode/skills/Utilities/CreateSkill/Workflows/CanonicalizeSkill.md-149-185 (1)

149-185: ⚠️ Potential issue | 🟡 Minor

Step number assigned twice (Step 6)

There are two different Step 6 sections. Please renumber so the sequence stays unambiguous.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/CreateSkill/Workflows/CanonicalizeSkill.md around lines 149 - 185, the file contains two headings named "Step 6" ("Step 6: Convert YAML Frontmatter" and "Step 6: Add Workflow Routing to Body"), which duplicates the numbering; please bump the second one, "Step 6: Add Workflow Routing to Body", and all subsequent steps up by one (e.g. to "Step 7: Add Workflow Routing to Body") and consistently update all referenced step numbers in the document so the sequence remains unambiguous.
.opencode/skills/Utilities/CreateSkill/Workflows/ValidateSkill.md-149-157 (1)

149-157: ⚠️ Potential issue | 🟡 Minor

tools/ and Tools/ are used interchangeably

The structure check says tools/, but later the doc says Tools/ (e.g. line 163). This should be unified to a single naming standard.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/CreateSkill/Workflows/ValidateSkill.md around lines 149 - 157, the document mixes "tools/" and "Tools/" directory names; standardize on a single casing (use "tools/" lowercase) throughout ValidateSkill.md by replacing any occurrences of "Tools/" with "tools/" (including headings, checklist items, examples and the Step 7a section) and ensure any related checks or prose consistently refer to the lowercase "tools/" directory.
.opencode/skills/Utilities/CreateSkill/Workflows/CreateSkill.md-216-217 (1)

216-217: ⚠️ Potential issue | 🟡 Minor

Checklist references the wrong directory (tools/)

Line 216 requires tools/, although the workflow otherwise creates and uses Tools/ (e.g. lines 50, 56). Please use Tools/ consistently.

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/CreateSkill/Workflows/CreateSkill.md around lines 216 - 217, the checklist mistakenly uses the directory name `tools/`; change the entry "- [ ] `tools/` directory exists (even if empty)" to "- [ ] `Tools/` directory exists (even if empty)" so it is consistent with the other references to `Tools/` in the workflow.
.opencode/skills/Utilities/CreateSkill/Workflows/CanonicalizeSkill.md-22-24 (1)

22-24: ⚠️ Potential issue | 🟡 Minor

Markdownlint MD040: missing language tags on several fenced blocks

Please set explicit languages (text, bash, markdown) on all affected blocks.

Also applies to: 62-65, 68-71, 74-77, 80-83, 112-115, 118-121, 124-127, 197-199, 250-250, 257-257

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/CreateSkill/Workflows/CanonicalizeSkill.md around lines 22 - 24, there are multiple fenced code blocks using plain ``` with no language tag (e.g. the block showing "~/.opencode/skills/CORE/SkillSystem.md"); update each such fence to include an explicit language token (use bash for shell paths/commands, text for plain paths or outputs, markdown for embedded markdown snippets) so markdownlint MD040 is satisfied; ensure every affected triple-backtick block in the file is modified to include the appropriate language tag.
.opencode/skills/Utilities/CreateSkill/SKILL.md-51-57 (1)

51-57: ⚠️ Potential issue | 🟡 Minor

Markdownlint MD040: language tags missing on several fenced code blocks

Several code blocks are defined without a language tag. Please set e.g. bash, text, or markdown so lint stays clean.

Also applies to: 61-67, 123-128, 131-137, 143-153, 218-220, 234-241, 244-251, 254-261

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/CreateSkill/SKILL.md around lines 51 - 57, several fenced code blocks in SKILL.md are missing language tags (e.g., the block that begins with "skills/SkillName/SKILL.md # Skill root"), which triggers markdownlint MD040; update every fenced code block in this file to include an explicit language (use "text" for plain file lists, "bash" for shell snippets, or "markdown" for embedded markdown), including the other blocks referenced in the comment ranges (61-67, 123-128, 131-137, 143-153, 218-220, 234-241, 244-251, 254-261), so all triple-backtick fences become ```text or ```bash as appropriate to satisfy the linter.
.opencode/skills/Utilities/CreateSkill/Workflows/ValidateSkill.md-22-24 (1)

22-24: ⚠️ Potential issue | 🟡 Minor

Markdownlint MD040: add language tags to several code blocks

The affected fenced blocks should be given language tags (text, bash, markdown).

Also applies to: 98-100, 117-121

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/CreateSkill/Workflows/ValidateSkill.md around lines 22 - 24, several fenced code blocks (e.g., the ones containing the literal string "~/.opencode/skills/CORE/SkillSystem.md" and the other blocks noted in the review) are missing language tags and trigger markdownlint MD040; edit each fenced block in ValidateSkill.md to add an appropriate language tag (use text for plain paths, bash for shell snippets, or markdown for embedded markdown) immediately after the opening backticks so the blocks become ```text, ```bash, or ```markdown.
.opencode/skills/Utilities/Evals/Tools/FailureToTask.ts-47-56 (1)

47-56: ⚠️ Potential issue | 🟡 Minor

Missing error handling for malformed JSON lines.

If a line in the JSONL file is malformed, JSON.parse throws and the entire function fails. For a persistent data file the function should be more robust.

🛡️ Suggested fix with error handling

```diff
 export function loadFailures(): FailureLog[] {
   if (!existsSync(FAILURES_LOG)) return [];

   const content = readFileSync(FAILURES_LOG, 'utf-8');
   return content
     .trim()
     .split('\n')
     .filter(Boolean)
-    .map(line => JSON.parse(line) as FailureLog);
+    .map((line, index) => {
+      try {
+        return JSON.parse(line) as FailureLog;
+      } catch (e) {
+        console.warn(`Warning: skipped malformed line ${index + 1}`);
+        return null;
+      }
+    })
+    .filter((f): f is FailureLog => f !== null);
 }
```
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Evals/Tools/FailureToTask.ts around lines 47 -
56, The loadFailures function currently calls JSON.parse on every line and will
throw if any line is malformed; update loadFailures to handle per-line parse
errors by wrapping the JSON.parse(line) call in a try/catch, skip lines that
fail to parse (optionally collecting or logging the parse error and the
offending line), and return only successfully parsed FailureLog objects;
reference the loadFailures function and the FAILURE_LOG constant (FAILURES_LOG)
so you modify the map over lines to safe parsing rather than directly calling
JSON.parse.
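The tolerant-parsing idea above can be sketched as a standalone function. The FailureLog shape here is a minimal stand-in for illustration, not the real type from FailureToTask.ts.

```typescript
// Hedged sketch: skip malformed JSONL lines instead of failing the whole load.
// FailureLog is an assumed, simplified shape.
interface FailureLog {
  description: string;
  category: string;
}

function parseFailureLines(content: string): FailureLog[] {
  return content
    .trim()
    .split('\n')
    .filter(Boolean)
    .flatMap((line, index) => {
      try {
        return [JSON.parse(line) as FailureLog];
      } catch {
        // A bad line is logged and skipped; the good lines still load.
        console.warn(`Skipping malformed JSONL line ${index + 1}`);
        return [];
      }
    });
}
```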
.opencode/skills/Utilities/Evals/Tools/FailureToTask.ts-289-295 (1)

289-295: ⚠️ Potential issue | 🟡 Minor

Missing runtime validation for the severity parameter.

The type assertion values.severity as 'low' | 'medium' | 'high' | 'critical' does not validate at runtime. With an invalid input such as -s invalid, the value is stored unchecked.

🛡️ Suggested fix with validation

```diff
+    const validSeverities = ['low', 'medium', 'high', 'critical'] as const;
+    const severity = validSeverities.includes(values.severity as any)
+      ? (values.severity as 'low' | 'medium' | 'high' | 'critical')
+      : 'medium';
+
     const failure = logFailure({
       description: args.join(' '),
       category: values.category ?? 'unknown',
-      severity: values.severity as 'low' | 'medium' | 'high' | 'critical',
+      severity,
       expected_behavior: values.expected,
       actual_behavior: values.actual,
     });
```
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Evals/Tools/FailureToTask.ts around lines 289 - 295, the severity is currently only assumed statically via a type assertion (`values.severity as 'low' | 'medium' | 'high' | 'critical'`); add runtime validation before calling `logFailure`: check `values.severity` against a whitelist array or set of ['low','medium','high','critical'], and use only the validated value (or fall back to a safe default such as 'low'/'unknown', or throw/log an error) instead of the raw input; update the spot in FailureToTask.ts around the `failure = logFailure({... severity: ...})` call and consider a small helper function `isValidSeverity` for reuse.
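A minimal standalone version of that whitelist check might look like this; the function and constant names are illustrative, not taken from the actual file.

```typescript
// Illustrative runtime validation for a CLI severity flag.
const VALID_SEVERITIES = ['low', 'medium', 'high', 'critical'] as const;
type Severity = (typeof VALID_SEVERITIES)[number];

function parseSeverity(input: string | undefined, fallback: Severity = 'medium'): Severity {
  // includes() gives a real runtime check, unlike a bare type assertion.
  return VALID_SEVERITIES.includes(input as Severity) ? (input as Severity) : fallback;
}
```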
.opencode/skills/Utilities/Evals/Tools/FailureToTask.ts-93-107 (1)

93-107: ⚠️ Potential issue | 🟡 Minor

Inconsistent handling of category casing.

inferDomain (line 87) normalizes the category with .toLowerCase(), but inferGraders uses the category directly in includes(). With mixed casing such as "File_Targeting", the domain is detected correctly but the grader logic never fires.

♻️ Suggested fix for consistent normalization

```diff
 function inferGraders(category: string, failure: FailureLog): GraderConfig[] {
   const graders: GraderConfig[] = [];
+  const normalizedCategory = category.toLowerCase();

   // Always add tool call verification for coding/agent failures
-  if (['file_targeting', 'wrong_file', 'tool_sequence'].includes(category)) {
+  if (['file_targeting', 'wrong_file', 'tool_sequence'].includes(normalizedCategory)) {
     graders.push({
```
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Evals/Tools/FailureToTask.ts around lines 93 -
107, The graders selection in inferGraders uses category without normalization
causing mismatches with inferDomain which calls .toLowerCase(); modify
inferGraders to normalize the incoming category (e.g., const normalized =
category.toLowerCase()) and use normalized in the includes() check and in any
subsequent comparisons for the 'file_targeting', 'wrong_file', 'tool_sequence'
branch so mixed-case inputs like "File_Targeting" trigger the tool_calls grader
consistently with inferDomain.
.opencode/skills/Utilities/Evals/PROJECT.md-733-809 (1)

733-809: ⚠️ Potential issue | 🟡 Minor

The file structure in PROJECT.md is outdated and does not match the actual directory layout.

The documented file structure in PROJECT.md differs substantially from the actual structure of the Evals skill. Update the documentation:

Documented (wrong):

  • cli/, scorers/, use-cases/, results/, storage/

Actually present:

  • Data/, Graders/, Suites/, Tools/, Types/, UseCases/, Workflows/

The Tools/ and Workflows/ directories already exist per PAI v3.0 conventions but are missing from the documentation diagram. In addition, several directories have different names (e.g. Graders instead of scorers, UseCases instead of use-cases). Please update the file-structure documentation to reflect current reality.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Evals/PROJECT.md around lines 733 - 809, Update
PROJECT.md's file-structure diagram to match the actual Evals-Skill layout:
replace the outdated entries (cli/, scorers/, use-cases/, results/, storage/)
with the real top-level directories (Data/, Graders/, Suites/, Tools/, Types/,
UseCases/, Workflows/) and ensure correct casing and PAI v3.0 conventions; keep
the same diagram style and examples (e.g., retain lines for README.md,
package.json, tsconfig.json) and add brief notes if any legacy dirs still exist
so the diagram in PROJECT.md accurately reflects the repository.
.opencode/skills/Utilities/Documents/Workflows/ProcessLargePdfGemini3.md-619-619 (1)

619-619: ⚠️ Potential issue | 🟡 Minor

Minor language fix: hyphenate the compound adjective.

"~30 page research paper" → "~30-page research paper".

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Documents/Workflows/ProcessLargePdfGemini3.md at line 619, the phrase in the heading "Typical Processing Times (on ~30 page research paper):" should hyphenate the compound adjective; replace "~30 page research paper" with "~30-page research paper" in the line containing "Typical Processing Times (on ~30 page research paper):".
.opencode/skills/Utilities/Evals/TemplateIntegration.md-57-62 (1)

57-62: ⚠️ Potential issue | 🟡 Minor

Same path inconsistency in the rubric example.

Here too, the path should be updated to Utilities/Evals/.

🔧 Suggested fix

````diff
 ```bash
 bun run ~/.opencode/Templates/Tools/RenderTemplate.ts \
   -t Evals/Rubric.hbs \
-  -d ~/.opencode/skills/Evals/UseCases/<name>/rubric.yaml \
-  -o ~/.opencode/skills/Evals/UseCases/<name>/rubric.md
+  -d ~/.opencode/skills/Utilities/Evals/UseCases/<name>/rubric.yaml \
+  -o ~/.opencode/skills/Utilities/Evals/UseCases/<name>/rubric.md
 ```
````

🤖 Prompt for AI Agents

Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Evals/TemplateIntegration.md around lines 57 - 62, update the example command that runs RenderTemplate.ts so the data (-d) and output (-o) paths point to the Utilities/Evals location rather than skills/Evals; specifically edit the invocation of ~/.opencode/Templates/Tools/RenderTemplate.ts (the command referencing Evals/Rubric.hbs) and change any occurrences of ~/.opencode/skills/Evals/UseCases/<name>/rubric.yaml and ~/.opencode/skills/Evals/UseCases/<name>/rubric.md to ~/.opencode/skills/Utilities/Evals/UseCases/<name>/rubric.yaml and ~/.opencode/skills/Utilities/Evals/UseCases/<name>/rubric.md respectively.
<details>
<summary>.opencode/skills/Utilities/Evals/TemplateIntegration.md-21-25 (1)</summary><blockquote>

`21-25`: _⚠️ Potential issue_ | _🟡 Minor_

**Path inconsistency after the skill reorganization.**

The CLI examples use the old path `~/.opencode/skills/Evals/` instead of the new path `~/.opencode/skills/Utilities/Evals/`. This could cause confusion or execution errors.


<details>
<summary>🔧 Suggested fix</summary>

```diff
 bun run ~/.opencode/Templates/Tools/RenderTemplate.ts \
   -t Evals/Judge.hbs \
-  -d ~/.opencode/skills/Evals/UseCases/<name>/judge-config.yaml \
-  -o ~/.opencode/skills/Evals/UseCases/<name>/judge-prompt.md
+  -d ~/.opencode/skills/Utilities/Evals/UseCases/<name>/judge-config.yaml \
+  -o ~/.opencode/skills/Utilities/Evals/UseCases/<name>/judge-prompt.md
```

</details>

<details>
<summary>🤖 Prompt for AI Agents</summary>

Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Evals/TemplateIntegration.md around lines 21 -
25, the CLI examples still use the old path
~/.opencode/skills/Evals/UseCases/<name>/...; update all occurrences in the
instructions so the commands point to
~/.opencode/skills/Utilities/Evals/UseCases/<name>/judge-config.yaml and
.../judge-prompt.md (in the RenderTemplate.ts invocation using the
Evals/Judge.hbs template), i.e. replace both the -d and -o paths with the new
Utilities/Evals paths.


</details>

</blockquote></details>
<details>
<summary>.opencode/skills/Utilities/Evals/CLIReference.md-51-54 (1)</summary><blockquote>

`51-54`: _⚠️ Potential issue_ | _🟡 Minor_

**Update the path to the EvalServer directory.**


<details>
<summary>🔧 Suggested fix</summary>

```diff
-cd ~/.opencode/skills/Evals/EvalServer
+cd ~/.opencode/skills/Utilities/Evals/EvalServer
 bun run dev  # Starts on http://localhost:5173
```
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Evals/CLIReference.md around lines 51 - 54,
update the path in the EvalServer start instruction: replace the line "cd
~/.opencode/skills/Evals/EvalServer" with the correct directory path to the
EvalServer (e.g. "cd ~/.opencode/skills/Utilities/Evals/EvalServer") and leave
the subsequent start command "bun run dev  # Starts on http://localhost:5173"
unchanged; verify that the path string in CLIReference.md matches the actual
project directory.
.opencode/skills/Utilities/Evals/CLIReference.md-68-83 (1)

68-83: ⚠️ Potential issue | 🟡 Minor

Storage paths and missing language specifier.

The directory tree uses the old path and needs a language specifier.

🔧 Suggested fix
```diff
-```
-~/.opencode/skills/Evals/
+```text
+~/.opencode/skills/Utilities/Evals/
 ├── UseCases/
```
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Evals/CLIReference.md around lines 68 - 83,
Update the example directory header to the correct path and add a language
specifier for the code block: replace the old path string
"~/.opencode/skills/Evals/" with "~/.opencode/skills/Utilities/Evals/" and
ensure the fenced block is marked (e.g., "text") so the tree snippet is rendered
correctly; locate the snippet that begins with "~/.opencode/skills/Evals/" and
adjust that header and its fence language while leaving the UseCases/ Results/
EvalServer/ structure unchanged.
.opencode/skills/Utilities/Evals/SKILL.md-113-116 (1)

113-116: ⚠️ Potential issue | 🟡 Minor

Path inconsistency in the ALGORITHM integration.

The AlgorithmBridge path in this section also needs updating.

🔧 Suggested fix

```diff
 # Run eval and update ISC row
-bun run ~/.opencode/skills/Evals/Tools/AlgorithmBridge.ts -s regression-core -r 3 -u
+bun run ~/.opencode/skills/Utilities/Evals/Tools/AlgorithmBridge.ts -s regression-core -r 3 -u
```
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Evals/SKILL.md around lines 113 - 116, the
documentation contains an outdated path to the AlgorithmBridge script; update
the bun run invocation line in .opencode/skills/Utilities/Evals/SKILL.md so it
points to the current location of the AlgorithmBridge script (use the correct
path to AlgorithmBridge.ts in the new Tools/Evals layout) and check that the
invocation still passes the flags (-s, -r, -u) unchanged.
.opencode/skills/Utilities/Evals/SKILL.md-92-107 (1)

92-107: ⚠️ Potential issue | 🟡 Minor

Multiple path inconsistencies in the CLI reference.

All CLI paths use the old scheme ~/.opencode/skills/Evals/ instead of ~/.opencode/skills/Utilities/Evals/. This affects AlgorithmBridge.ts, FailureToTask.ts and SuiteManager.ts.

🔧 Suggested fix for all CLI commands

```diff
 # Run an eval suite
-bun run ~/.opencode/skills/Evals/Tools/AlgorithmBridge.ts -s <suite>
+bun run ~/.opencode/skills/Utilities/Evals/Tools/AlgorithmBridge.ts -s <suite>

 # Log a failure for later conversion
-bun run ~/.opencode/skills/Evals/Tools/FailureToTask.ts log "description" -c category -s severity
+bun run ~/.opencode/skills/Utilities/Evals/Tools/FailureToTask.ts log "description" -c category -s severity

 # Convert failures to test tasks
-bun run ~/.opencode/skills/Evals/Tools/FailureToTask.ts convert-all
+bun run ~/.opencode/skills/Utilities/Evals/Tools/FailureToTask.ts convert-all

 # Manage suites
-bun run ~/.opencode/skills/Evals/Tools/SuiteManager.ts create <name> -t capability -d "description"
-bun run ~/.opencode/skills/Evals/Tools/SuiteManager.ts list
-bun run ~/.opencode/skills/Evals/Tools/SuiteManager.ts check-saturation <name>
-bun run ~/.opencode/skills/Evals/Tools/SuiteManager.ts graduate <name>
+bun run ~/.opencode/skills/Utilities/Evals/Tools/SuiteManager.ts create <name> -t capability -d "description"
+bun run ~/.opencode/skills/Utilities/Evals/Tools/SuiteManager.ts list
+bun run ~/.opencode/skills/Utilities/Evals/Tools/SuiteManager.ts check-saturation <name>
+bun run ~/.opencode/skills/Utilities/Evals/Tools/SuiteManager.ts graduate <name>
```
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Evals/SKILL.md around lines 92 - 107, Update the
CLI examples in SKILL.md to use the correct path prefix by replacing occurrences
of ~/.opencode/skills/Evals/ with ~/.opencode/skills/Utilities/Evals/ for the
command examples referencing AlgorithmBridge.ts, FailureToTask.ts, and
SuiteManager.ts; edit the section containing the bash snippets so each
invocation (AlgorithmBridge.ts, FailureToTask.ts log/convert-all, and
SuiteManager.ts create/list/check-saturation/graduate) points to
~/.opencode/skills/Utilities/Evals/ to ensure the documented paths match the
actual file locations.
.opencode/skills/Utilities/Evals/CLIReference.md-17-43 (1)

17-43: ⚠️ Potential issue | 🟡 Minor

Path inconsistencies in all CLI commands.

All CLI paths reference the old scheme ~/.opencode/skills/Evals/ instead of the new path ~/.opencode/skills/Utilities/Evals/.

🔧 Suggested fix

```diff
 # Create new use case
-bun run ~/.opencode/skills/Evals/EvalServer/cli.ts use-case create --name <name>
+bun run ~/.opencode/skills/Utilities/Evals/EvalServer/cli.ts use-case create --name <name>

 # List all use cases
-bun run ~/.opencode/skills/Evals/EvalServer/cli.ts use-case list
+bun run ~/.opencode/skills/Utilities/Evals/EvalServer/cli.ts use-case list

 # Show use case details
-bun run ~/.opencode/skills/Evals/EvalServer/cli.ts use-case show <name>
+bun run ~/.opencode/skills/Utilities/Evals/EvalServer/cli.ts use-case show <name>
```

The same applies to Test Case Management (lines 30-36) and Run Evaluations (lines 40-43).

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Evals/CLIReference.md around lines 17 - 43,
update all example CLI command paths to use the new directory prefix
~/.opencode/skills/Utilities/Evals/ instead of the old
~/.opencode/skills/Evals/; specifically edit the use-case commands (references
to cli.ts), test-case commands (add/list referencing cli.ts), and the run
evaluation command (cli-run.ts) so each invocation points to
~/.opencode/skills/Utilities/Evals/EvalServer/cli.ts or
~/.opencode/skills/Utilities/Evals/EvalServer/cli-run.ts respectively, ensuring
consistent paths throughout the file.
.opencode/skills/Utilities/Evals/Graders/ModelBased/PairwiseComparison.ts-67-81 (1)

67-81: ⚠️ Potential issue | 🟡 Minor

Inconsistency between aggregateWinner and score on position swap.

The score computation in lines 78-81 overwrites the previously set score, but aggregateWinner remains unchanged. As a result, e.g. aggregateWinner = 'output' can end up combined with score = 0.25, which is contradictory.

♻️ Consistent computation

```diff
+    // Compute score and winner consistently
+    if (positionSwap && results.length === 2) {
+      score = (outputWins + ties * 0.5) / 2;
+      if (score > 0.5) {
+        aggregateWinner = 'output';
+      } else if (score < 0.5) {
+        aggregateWinner = 'reference';
+      } else {
+        aggregateWinner = 'tie';
+      }
+    } else {
       if (outputWins > referenceWins) {
         score = 1.0;
         aggregateWinner = 'output';
       } else if (referenceWins > outputWins) {
         score = 0.0;
         aggregateWinner = 'reference';
       } else {
         score = 0.5;
         aggregateWinner = 'tie';
       }
-
-    // For the score, also consider partial wins
-    if (positionSwap && results.length === 2) {
-      score = (outputWins + ties * 0.5) / 2;
     }
```
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Evals/Graders/ModelBased/PairwiseComparison.ts
around lines 67 - 81, The position-swap branch updates score without updating
aggregateWinner, causing inconsistent pairs (e.g., aggregateWinner='output' with
score=0.25); after computing the adjusted score in the positionSwap &&
results.length === 2 block (which uses outputWins, referenceWins, ties),
recompute and set aggregateWinner so it matches the new score (e.g.,
aggregateWinner = 'output' if score > 0.5, 'reference' if score < 0.5, 'tie' if
score === 0.5), keeping the existing variables score and aggregateWinner in
PairwiseComparison.ts consistent.
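The fix in the suggested diff boils down to one rule: compute the score first, then derive the winner from it, so the two values can never disagree. A minimal standalone sketch (not the file's actual code; the names `outputWins`, `referenceWins`, `ties`, and `positionSwap` mirror the diff above):

```typescript
// Sketch: derive the winner from the final score, so pairs like
// (winner='output', score=0.25) cannot occur.
type Winner = 'output' | 'reference' | 'tie';

function aggregate(
  outputWins: number,
  referenceWins: number,
  ties: number,
  positionSwap: boolean
): { score: number; winner: Winner } {
  let score: number;
  if (positionSwap && outputWins + referenceWins + ties === 2) {
    // Two trials with swapped positions: a tie counts as half a win.
    score = (outputWins + ties * 0.5) / 2;
  } else {
    score = outputWins > referenceWins ? 1.0 : referenceWins > outputWins ? 0.0 : 0.5;
  }
  const winner: Winner = score > 0.5 ? 'output' : score < 0.5 ? 'reference' : 'tie';
  return { score, winner };
}

console.log(aggregate(1, 0, 1, true)); // → { score: 0.75, winner: 'output' }
console.log(aggregate(0, 1, 1, true)); // → { score: 0.25, winner: 'reference' }
```

With the score as the single source of truth, the position-swap branch can no longer produce a winner label that contradicts the partial-win score.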
.opencode/skills/Utilities/Evals/Graders/Base.ts-94-122 (1)

94-122: ⚠️ Potential issue | 🟡 Minor

Missing error handling in runGraders.

If a grader throws an exception during grade(), the entire aggregation aborts. The remaining graders are never executed.

♻️ More robust implementation

```diff
   for (const grader of graders) {
-    const result = await grader.grade(context);
-    results.push(result);
+    try {
+      const result = await grader.grade(context);
+      results.push(result);
+    } catch (e) {
+      // Produce an error result instead of aborting
+      results.push({
+        grader_type: grader.type,
+        weight: grader.getWeight(),
+        score: 0,
+        passed: false,
+        duration_ms: 0,
+        reasoning: `Grader error: ${e}`,
+      });
+      if (grader.isRequired()) {
+        allRequiredPassed = false;
+      }
+    }
```
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Evals/Graders/Base.ts around lines 94 - 122, The
runGraders function should catch exceptions from each grader.grade call so one
failing grader doesn't stop the loop; wrap the await grader.grade(context) in a
try/catch, and on error push a GraderResult that indicates failure (e.g., score
0, passed false, include an error message in result.details or similar) while
still using grader.getWeight() and grader.isRequired() for aggregation; ensure
totalWeight/weightedSum logic remains correct (use 0 score for errored graders)
and that allRequiredPassed is set false for required graders that errored so
final aggregate_score and passed reflect the failure.
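End to end, the error-isolating loop could look like the following hedged sketch. It assumes only a minimal `Grader` shape with the `grade`/`getWeight`/`isRequired` methods named in the diff; the aggregate return shape and context type are illustrative, not the file's actual API:

```typescript
// Sketch: a throwing grader counts as score 0 and, if required,
// fails the run — but the remaining graders still execute.
interface Grader {
  type: string;
  grade(context: unknown): Promise<{ score: number; passed: boolean }>;
  getWeight(): number;
  isRequired(): boolean;
}

async function runGraders(graders: Grader[], context: unknown) {
  let weightedSum = 0;
  let totalWeight = 0;
  let allRequiredPassed = true;
  for (const grader of graders) {
    let score = 0;
    let passed = false;
    try {
      const result = await grader.grade(context);
      score = result.score;
      passed = result.passed;
    } catch {
      // Errored grader: keep score 0, passed false, continue the loop.
    }
    weightedSum += score * grader.getWeight();
    totalWeight += grader.getWeight();
    if (grader.isRequired() && !passed) allRequiredPassed = false;
  }
  return {
    aggregate_score: totalWeight > 0 ? weightedSum / totalWeight : 0,
    passed: allRequiredPassed,
  };
}
```

A required grader that throws now drags `passed` to false while a passing optional grader still contributes its weight, so the weighted average stays well-defined.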
.opencode/skills/Utilities/Evals/Graders/ModelBased/NaturalLanguageAssert.ts-109-117 (1)

109-117: ⚠️ Potential issue | 🟡 Minor

Fallback parsing is unreliable.

The fallback logic only checks whether the text contains "assertion X" AND "true", regardless of context. A sentence like "assertion 1 is definitely not true" would incorrectly be interpreted as passed: true.

♻️ Suggested improvement

```diff
       // Try to find by content if numbered format didn't work
-      const containsTrue = text.toLowerCase().includes(`assertion ${i + 1}`) &&
-        text.toLowerCase().includes('true');
+      // Fallback: look for "assertion X: true" or similar patterns
+      const fallbackPattern = new RegExp(
+        `assertion\\s*${i + 1}[:\\s]*(is\\s+)?true`,
+        'i'
+      );
+      const containsTrue = fallbackPattern.test(text);

       return {
         assertion,
         passed: containsTrue,
-        explanation: 'Could not parse result',
+        explanation: containsTrue ? 'Parsed via fallback' : 'Could not parse result',
       };
```
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Evals/Graders/ModelBased/NaturalLanguageAssert.ts
around lines 109 - 117, The current fallback uses a simple containsTrue check
that looks for "assertion X" and "true" anywhere, causing false positives;
replace that with a targeted parsing using a regex or token scan that locates
the specific "assertion {i+1}" occurrence and then inspects the immediate clause
for a boolean value (e.g., "assertion 1: true", "assertion 1 is true", or
"assertion 1 — false") while explicitly rejecting matches where negation words
("not", "isn't", "never", "no", "false") appear in close proximity to "true";
update the logic around the containsTrue variable (and the returned passed
value) so it uses this clause-scoped result and still falls back to 'Could not
parse result' if no unambiguous boolean is found.
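A standalone comparison shows why the clause-scoped pattern matters. The `naive`/`scoped` helper names are illustrative; the regex itself is the one from the suggested diff:

```typescript
// Naive substring check vs. clause-scoped regex fallback.
const naive = (text: string, i: number): boolean =>
  text.toLowerCase().includes(`assertion ${i + 1}`) &&
  text.toLowerCase().includes('true');

const scoped = (text: string, i: number): boolean =>
  new RegExp(`assertion\\s*${i + 1}[:\\s]*(is\\s+)?true`, 'i').test(text);

const negated = 'assertion 1 is definitely not true';
console.log(naive(negated, 0));  // → true  (false positive)
console.log(scoped(negated, 0)); // → false (negated clause rejected)
console.log(scoped('Assertion 1: true', 0)); // → true
```

The scoped regex only inspects the words immediately after the assertion number, so "true" appearing further away in the sentence cannot leak in; genuinely free-form verdicts would still need the proximity-based negation check the prompt describes.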
.opencode/skills/Utilities/Cloudflare/SKILL.md-27-27 (1)

27-27: ⚠️ Potential issue | 🟡 Minor

Please clean up the markdownlint findings (MD040/MD058).

Several code fences have no language specifier (lines 27, 42, 84, 92, 100), and the table around line 72 should be surrounded by blank lines.

Also applies to: 42-42, 72-74, 84-84, 92-92, 100-100

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Cloudflare/SKILL.md at line 27, Update the
Markdown to satisfy markdownlint rules by adding a language identifier to each
code fence at the specified locations (the triple-backtick code fences at lines
27, 42, 84, 92, and 100) — choose the appropriate language tag for the snippet
(e.g., json, bash, js, md) so each fence becomes ```<lang> ... ```, and ensure
the table near line 72 is surrounded by a blank line above and below it (insert
one empty line before the table start and one after the table end) so the table
is properly separated from surrounding content.
.opencode/skills/Utilities/Cloudflare/SKILL.md-51-52 (1)

51-52: ⚠️ Potential issue | 🟡 Minor

The account ID variable name is inconsistent with Workflows/Create.md.

Line 51 uses CF_ACCOUNT_ID, while the Create workflow uses CLOUDFLARE_ACCOUNT_ID. Consistent naming reduces misconfiguration.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Cloudflare/SKILL.md around lines 51 - 52, The ENV
name for the Cloudflare account is inconsistent: SKILL.md uses CF_ACCOUNT_ID
while Workflows/Create.md uses CLOUDFLARE_ACCOUNT_ID; update SKILL.md to use
CLOUDFLARE_ACCOUNT_ID (or vice‑versa across docs) so the environment variable
name is consistent, and ensure the referenced symbol CF_ACCOUNT_ID in the
"Account ID" line is replaced with CLOUDFLARE_ACCOUNT_ID and any related
examples or README references are aligned.
.opencode/skills/Utilities/Cloudflare/Workflows/Create.md-18-25 (1)

18-25: ⚠️ Potential issue | 🟡 Minor

Code fence without language specifier (MD040).

The block starting at line 18 should be tagged with a language (e.g. `text`) to keep markdownlint clean.

🔧 Suggestion
```diff
-```
+```text
 workers/mcp-server-name/
 ├── src/
 │   └── simple.js      # Main worker code
 ├── wrangler.toml      # Cloudflare config
 ├── package.json       # Dependencies
 └── README.md          # Documentation
```
</details>

<details>
<summary>🤖 Prompt for AI Agents</summary>

Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Cloudflare/Workflows/Create.md around lines 18 -
25, the fenced code block in Create.md (the directory tree starting with
"workers/mcp-server-name/") is missing a language tag, which triggers MD040;
update that block in .opencode/skills/Utilities/Cloudflare/Workflows/Create.md
to include a language identifier (e.g., use a "text" fence instead of a bare
one) by prepending the language token to the opening backticks of the sample
directory listing.
.opencode/skills/Utilities/Documents/SKILL.md-27-28 (1)

27-28: ⚠️ Potential issue | 🟡 Minor

Fenced code blocks without a language specifier (MD040).

Please give the blocks a language (e.g. `text`) so markdownlint stays stable.

✅ Example fix
```diff
-```
+```text
 Running the **WorkflowName** workflow in the **Documents** skill to ACTION...
```
</details>


Also applies to: 204-205, 213-214, 222-223

<details>
<summary>🤖 Prompt for AI Agents</summary>

Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Documents/SKILL.md around lines 27 - 28, Several
fenced code blocks in SKILL.md (e.g., the block containing "Running the
WorkflowName workflow in the Documents skill to ACTION...") are missing
a language tag; add a language (for example text) after the opening backticks
for those blocks (also update the other affected blocks referenced at lines
204-205, 213-214, 222-223) so Markdown lint rule MD040 is satisfied and the
document renders consistently.


</details>

</blockquote></details>
<details>
<summary>.opencode/skills/Utilities/Cloudflare/Workflows/Troubleshoot.md-40-42 (1)</summary><blockquote>

`40-42`: _⚠️ Potential issue_ | _🟡 Minor_

**Wording in the workflow is unprofessional and should be neutralized.**

Please keep the wording factual so the workflow remains suitable for team use.

<details>
<summary>🤖 Prompt for AI Agents</summary>

Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Cloudflare/Workflows/Troubleshoot.md around lines
40 - 42, the heading and the instruction "NEVER USE FUCKING CURL, ONLY
BROWSER-AUTOMATION (or Chrome MCP fallback)" are unprofessional; replace them
with factual wording, e.g. a neutral heading and a clear instruction such as
"Prefer browser automation (Chrome MCP as fallback)". Keep the substantive
reference to "Chrome MCP tools" and the instruction to use "browser
navigate/screenshot/act", remove the all-caps formatting and profanity, and
phrase the lines factually and appropriately for team use.


</details>

</blockquote></details>
<details>
<summary>.opencode/skills/Utilities/Cloudflare/Workflows/Troubleshoot.md-499-532 (1)</summary><blockquote>

`499-532`: _⚠️ Potential issue_ | _🟡 Minor_

**Example output code block without a language specifier (MD040).**

Please set `text` on the fence.

<details>
<summary>🤖 Prompt for AI Agents</summary>

Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Cloudflare/Workflows/Troubleshoot.md around lines
499 - 532, the markdown code fence for the example output block (the block
beginning with "🔍 Cloudflare Deployment Troubleshooter") has no language
specifier; change the opening fence to a "text" fence so it sets the language
and satisfies rule MD040, and leave the troubleshooter block content itself
unchanged.


</details>

</blockquote></details>
<details>
<summary>.opencode/skills/Utilities/Docx/Scripts/document.py-60-68 (1)</summary><blockquote>

`60-68`: _⚠️ Potential issue_ | _🟡 Minor_

**The default author is still `Claude` instead of OpenCode-neutral.**

For the intended Claude→OpenCode cleanup, the defaults should be standardized to neutral or configurable values.



Also applies to: 620-621

<details>
<summary>🤖 Prompt for AI Agents</summary>

```
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Docx/Scripts/document.py around lines 60 - 68,
Change the hard-coded defaults for author and initials to neutral/configurable
values: replace the constructor signature default author "Claude" and initials
"C" in the __init__ method (def __init__(self, xml_path, rsid: str, author: str
= "Claude", initials: str = "C")) with a neutral default such as "OpenCode" and
"OC" or make them configurable via a module-level constant/ENV var; also update
the other occurrence in the file where the same defaults are set (the similar
initializer near the bottom referenced in the review) so both places use the new
neutral/configurable defaults and ensure any docstring/comments reflect the
change.
```

</details>

</blockquote></details>
<details>
<summary>.opencode/skills/Utilities/Docx/SKILL.md-229-246 (1)</summary><blockquote>

`229-246`: _⚠️ Potential issue_ | _🟡 Minor_

**Example code blocks without a language specifier (MD040).**

Please add `text` as the fence language.


<details>
<summary>✅ Example fix</summary>

```diff
-```
+```text
 User: "Create a Word doc with my consulting proposal"
 → Reads docx-js.md for syntax
 ...
```
</details>

<details>
<summary>🤖 Prompt for AI Agents</summary>

Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Docx/SKILL.md around lines 229 - 246, the MD040
warning indicates the example code fences lack a language; update each
triple-backtick fence used for the examples (the three blocks that start with
User: "Create a Word doc...", User: "Add tracked changes...", and User: "What's
in this Word doc?") to include the language specifier "text" (i.e., change the
bare opening fence to a "text" fence) so the fenced blocks declare text as
their language; ensure you only modify the fence markers and not the inner
content.


</details>

</blockquote></details>
<details>
<summary>.opencode/skills/Utilities/Aphorisms/SKILL.md-14-33 (1)</summary><blockquote>

`14-33`: _⚠️ Potential issue_ | _🟡 Minor_

**The PAI v3.0 SKILL format is incomplete (OPTIONAL section missing).**

The file has a clear MANDATORY block but no explicit OPTIONAL block. Please add one so the skill template stays consistent.
As per coding guidelines, "SKILL.md follows PAI v3.0 format (USE WHEN triggers, MANDATORY/OPTIONAL sections)".

<details>
<summary>🤖 Prompt for AI Agents</summary>

```
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Aphorisms/SKILL.md around lines 14 - 33, The
SKILL.md currently contains a MANDATORY voice-notification block but is missing
the required OPTIONAL section per PAI-v3.0-SKILL-Format; add an explicit
OPTIONAL block below the MANDATORY block that provides fallback or additional
steps (e.g., alternative notification methods, examples of non-blocking
text-only notifications, and any post-invocation cleanup), label it "##
OPTIONAL" and include the example curl/text output formats consistent with the
existing MANDATORY content so the document follows the MANDATORY/OPTIONAL
structure expected by the skill template.
```

</details>

</blockquote></details>
<details>
<summary>.opencode/skills/Utilities/Aphorisms/Workflows/SearchAphorisms.md-39-76 (1)</summary><blockquote>

`39-76`: _⚠️ Potential issue_ | _🟡 Minor_

**Multiple code blocks without a language tag (MD040).**

Please tag the fences with a language (e.g. `text`, `markdown`, `bash`), otherwise the markdownlint error persists.



Also applies to: 118-119, 135-136, 151-152, 164-165, 181-182, 204-205, 227-228, 353-354, 364-365, 375-376, 386-387, 401-402, 418-419, 434-435, 492-493, 508-509, 529-530

<details>
<summary>🤖 Prompt for AI Agents</summary>

Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Aphorisms/Workflows/SearchAphorisms.md around
lines 39 - 76, the markdown contains multiple fenced code blocks without a
language tag (e.g., the blocks starting with lines like User: "Search aphorisms
about resilience", User: "Search for quotes with 'curiosity'", and other User:
examples); add an appropriate language identifier (for example text or
markdown) immediately after each opening triple-backtick so the fences declare
a language and satisfy MD040, eliminating the markdownlint errors across all
similar blocks referenced in the file.


</details>

</blockquote></details>

</blockquote></details>

---

<details>
<summary>ℹ️ Review info</summary>

<details>
<summary>⚙️ Run configuration</summary>

**Configuration used**: Path: .coderabbit.yaml

**Review profile**: CHILL

**Plan**: Pro

**Run ID**: `1984514d-42e2-4019-8082-43b3e62f7f5a`

</details>

<details>
<summary>📥 Commits</summary>

Reviewing files that changed from the base of the PR and between d9e691ba460679f729902e4d0912cfb50c8aa58c and 8a4140d450cd4f8604d38209997dae7b4072e0c4.

</details>

<details>
<summary>⛔ Files ignored due to path filters (1)</summary>

* `.opencode/skills/Utilities/Browser/bun.lock` is excluded by `!**/*.lock`

</details>

<details>
<summary>📒 Files selected for processing (129)</summary>

* `.opencode/skills/Utilities/Aphorisms/Database/aphorisms.md`
* `.opencode/skills/Utilities/Aphorisms/SKILL.md`
* `.opencode/skills/Utilities/Aphorisms/Workflows/AddAphorism.md`
* `.opencode/skills/Utilities/Aphorisms/Workflows/FindAphorism.md`
* `.opencode/skills/Utilities/Aphorisms/Workflows/ResearchThinker.md`
* `.opencode/skills/Utilities/Aphorisms/Workflows/SearchAphorisms.md`
* `.opencode/skills/Utilities/Browser/README.md`
* `.opencode/skills/Utilities/Browser/SKILL.md`
* `.opencode/skills/Utilities/Browser/Tools/Browse.ts`
* `.opencode/skills/Utilities/Browser/Tools/BrowserSession.ts`
* `.opencode/skills/Utilities/Browser/Workflows/Extract.md`
* `.opencode/skills/Utilities/Browser/Workflows/Interact.md`
* `.opencode/skills/Utilities/Browser/Workflows/Screenshot.md`
* `.opencode/skills/Utilities/Browser/Workflows/Update.md`
* `.opencode/skills/Utilities/Browser/Workflows/VerifyPage.md`
* `.opencode/skills/Utilities/Browser/examples/comprehensive-test.ts`
* `.opencode/skills/Utilities/Browser/examples/screenshot.ts`
* `.opencode/skills/Utilities/Browser/examples/verify-page.ts`
* `.opencode/skills/Utilities/Browser/index.ts`
* `.opencode/skills/Utilities/Browser/package.json`
* `.opencode/skills/Utilities/Browser/tsconfig.json`
* `.opencode/skills/Utilities/Cloudflare/SKILL.md`
* `.opencode/skills/Utilities/Cloudflare/Workflows/Create.md`
* `.opencode/skills/Utilities/Cloudflare/Workflows/Troubleshoot.md`
* `.opencode/skills/Utilities/CreateCLI/FrameworkComparison.md`
* `.opencode/skills/Utilities/CreateCLI/Patterns.md`
* `.opencode/skills/Utilities/CreateCLI/SKILL.md`
* `.opencode/skills/Utilities/CreateCLI/TypescriptPatterns.md`
* `.opencode/skills/Utilities/CreateCLI/Workflows/AddCommand.md`
* `.opencode/skills/Utilities/CreateCLI/Workflows/CreateCli.md`
* `.opencode/skills/Utilities/CreateCLI/Workflows/UpgradeTier.md`
* `.opencode/skills/Utilities/CreateSkill/SKILL.md`
* `.opencode/skills/Utilities/CreateSkill/Workflows/CanonicalizeSkill.md`
* `.opencode/skills/Utilities/CreateSkill/Workflows/CreateSkill.md`
* `.opencode/skills/Utilities/CreateSkill/Workflows/UpdateSkill.md`
* `.opencode/skills/Utilities/CreateSkill/Workflows/ValidateSkill.md`
* `.opencode/skills/Utilities/Delegation/SKILL.md`
* `.opencode/skills/Utilities/Documents/SKILL.md`
* `.opencode/skills/Utilities/Documents/Workflows/ProcessLargePdfGemini3.md`
* `.opencode/skills/Utilities/Docx/LICENSE.txt`
* `.opencode/skills/Utilities/Docx/Ooxml/Scripts/pack.py`
* `.opencode/skills/Utilities/Docx/Ooxml/Scripts/unpack.py`
* `.opencode/skills/Utilities/Docx/Ooxml/Scripts/validate.py`
* `.opencode/skills/Utilities/Docx/SKILL.md`
* `.opencode/skills/Utilities/Docx/Scripts/__init__.py`
* `.opencode/skills/Utilities/Docx/Scripts/document.py`
* `.opencode/skills/Utilities/Docx/Scripts/utilities.py`
* `.opencode/skills/Utilities/Docx/docx-js.md`
* `.opencode/skills/Utilities/Docx/ooxml.md`
* `.opencode/skills/Utilities/Evals/BestPractices.md`
* `.opencode/skills/Utilities/Evals/CLIReference.md`
* `.opencode/skills/Utilities/Evals/Data/DomainPatterns.yaml`
* `.opencode/skills/Utilities/Evals/Graders/Base.ts`
* `.opencode/skills/Utilities/Evals/Graders/CodeBased/BinaryTests.ts`
* `.opencode/skills/Utilities/Evals/Graders/CodeBased/RegexMatch.ts`
* `.opencode/skills/Utilities/Evals/Graders/CodeBased/StateCheck.ts`
* `.opencode/skills/Utilities/Evals/Graders/CodeBased/StaticAnalysis.ts`
* `.opencode/skills/Utilities/Evals/Graders/CodeBased/StringMatch.ts`
* `.opencode/skills/Utilities/Evals/Graders/CodeBased/ToolCallVerification.ts`
* `.opencode/skills/Utilities/Evals/Graders/CodeBased/index.ts`
* `.opencode/skills/Utilities/Evals/Graders/ModelBased/LLMRubric.ts`
* `.opencode/skills/Utilities/Evals/Graders/ModelBased/NaturalLanguageAssert.ts`
* `.opencode/skills/Utilities/Evals/Graders/ModelBased/PairwiseComparison.ts`
* `.opencode/skills/Utilities/Evals/Graders/ModelBased/index.ts`
* `.opencode/skills/Utilities/Evals/Graders/index.ts`
* `.opencode/skills/Utilities/Evals/PROJECT.md`
* `.opencode/skills/Utilities/Evals/SKILL.md`
* `.opencode/skills/Utilities/Evals/ScienceMapping.md`
* `.opencode/skills/Utilities/Evals/ScorerTypes.md`
* `.opencode/skills/Utilities/Evals/Suites/Regression/core-behaviors.yaml`
* `.opencode/skills/Utilities/Evals/TemplateIntegration.md`
* `.opencode/skills/Utilities/Evals/Tools/AlgorithmBridge.ts`
* `.opencode/skills/Utilities/Evals/Tools/FailureToTask.ts`
* `.opencode/skills/Utilities/Evals/Tools/SuiteManager.ts`
* `.opencode/skills/Utilities/Evals/Tools/TranscriptCapture.ts`
* `.opencode/skills/Utilities/Evals/Tools/TrialRunner.ts`
* `.opencode/skills/Utilities/Evals/Types/index.ts`
* `.opencode/skills/Utilities/Evals/UseCases/Regression/task_file_targeting_basic.yaml`
* `.opencode/skills/Utilities/Evals/UseCases/Regression/task_no_hallucinated_paths.yaml`
* `.opencode/skills/Utilities/Evals/UseCases/Regression/task_tool_sequence_read_before_edit.yaml`
* `.opencode/skills/Utilities/Evals/UseCases/Regression/task_verification_before_done.yaml`
* `.opencode/skills/Utilities/Evals/Workflows/CompareModels.md`
* `.opencode/skills/Utilities/Evals/Workflows/ComparePrompts.md`
* `.opencode/skills/Utilities/Evals/Workflows/CreateJudge.md`
* `.opencode/skills/Utilities/Evals/Workflows/CreateUseCase.md`
* `.opencode/skills/Utilities/Evals/Workflows/RunEval.md`
* `.opencode/skills/Utilities/Evals/Workflows/ViewResults.md`
* `.opencode/skills/Utilities/PAIUpgrade/SKILL.md`
* `.opencode/skills/Utilities/PAIUpgrade/Tools/Anthropic.ts`
* `.opencode/skills/Utilities/PAIUpgrade/Workflows/CheckForUpgrades.md`
* `.opencode/skills/Utilities/PAIUpgrade/Workflows/FindSources.md`
* `.opencode/skills/Utilities/PAIUpgrade/Workflows/ReleaseNotesDeepDive.md`
* `.opencode/skills/Utilities/PAIUpgrade/Workflows/ResearchUpgrade.md`
* `.opencode/skills/Utilities/PAIUpgrade/sources.json`
* `.opencode/skills/Utilities/PAIUpgrade/youtube-channels.json`
* `.opencode/skills/Utilities/Parser/EntitySystem.md`
* `.opencode/skills/Utilities/Parser/Lib/parser.ts`
* `.opencode/skills/Utilities/Parser/Lib/validators.ts`
* `.opencode/skills/Utilities/Parser/Prompts/entity-extraction.md`
* `.opencode/skills/Utilities/Parser/Prompts/link-analysis.md`
* `.opencode/skills/Utilities/Parser/Prompts/summarization.md`
* `.opencode/skills/Utilities/Parser/Prompts/topic-classification.md`
* `.opencode/skills/Utilities/Parser/README.md`
* `.opencode/skills/Utilities/Parser/SKILL.md`
* `.opencode/skills/Utilities/Parser/Schema/content-schema.json`
* `.opencode/skills/Utilities/Parser/Schema/schema.ts`
* `.opencode/skills/Utilities/Parser/Tests/fixtures/example-output.json`
* `.opencode/skills/Utilities/Parser/Utils/collision-detection.ts`
* `.opencode/skills/Utilities/Parser/Web/README.md`
* `.opencode/skills/Utilities/Parser/Web/debug.html`
* `.opencode/skills/Utilities/Parser/Web/index.html`
* `.opencode/skills/Utilities/Parser/Web/parser.js`
* `.opencode/skills/Utilities/Parser/Web/simple-test.html`
* `.opencode/skills/Utilities/Parser/Web/styles.css`
* `.opencode/skills/Utilities/Parser/Workflows/BatchEntityExtractionGemini3.md`
* `.opencode/skills/Utilities/Parser/Workflows/CollisionDetection.md`
* `.opencode/skills/Utilities/Parser/Workflows/DetectContentType.md`
* `.opencode/skills/Utilities/Parser/Workflows/ExtractArticle.md`
* `.opencode/skills/Utilities/Parser/Workflows/ExtractBrowserExtension.md`
* `.opencode/skills/Utilities/Parser/Workflows/ExtractNewsletter.md`
* `.opencode/skills/Utilities/Parser/Workflows/ExtractPdf.md`
* `.opencode/skills/Utilities/Parser/Workflows/ExtractTwitter.md`
* `.opencode/skills/Utilities/Parser/Workflows/ExtractYoutube.md`
* `.opencode/skills/Utilities/Parser/Workflows/ParseContent.md`
* `.opencode/skills/Utilities/Parser/entity-index.json`
* `.opencode/skills/Utilities/Pdf/LICENSE.txt`
* `.opencode/skills/Utilities/Pdf/SKILL.md`
* `.opencode/skills/Utilities/Pdf/Scripts/check_bounding_boxes.py`
* `.opencode/skills/Utilities/Pdf/Scripts/check_bounding_boxes_test.py`

</details>

</details>

<!-- This is an auto-generated comment by CodeRabbit for review status -->

Comment on lines +89 to +90
private pendingDialog: DialogInfo | null = null
private autoHandleDialogs: boolean = false

⚠️ Potential issue | 🔴 Critical

🧩 Analysis chain

🏁 Script executed:

# Inspect the file .opencode/skills/Utilities/Browser/index.ts
wc -l .opencode/skills/Utilities/Browser/index.ts

Repository: Steffen025/pai-opencode

Length of output: 114


🏁 Script executed:

# Inspect lines 1-100 (imports and class definition)
head -100 .opencode/skills/Utilities/Browser/index.ts | cat -n

Repository: Steffen025/pai-opencode

Length of output: 3340


🏁 Script executed:

# Inspect lines 80-120 (dialog properties and attachPageListeners)
sed -n '80,120p' .opencode/skills/Utilities/Browser/index.ts | cat -n

Repository: Steffen025/pai-opencode

Length of output: 1673


🏁 Script executed:

# Inspect lines 180-210 (dialog event listener)
sed -n '180,210p' .opencode/skills/Utilities/Browser/index.ts | cat -n

Repository: Steffen025/pai-opencode

Length of output: 1104


🏁 Script executed:

# Inspect lines 686-720 (handleDialog method)
sed -n '686,720p' .opencode/skills/Utilities/Browser/index.ts | cat -n

Repository: Steffen025/pai-opencode

Length of output: 1327


Dialog handle is not stored – handleDialog() cannot process the active dialog.

Currently attachPageListeners() (lines 182–199) stores only the DialogInfo, not the dialog handle itself. handleDialog() (lines 686–709) then tries to handle the next dialog via page.once('dialog', ...) instead of processing the already buffered one. As a result, the active dialog can never be closed.

The import (line 15) also does not include the Dialog type.

🔧 Suggested fix
-import { chromium, firefox, webkit, type Browser, type Page, type BrowserContext } from 'playwright'
+import { chromium, firefox, webkit, type Browser, type Page, type BrowserContext, type Dialog } from 'playwright'

 export class PlaywrightBrowser {
   private browser: Browser | null = null
   private context: BrowserContext | null = null
   private page: Page | null = null
   private consoleLogs: ConsoleLogEntry[] = []
   private networkLogs: NetworkLogEntry[] = []
   private requestTimings: Map<string, number> = new Map()
   private pendingDialog: DialogInfo | null = null
+  private pendingDialogHandle: Dialog | null = null
   private autoHandleDialogs: boolean = false
   private dialogResponse: string | boolean = true

   // In attachPageListeners():
   page.on('dialog', async dialog => {
+    this.pendingDialogHandle = dialog
     this.pendingDialog = {
       type: dialog.type() as DialogInfo['type'],
       message: dialog.message(),
       defaultValue: dialog.defaultValue()
     }

     if (this.autoHandleDialogs) {
       if (typeof this.dialogResponse === 'string') {
         await dialog.accept(this.dialogResponse)
       } else if (this.dialogResponse) {
         await dialog.accept()
       } else {
         await dialog.dismiss()
       }
       this.pendingDialog = null
+      this.pendingDialogHandle = null
     }
   })

   // In handleDialog():
   async handleDialog(action: 'accept' | 'dismiss', promptText?: string): Promise<void> {
-    const page = this.ensurePage()
+    this.ensurePage()
 
     if (!this.pendingDialog) {
       throw new Error('No pending dialog to handle')
     }
 
-    page.once('dialog', async dialog => {
-      if (action === 'accept') {
-        await dialog.accept(promptText)
-      } else {
-        await dialog.dismiss()
-      }
-    })
+    const dialog = this.pendingDialogHandle
+    if (!dialog) {
+      throw new Error('No pending dialog handle to manage')
+    }
+    if (action === 'accept') {
+      await dialog.accept(promptText)
+    } else {
+      await dialog.dismiss()
+    }
 
     this.pendingDialog = null
+    this.pendingDialogHandle = null
   }
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Browser/index.ts around lines 89 - 90, the
implementation currently buffers only a DialogInfo object instead of the real
dialog handle, so handleDialog() cannot close the active dialog; adjust
attachPageListeners() so that it additionally stores the real dialog object
(type Dialog) in a new or existing property (e.g. pendingDialog: Dialog | null
instead of DialogInfo), update the class property definition (remove/replace
private pendingDialog: DialogInfo | null with private pendingDialog: Dialog |
null) and add the Dialog type to the import block; change handleDialog() so
that it first checks whether pendingDialog exists and uses that direct handle
(instead of page.once('dialog', ...)), falling back to page.once('dialog', ...)
only when no buffered dialog exists; make sure the autoHandleDialogs logic
still works with the new pendingDialog type.

Comment on lines +103 to +111
function json(data: any, status = 200): Response {
return new Response(JSON.stringify(data), {
status,
headers: {
'Content-Type': 'application/json',
'Access-Control-Allow-Origin': '*',
'Access-Control-Allow-Methods': 'GET, POST, DELETE, OPTIONS',
'Access-Control-Allow-Headers': 'Content-Type'
}

⚠️ Potential issue | 🔴 Critical

Unauthenticated session API with Access-Control-Allow-Origin: * is critically exploitable

Currently any foreign web page can drive the local browser server directly (including /evaluate, /navigate, /stop) and read the responses. At minimum, enforce host binding to localhost plus token-based auth, and restrict CORS.

🔒 Hardening proposal
 const CONFIG = {
   port: parseInt(process.env.BROWSER_PORT || '9222'),
+  host: '127.0.0.1',
+  apiToken: process.env.BROWSER_SESSION_TOKEN || '',
   headless: process.env.BROWSER_HEADLESS === 'true',
@@
-function json(data: any, status = 200): Response {
+function json(data: any, status = 200, origin = 'http://localhost'): Response {
   return new Response(JSON.stringify(data), {
@@
-      'Access-Control-Allow-Origin': '*',
+      'Access-Control-Allow-Origin': origin,
@@
 }
+
+function requireAuth(req: Request): Response | null {
+  if (!CONFIG.apiToken) return null
+  const token = req.headers.get('x-browser-session-token')
+  if (token !== CONFIG.apiToken) return error('Unauthorized', 401)
+  return null
+}
@@
 const server = Bun.serve({
+  hostname: CONFIG.host,
   port: CONFIG.port,
@@
   async fetch(req) {
+    const authError = requireAuth(req)
+    if (authError) return authError

Also applies to: 153-160, 169-420

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Browser/Tools/BrowserSession.ts around lines 103
- 111, The json helper currently emits Access-Control-Allow-Origin: * exposing
session endpoints; change it to use a configured allowedOrigin (default to
http://localhost or derived from a BIND_HOST/BIND_ORIGIN env var) and set
Access-Control-Allow-Credentials appropriately, and update CORS response headers
for OPTIONS. Additionally, enforce token-based auth by validating an
Authorization header (e.g., a shared token/env variable) in all session route
handlers such as evaluate, navigate, and stop (or a central middleware used by
those handlers) and reject requests without the valid token or from non-local
hosts; ensure the same allowedOrigin value and auth-check are applied for the
endpoints referenced (json, evaluate, navigate, stop).

Comment on lines +177 to +200
const result: TestResult = {
passed: true,
issues: [],
consoleErrors: [],
networkErrors: [],
visualIssues: [],
};

try {
// Note: These Chrome MCP commands should be executed by the OpenCode agent
// This TypeScript code indicates what should be done
console.log(" → Navigate to site");
console.log(" → Take screenshot");
console.log(" → Check console errors");
console.log(" → Check network requests");

// The agent should use:
// mcp__Chrome__navigate_page({ url })
// mcp__Chrome__take_screenshot()
// mcp__Chrome__list_console_messages()
// mcp__Chrome__list_network_requests()

result.issues.push("AGENT_ACTION_REQUIRED: Use Chrome MCP to test deployment");


⚠️ Potential issue | 🔴 Critical

Browser test can currently pass as successful without having run.

testDeploymentWithChrome() leaves passed at true even though only AGENT_ACTION_REQUIRED is recorded; an untested deployment is therefore reported as a success.

✅ Suggested fix
-    result.issues.push("AGENT_ACTION_REQUIRED: Use Chrome MCP to test deployment");
+    result.passed = false;
+    result.issues.push("AGENT_ACTION_REQUIRED: Use Chrome MCP to test deployment");

Also applies to: 306-309

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Cloudflare/Workflows/Troubleshoot.md around lines
177 - 200, testDeploymentWithChrome() sets result.passed = true at
initialization then pushes "AGENT_ACTION_REQUIRED" but never clears passed to
false, so untested deployments are reported as successful; update the logic in
testDeploymentWithChrome()/where TestResult result is built (result) to set
result.passed = false whenever you add an agent-action issue like
"AGENT_ACTION_REQUIRED" (or more generally when any non-empty issues array is
present), or invert the initialization so passed is computed after checks (e.g.,
passed = issues.length === 0 && consoleErrors.length === 0 &&
networkErrors.length === 0 && visualIssues.length === 0) to ensure the presence
of AGENT_ACTION_REQUIRED marks the test as failed.
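The invariant the fix establishes — passed derived from the issue lists instead of initialized to true — can be sketched in Python; the parameter names mirror the TestResult fields, and the helper itself is illustrative:

```python
def compute_passed(issues, console_errors, network_errors, visual_issues) -> bool:
    """A deployment test passes only when every issue list is empty."""
    return not (issues or console_errors or network_errors or visual_issues)
```

Computing passed at the end, rather than initializing it optimistically, makes it impossible for a pending AGENT_ACTION_REQUIRED marker to be reported as success.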


for (const command of params.commands) {
try {
const result = await $`cd ${workingDir} && ${command}`.quiet().nothrow();

⚠️ Potential issue | 🔴 Critical

Critical command-injection vulnerability.

The command string from params.commands is interpolated directly into the shell. A malicious command string such as ruff; rm -rf / would execute both commands.

🔒 Recommended mitigation
+// Whitelist of allowed commands
+const ALLOWED_COMMANDS = ['ruff', 'mypy', 'bandit', 'eslint', 'tsc', 'biome'];
+
 for (const command of params.commands) {
+  const baseCommand = command.split(/\s+/)[0];
+  if (!ALLOWED_COMMANDS.includes(baseCommand)) {
+    results.push({
+      command,
+      passed: false,
+      output: `Command not allowed: ${baseCommand}`,
+      warnings: 0,
+      errors: 1,
+    });
+    continue;
+  }

Alternatively: use Bun.spawn with array arguments instead of a shell string.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Evals/Graders/CodeBased/StaticAnalysis.ts at line
29, the code performs shell interpolation of `command` (from `params.commands`)
in `const result = await $`cd ${workingDir} && ${command}`.quiet().nothrow();`,
which enables command injection; change the execution so that no shell-joined
string interpolation is used: call Bun.spawn (or a similar API) with an array of
arguments (e.g. ["ruff", "<args>"]) and pass `workingDir` as the cwd option;
additionally validate/whitelist the elements of `params.commands` before
spawning and strip/forbid shell metacharacters so that `command` is never
interpolated directly into a shell line.
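The no-shell pattern asked for above can be shown in Python as well: split the command into an argv list, reject tools outside a whitelist, and run without a shell so metacharacters like `;` are never interpreted. The allowed-tool set is taken from the suggestion; run_checked is an illustrative name:

```python
import shlex
import subprocess

# Whitelist from the review suggestion; a real project would load this from config
ALLOWED_COMMANDS = {"ruff", "mypy", "bandit", "eslint", "tsc", "biome"}

def run_checked(command: str, working_dir: str):
    """Split command into argv, reject non-whitelisted tools, and run
    without shell=True so the string is never joined back into a shell line."""
    argv = shlex.split(command)
    if not argv or argv[0] not in ALLOWED_COMMANDS:
        return None  # rejected: empty command or tool not on the whitelist
    return subprocess.run(argv, cwd=working_dir, capture_output=True, text=True)
```

Note that an injection attempt like `ruff; rm -rf /` fails the whitelist check, because shlex.split yields `ruff;` as the first token, which is not an allowed tool.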

Comment on lines +67 to +81
executor: async (t, trialNum) => {
// For ALGORITHM integration, we use a simplified executor
// that captures the current agent's work
const transcript = createTranscript(t.id, `trial_${trialNum}`, {
turns: [
{ role: 'system', content: t.description },
{ role: 'assistant', content: 'Task executed via ALGORITHM' },
],
toolCalls: [],
});

return {
output: 'Executed via ALGORITHM bridge',
transcript,
};

⚠️ Potential issue | 🔴 Critical

The bridge executor does not evaluate a real algorithm run.

Neither is THE ALGORITHM invoked here nor is an existing output/transcript carried over; every trial receives the same synthetic placeholder. score and passed are therefore coupled not to the real implementation but only to how the graders react to this stub. Please feed the real executor, or the real transcript, into TrialRunner.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Evals/Tools/AlgorithmBridge.ts around lines 67 -
81, The executor in AlgorithmBridge.ts currently returns a synthetic transcript
created via createTranscript and a placeholder output, which prevents real
algorithm runs from affecting score/passed; change executor (the async function
defined with parameters (t, trialNum)) to invoke the real algorithm runner
and/or accept the actual transcript/output produced by TrialRunner (or whatever
real runner method is used), then populate the returned object with that real
output and transcript instead of the hardcoded placeholder; ensure the
implementation forwards errors and preserves toolCalls from the real run so
downstream grading uses the true execution results.

Security fixes applied:
- unpack.py: prevent zip-slip via safe extractall with path validation
- document.py: fix import casing (ooxml.scripts -> Ooxml.Scripts)
- BinaryTests.ts: prevent command injection by splitting command string to array
- StateCheck.ts: prevent path traversal via resolve()+prefix check; mask env secrets in results
- StaticAnalysis.ts: prevent command injection by splitting command string to array
- SuiteManager.ts: validate suite name (alphanumeric/hyphens/underscores only); prevent silent overwrite of existing suites

Config fix:
- .coderabbit.yaml: change language from 'de' to 'en' for English reviews

Intentionally skipped:
- Browser/index.ts Dialog handle: architectural refactor, out of scope for this PR
- BrowserSession.ts CORS: intentional local-dev-only setting
- Cloudflare/Troubleshoot.md TypeScript in markdown: markdown doc, not runnable
- Evals/AlgorithmBridge.ts synthetic executor: intentional design placeholder

@coderabbitai coderabbitai bot left a comment


Actionable comments posted: 3

♻️ Duplicate comments (2)
.opencode/skills/Utilities/Evals/Graders/CodeBased/BinaryTests.ts (1)

24-36: ⚠️ Potential issue | 🔴 Critical

Command injection vulnerability partially addressed but still present.

The code now splits command into commandParts (line 33), which helps with that variable, but workingDir and testFile are still interpolated directly into the shell template. The cd ${workingDir} && pattern uses shell chaining, allowing injection via crafted paths (e.g., workingDir = "; rm -rf /; cd ").

Use Bun.spawn with the cwd option to eliminate shell interpolation entirely:


🔒 Proposed fix using Bun.spawn
+import { resolve, normalize } from 'path';
+
+// Add path validation helper
+function validatePath(basePath: string, targetPath: string): string {
+  const resolvedBase = resolve(basePath);
+  const resolvedTarget = resolve(basePath, targetPath);
+  if (!resolvedTarget.startsWith(resolvedBase)) {
+    throw new Error(`Path traversal detected: ${targetPath}`);
+  }
+  return resolvedTarget;
+}

 // In grade() method, replace the shell execution:
-const commandParts = command.split(/\s+/);
-const result = await $`cd ${workingDir} && timeout ${Math.ceil(timeout/1000)} ${commandParts} ${testFile}`
-  .quiet()
-  .nothrow();
+const safeWorkingDir = resolve(workingDir);
+const safeTestFile = validatePath(safeWorkingDir, testFile);
+const commandParts = command.split(/\s+/);
+const timeoutSec = Math.ceil(timeout / 1000);
+
+const proc = Bun.spawn(['timeout', String(timeoutSec), ...commandParts, safeTestFile], {
+  cwd: safeWorkingDir,
+  stdout: 'pipe',
+  stderr: 'pipe',
+});
+
+const [stdout, stderr] = await Promise.all([
+  new Response(proc.stdout).text(),
+  new Response(proc.stderr).text(),
+]);
+const exitCode = await proc.exited;
+
+const passed = exitCode === 0;
+results.push({
+  file: testFile,
+  passed,
+  output: stdout.slice(-500),
+  error: passed ? undefined : stderr.slice(-500),
+});
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Evals/Graders/CodeBased/BinaryTests.ts around
lines 24 - 36, Replace the shell interpolation that builds a command string with
Bun.spawn to avoid any shell injection: stop using the template that
interpolates workingDir and testFile in the `$` call in BinaryTests.ts and
instead call Bun.spawn with cwd set to workingDir and args set to an array built
from commandParts plus testFile (where commandParts comes from detectTestCommand
or params.test_command); implement timeout handling via an AbortController (or
Bun.spawn's timeout mechanism) so the child is killed after params.timeout_ms,
capture stdout/stderr and exit code to populate the existing results entry (the
same shape used for result/passed/output/error), and ensure no user-controlled
values are ever concatenated into a shell string.
.opencode/skills/Utilities/Docx/Ooxml/Scripts/unpack.py (1)

21-24: ⚠️ Potential issue | 🔴 Critical

Path containment check is still bypassable with startswith.

Lines 22-23 compare string prefixes; paths like /tmp/out_evil/... still satisfy startswith("/tmp/out"). This weakens the zip-slip mitigation.

Suggested fix
 with zipfile.ZipFile(input_file) as zf:
-    for member in zf.namelist():
-        member_path = (output_path / member).resolve()
-        if not str(member_path).startswith(str(output_path.resolve())):
-            raise ValueError(f"Unsafe zip entry rejected (zip-slip): {member}")
-    zf.extractall(output_path)
+    root = output_path.resolve()
+    for info in zf.infolist():
+        target = (root / info.filename).resolve()
+        try:
+            target.relative_to(root)
+        except ValueError:
+            raise ValueError(f"Unsafe zip entry rejected (zip-slip): {info.filename}") from None
+    zf.extractall(root)
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Docx/Ooxml/Scripts/unpack.py around lines 21 -
24, The current zip-slip check uses string startswith and can be bypassed;
replace that logic by resolving both paths and using Path.relative_to (or
catching ValueError from relative_to) to ensure member_path is actually inside
output_path: compute member_path = (output_path / member).resolve(), compute
out_resolved = output_path.resolve(), then try
member_path.relative_to(out_resolved) and raise the ValueError if it raises; do
this for each member before calling zf.extractall so the check cannot be
bypassed by path string prefixes.
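The Path.relative_to containment check from the suggested fix can be exercised as a standalone Python sketch; safe_extract is an illustrative wrapper name around the same logic:

```python
import zipfile
from pathlib import Path

def safe_extract(zip_path: str, output_dir: str) -> None:
    """Reject zip-slip entries via Path.relative_to, which (unlike
    str.startswith) cannot be fooled by sibling dirs sharing a prefix."""
    root = Path(output_dir).resolve()
    with zipfile.ZipFile(zip_path) as zf:
        for info in zf.infolist():
            target = (root / info.filename).resolve()
            try:
                target.relative_to(root)
            except ValueError:
                raise ValueError(
                    f"Unsafe zip entry rejected (zip-slip): {info.filename}"
                ) from None
        zf.extractall(root)
```

An archive entry like `../evil.txt` resolves to a path outside root, so relative_to raises ValueError and extraction is refused before any file is written.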
🧹 Nitpick comments (4)
.opencode/skills/Utilities/Evals/Graders/CodeBased/BinaryTests.ts (1)

42-43: Output truncation discards context; consider keeping head and tail.

Keeping only the last 500 characters may lose important information from the beginning of test output (e.g., which test failed first). Consider keeping both the first and last portions:

♻️ Optional: preserve head and tail of output
-output: result.stdout.toString().slice(-500),  // Last 500 chars
-error: passed ? undefined : result.stderr.toString().slice(-500),
+output: truncateMiddle(result.stdout.toString(), 1000),
+error: passed ? undefined : truncateMiddle(result.stderr.toString(), 1000),

Where truncateMiddle keeps first 400 + ...truncated... + last 400 chars if over limit.

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Evals/Graders/CodeBased/BinaryTests.ts around
lines 42 - 43, The current assignment for output and error uses only the last
500 chars (result.stdout.toString().slice(-500) /
result.stderr.toString().slice(-500)), which can drop important head context;
replace this with a truncation helper (e.g., truncateMiddle) that returns the
full string if under limit or first N + "…truncated…" + last N when over limit,
and call it for both output and error (use result.stdout.toString() and
result.stderr.toString()) so test logs preserve both head and tail; ensure the
helper is used consistently and preserves existing behavior when the string
length is below the limit.
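A minimal Python version of the proposed truncateMiddle helper (the name and 400-character windows are taken from the suggestion above) illustrates the head-plus-tail behavior:

```python
def truncate_middle(text: str, limit: int = 1000, keep: int = 400) -> str:
    """Keep the head and tail of long output so both the first failure
    and the final summary survive truncation."""
    if len(text) <= limit:
        return text
    return text[:keep] + "...truncated..." + text[-keep:]
```

Strings at or under the limit pass through unchanged, which preserves the existing behavior for short test logs.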
.opencode/skills/Utilities/Docx/Scripts/document.py (3)

130-132: Remove redundant import.

The datetime and timezone imports are already present at module level (line 33). This local import is unnecessary.

♻️ Proposed fix
     def _inject_attributes_to_nodes(self, nodes):
         """Inject RSID, author, and date attributes into DOM nodes where applicable.
         ...
         """
-        from datetime import datetime, timezone
-
         timestamp = datetime.now(timezone.utc).strftime("%Y-%m-%dT%H:%M:%SZ")
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Docx/Scripts/document.py around lines 130 - 132,
Remove the redundant local import of datetime and timezone inside the block that
sets timestamp; the module already imports these at top-level, so delete the
local "from datetime import datetime, timezone" and keep the timestamp
assignment using datetime.now(timezone.utc). This change touches the timestamp
variable assignment (timestamp = datetime.now(timezone.utc).strftime(...)) so
ensure that code uses the top-level datetime and timezone imports.

650-653: Replace print statement with logging.

Library code should avoid unconditional print() statements as they can interfere with consuming applications. Use the logging module for diagnostic output.

♻️ Proposed fix

Add import at top of file:

import logging

Then replace the print:

         # Generate RSID if not provided
         self.rsid = rsid if rsid else _generate_rsid()
-        print(f"Using RSID: {self.rsid}")
+        logging.debug(f"Using RSID: {self.rsid}")
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Docx/Scripts/document.py around lines 650 - 653,
Replace the unconditional print by using the logging module: add an import
logging at top and create or use a module logger (e.g., logger =
logging.getLogger(__name__)); then change the print(f"Using RSID: {self.rsid}")
in the constructor/initializer where self.rsid is set (and where
_generate_rsid() is called) to an appropriate logger call such as logger.debug
or logger.info so library consumers control output via logging configuration.

833-836: Consider adding context manager support for reliable cleanup.

The __del__ destructor is not guaranteed to run (e.g., circular references, interpreter shutdown). For more reliable cleanup of the temporary directory, consider implementing the context manager protocol (__enter__/__exit__) or providing an explicit close() method.

♻️ Example context manager addition
def __enter__(self):
    return self

def __exit__(self, exc_type, exc_val, exc_tb):
    self.close()
    return False

def close(self):
    """Clean up temporary directory."""
    if hasattr(self, "temp_dir") and Path(self.temp_dir).exists():
        shutil.rmtree(self.temp_dir)

Usage:

with Document('workspace/unpacked') as doc:
    doc.add_comment(...)
    doc.save()
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Docx/Scripts/document.py around lines 833 - 836,
The destructor (__del__) is unreliable for removing the temporary directory; add
context manager support and an explicit cleanup method: implement __enter__
(return self), __exit__ (call a new close() and return False), and a close()
method that contains the existing cleanup logic (check temp_dir via hasattr and
Path.exists then shutil.rmtree). Update any use sites or docs to prefer "with
Document(...) as doc:" or call doc.close() explicitly to ensure deterministic
cleanup; keep __del__ as a fallback that calls close().
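The proposed context-manager protocol can be sketched standalone in Python; TempWorkspace is an illustrative stand-in for the Document class:

```python
import shutil
import tempfile
from pathlib import Path

class TempWorkspace:
    """Minimal sketch of the context-manager cleanup proposed for Document."""

    def __init__(self) -> None:
        self.temp_dir = tempfile.mkdtemp()

    def __enter__(self) -> "TempWorkspace":
        return self

    def __exit__(self, exc_type, exc_val, exc_tb) -> bool:
        self.close()
        return False  # do not swallow exceptions

    def close(self) -> None:
        """Idempotent explicit cleanup of the temporary directory."""
        if hasattr(self, "temp_dir") and Path(self.temp_dir).exists():
            shutil.rmtree(self.temp_dir)

    def __del__(self) -> None:
        self.close()  # best-effort fallback; not guaranteed to run
```

With this shape, `with TempWorkspace() as ws: ...` removes the directory deterministically on exit, and `__del__` remains only as a safety net.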
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.

Inline comments:
In @.opencode/skills/Utilities/Docx/Ooxml/Scripts/unpack.py:
- Line 35: The generated RSID assignment using random.choices for suggested_rsid
is flagged S311; suppress it by adding an inline noqa comment that explains this
value is only a local, ephemeral hint and is never persisted (e.g., append "#
noqa: S311 — local-only, non-cryptographic RSID for user hinting, not stored or
reused"). Update the line where suggested_rsid is created (the call to
random.choices) to include that explanatory comment so Ruff knows this usage is
intentional.
- Around line 11-12: Replace the use of assert for CLI validation in unpack.py:
do not rely on "assert len(sys.argv) == 3" because asserts are disabled with -O;
instead perform an explicit check of sys.argv, and if the length is incorrect
print a clear usage message and exit with a non-zero status (e.g., via sys.exit
or raising SystemExit). Ensure you still assign input_file = sys.argv[1] and
output_dir = sys.argv[2] after the validation (or use argparse to parse/validate
arguments and provide help text) so the script always enforces correct
invocation even under Python optimization.

In @.opencode/skills/Utilities/Evals/Graders/CodeBased/BinaryTests.ts:
- Around line 55-57: The current score and passed calculations use
params.test_files.length which can be inconsistent or zero; change them to use
results.length instead and guard against zero to avoid division by zero: compute
passCount from results.filter(r => r.passed).length, set score = results.length
? passCount / results.length : 0, and set passed = results.length > 0 &&
passCount === results.length; update the uses of score and passed accordingly
(symbols: passCount, score, passed, params.test_files, results).

---

Duplicate comments:
In @.opencode/skills/Utilities/Docx/Ooxml/Scripts/unpack.py:
- Around line 21-24: The current zip-slip check uses string startswith and can
be bypassed; replace that logic by resolving both paths and using
Path.relative_to (or catching ValueError from relative_to) to ensure member_path
is actually inside output_path: compute member_path = (output_path /
member).resolve(), compute out_resolved = output_path.resolve(), then try
member_path.relative_to(out_resolved) and raise the ValueError if it raises; do
this for each member before calling zf.extractall so the check cannot be
bypassed by path string prefixes.

In @.opencode/skills/Utilities/Evals/Graders/CodeBased/BinaryTests.ts:
- Around line 24-36: Replace the shell interpolation that builds a command
string with Bun.spawn to avoid any shell injection: stop using the template that
interpolates workingDir and testFile in the `$` call in BinaryTests.ts and
instead call Bun.spawn with cwd set to workingDir and args set to an array built
from commandParts plus testFile (where commandParts comes from detectTestCommand
or params.test_command); implement timeout handling via an AbortController (or
Bun.spawn's timeout mechanism) so the child is killed after params.timeout_ms,
capture stdout/stderr and exit code to populate the existing results entry (the
same shape used for result/passed/output/error), and ensure no user-controlled
values are ever concatenated into a shell string.

---

Nitpick comments:
In @.opencode/skills/Utilities/Docx/Scripts/document.py:
- Around line 130-132: Remove the redundant local import of datetime and
timezone inside the block that sets timestamp; the module already imports these
at top-level, so delete the local "from datetime import datetime, timezone" and
keep the timestamp assignment using datetime.now(timezone.utc). This change
touches the timestamp variable assignment (timestamp =
datetime.now(timezone.utc).strftime(...)) so ensure that code uses the top-level
datetime and timezone imports.
- Around line 650-653: Replace the unconditional print by using the logging
module: add an import logging at top and create or use a module logger (e.g.,
logger = logging.getLogger(__name__)); then change the print(f"Using RSID:
{self.rsid}") in the constructor/initializer where self.rsid is set (and where
_generate_rsid() is called) to an appropriate logger call such as logger.debug
or logger.info so library consumers control output via logging configuration.
- Around line 833-836: The destructor (__del__) is unreliable for removing the
temporary directory; add context manager support and an explicit cleanup method:
implement __enter__ (return self), __exit__ (call a new close() and return
False), and a close() method that contains the existing cleanup logic (check
temp_dir via hasattr and Path.exists then shutil.rmtree). Update any use sites
or docs to prefer "with Document(...) as doc:" or call doc.close() explicitly to
ensure deterministic cleanup; keep __del__ as a fallback that calls close().

In @.opencode/skills/Utilities/Evals/Graders/CodeBased/BinaryTests.ts:
- Around line 42-43: The current assignment for output and error uses only the
last 500 chars (result.stdout.toString().slice(-500) /
result.stderr.toString().slice(-500)), which can drop important head context;
replace this with a truncation helper (e.g., truncateMiddle) that returns the
full string if under limit or first N + "…truncated…" + last N when over limit,
and call it for both output and error (use result.stdout.toString() and
result.stderr.toString()) so test logs preserve both head and tail; ensure the
helper is used consistently and preserves existing behavior when the string
length is below the limit.
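The middle-truncation helper described above boils down to the following pattern. The actual file is TypeScript; this Python sketch only illustrates the shape, and the name `truncate_middle` and the 500/200 limits are assumptions taken from the review text.

```python
def truncate_middle(s: str, limit: int = 500, keep: int = 200) -> str:
    """Keep both head and tail of an over-long string, not just the tail."""
    if len(s) <= limit:
        return s  # unchanged when under the limit
    return s[:keep] + "…truncated…" + s[-keep:]
```

Applied to both `stdout` and `stderr`, this preserves the leading context (test names, first failure) that a plain `slice(-500)` drops.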

ℹ️ Review info
⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: 564d8cd2-c9d0-4dd5-ba43-6f8e4add3737

📥 Commits

Reviewing files that changed from the base of the PR and between 8a4140d and a150931.

📒 Files selected for processing (7)
  • .coderabbit.yaml
  • .opencode/skills/Utilities/Docx/Ooxml/Scripts/unpack.py
  • .opencode/skills/Utilities/Docx/Scripts/document.py
  • .opencode/skills/Utilities/Evals/Graders/CodeBased/BinaryTests.ts
  • .opencode/skills/Utilities/Evals/Graders/CodeBased/StateCheck.ts
  • .opencode/skills/Utilities/Evals/Graders/CodeBased/StaticAnalysis.ts
  • .opencode/skills/Utilities/Evals/Tools/SuiteManager.ts
✅ Files skipped from review due to trivial changes (1)
  • .coderabbit.yaml
🚧 Files skipped from review as they are similar to previous changes (2)
  • .opencode/skills/Utilities/Evals/Graders/CodeBased/StaticAnalysis.ts
  • .opencode/skills/Utilities/Evals/Graders/CodeBased/StateCheck.ts

unpack.py:
- Replace assert with explicit if + sys.exit(1) for CLI validation (assert disabled under -O)
- Replace startswith() zip-slip check with Path.relative_to() (not bypassable by prefix match)
- Add noqa: S311 comment on random.choices RSID (local-only hint, not cryptographic)
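The first two unpack.py changes can be sketched together. This is a hedged illustration, not the script's actual code: `safe_extract` and the usage string are made up for the example; only the `relative_to()` guard and the explicit `sys.exit(1)` pattern come from the review.

```python
import sys
import zipfile
from pathlib import Path


def safe_extract(zip_path: str, out_dir: str) -> None:
    """Reject zip entries that would escape out_dir (zip-slip)."""
    out_resolved = Path(out_dir).resolve()
    with zipfile.ZipFile(zip_path) as zf:
        for member in zf.namelist():
            member_path = (out_resolved / member).resolve()
            try:
                # startswith() prefix checks are bypassable
                # (/out vs /out-evil); relative_to() is not.
                member_path.relative_to(out_resolved)
            except ValueError as err:
                raise ValueError(
                    f"Unsafe zip entry rejected (zip-slip): {member}"
                ) from err
            zf.extract(member, out_resolved)


if __name__ == "__main__":
    # Explicit validation instead of assert (assert is stripped under -O).
    if len(sys.argv) != 3:
        print("usage: unpack.py <zip> <out_dir>", file=sys.stderr)
        sys.exit(1)
    safe_extract(sys.argv[1], sys.argv[2])
```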

BinaryTests.ts:
- Use results.length instead of params.test_files.length for score/passed calculation
- Guard against zero-division: score = results.length ? passCount/results.length : 0

document.py:
- Remove redundant local 'from datetime import datetime, timezone' inside _inject_attributes_to_nodes (already imported at module level)
- Replace print(f'Using RSID: ...') with logger.debug() — library code must not write to stdout unconditionally

Intentionally skipped:
- BinaryTests.ts Bun.spawn rewrite: Bun $ with array interpolation is already injection-safe; full spawn rewrite is scope creep
- document.py context manager (__enter__/__exit__/close()): architectural API addition, not a bug
- BinaryTests.ts truncateMiddle helper: feature enhancement, not a bug

@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 3

🧹 Nitpick comments (1)
.opencode/skills/Utilities/Docx/Scripts/document.py (1)

63-71: Replace legacy default author value to match OpenCode naming.

Line 63 and Line 621 still default author to "Claude". This keeps legacy branding in generated metadata and examples.

Suggested patch
-        self, xml_path, rsid: str, author: str = "Claude", initials: str = "C"
+        self, xml_path, rsid: str, author: str = "OpenCode", initials: str = "O"
@@
-            author: Author name for tracked changes and comments (default: "Claude")
-            initials: Author initials (default: "C")
+            author: Author name for tracked changes and comments (default: "OpenCode")
+            initials: Author initials (default: "O")
@@
-        author="Claude",
-        initials="C",
+        author="OpenCode",
+        initials="O",
@@
-            author: Default author name for comments (default: "Claude")
-            initials: Default author initials for comments (default: "C")
+            author: Default author name for comments (default: "OpenCode")
+            initials: Default author initials for comments (default: "O")

Also applies to: 621-633

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Docx/Scripts/document.py around lines 63 - 71,
Update the legacy default author string "Claude" to "OpenCode" in the
constructor signatures and any other places setting default metadata: change the
default for the author parameter in the Document class __init__(self, xml_path,
rsid: str, author: str = "Claude", initials: str = "C") to author: str =
"OpenCode" and make the same replacement at the second occurrence around the
later constructor/initializer (the block at lines ~621–633 that currently
defaults author to "Claude"); ensure any docstring examples or default metadata
assignments in the same file that reference the old default are updated to
"OpenCode" as well.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.

Inline comments:
In @.opencode/skills/Utilities/Docx/Ooxml/Scripts/unpack.py:
- Around line 25-28: The except block that catches ValueError from
member_path.relative_to(out_resolved) should preserve the original exception
context; change the handler to capture the caught exception (e.g., "except
ValueError as err:") and re-raise the new ValueError with explicit chaining
("raise ValueError(f'Unsafe zip entry rejected (zip-slip): {member}') from err")
so the original traceback is preserved; update the exception clause surrounding
member_path.relative_to(out_resolved) accordingly.

In @.opencode/skills/Utilities/Docx/Scripts/document.py:
- Around line 1208-1210: Do not early-return when _has_relationship(editor,
"comments.xml") is true; instead ensure the companion relationships and content
types are present by checking/adding the 'commentsExtended', 'commentsIds', and
'commentsExtensible' pieces and their content-types/relationship entries if
missing. Remove the return that short-circuits on _has_relationship(editor,
"comments.xml") in both occurrences and replace with logic that still
verifies/creates the supplemental relationships/content types (the same checks
used when comments.xml is absent). Keep using the existing helper methods (e.g.,
_has_relationship and whatever add_relationship/add_content_type helpers are
present) so replies' metadata is fully wired.
- Around line 704-712: Ensure xml_path cannot escape self.unpacked_path by
resolving and validating it before creating DocxXMLEditor: build file_path by
joining self.unpacked_path and xml_path, resolve file_path and
self.unpacked_path (or use Path.relative_to) and reject if the resolved
file_path is not inside the resolved self.unpacked_path (raise ValueError for
absolute paths or .. traversal); only after that call DocxXMLEditor(file_path,
rsid=self.rsid, author=self.author, initials=self.initials) and store in
self._editors.

---

Nitpick comments:
In @.opencode/skills/Utilities/Docx/Scripts/document.py:
- Around line 63-71: Update the legacy default author string "Claude" to
"OpenCode" in the constructor signatures and any other places setting default
metadata: change the default for the author parameter in the Document class
__init__(self, xml_path, rsid: str, author: str = "Claude", initials: str = "C")
to author: str = "OpenCode" and make the same replacement at the second
occurrence around the later constructor/initializer (the block at lines ~621–633
that currently defaults author to "Claude"); ensure any docstring examples or
default metadata assignments in the same file that reference the old default are
updated to "OpenCode" as well.

ℹ️ Review info
⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: dedb8b6d-9ab0-4aa2-b9fa-ce0365c4ce90

📥 Commits

Reviewing files that changed from the base of the PR and between a150931 and ae8a56b.

📒 Files selected for processing (3)
  • .opencode/skills/Utilities/Docx/Ooxml/Scripts/unpack.py
  • .opencode/skills/Utilities/Docx/Scripts/document.py
  • .opencode/skills/Utilities/Evals/Graders/CodeBased/BinaryTests.ts
🚧 Files skipped from review as they are similar to previous changes (1)
  • .opencode/skills/Utilities/Evals/Graders/CodeBased/BinaryTests.ts

unpack.py:
- Add exception chaining (raise ... from err) to preserve original ValueError context

document.py:
- __getitem__: validate xml_path stays within unpacked_path using resolve()+relative_to()
- _ensure_comment_relationships: replace single early-return with per-item checks
  for each companion relationship; re-query get_next_rid() per item
- _ensure_comment_content_types: replace single early-return with per-item checks
  for each companion content type
- Replace default author 'Claude' with 'OpenCode' and initials 'C' with 'O'
  in both DocxXMLEditor and Document constructors (branding migration)

Intentionally skipped:
- Bun.$ array interpolation rewrite to Bun.spawn (not a genuine injection vector)
- Mass code-fence language tagging of reference files (scope creep)

@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 3

🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.

Inline comments:
In @.opencode/skills/Utilities/Docx/Scripts/document.py:
- Around line 918-934: The loop that assigns existing[int(comment_id)] must
avoid raising on malformed w:id values; modify the code around the comment_elem
loop to parse comment_id defensively (mirroring _get_next_comment_id) by trying
to convert comment_id to int inside a try/except (or using a safe numeric check)
and skipping the comment when conversion fails or the id is not a non-negative
integer; keep the rest of the logic (finding para_id from w:p and storing into
existing using the validated integer key) unchanged so malformed IDs no longer
abort initialization.
- Around line 637-649: Add a fast-fail guard to validate the required "word"
directory exists under the unpacked content: after creating self.unpacked_path
and setting self.word_path (the "word" folder used by other methods), check that
self.word_path.exists() and self.word_path.is_dir(), and if not raise a
ValueError with a clear message (e.g., "Missing required 'word' directory in
DOCX unpacked content"). Update the constructor in document.py where
self.unpacked_path and self.word_path are assigned to perform this check so
later uses of self.word_path fail early with a helpful error.
- Around line 704-717: The cache uses the raw xml_path as the key which allows
aliased paths to create duplicate editors; in __getitem__ normalize xml_path
before using it (e.g., convert to Path, resolve/relative_to against
self.unpacked_path or call .resolve() then
.relative_to(self.unpacked_path.resolve()) and use that normalized relative
string or path as the key) so lookups and assignments to self._editors use the
normalized path instead of the original xml_path; ensure file_path is still
validated and passed to DocxXMLEditor(rsid=self.rsid, author=self.author,
initials=self.initials) but store and return self._editors[normalized_key].
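The defensive ID parse suggested for `_load_existing_comments` follows this shape (a sketch: the hypothetical `(comment_id, para_id)` tuples stand in for the real lxml element handling, and the function name is illustrative):

```python
def load_existing_comments(comment_elems):
    """Map valid non-negative integer comment IDs to para IDs, skipping malformed ones."""
    existing = {}
    for comment_id, para_id in comment_elems:
        try:
            key = int(comment_id)
        except (TypeError, ValueError):
            continue  # malformed w:id -- skip instead of aborting init
        if key < 0:
            continue  # IDs must be non-negative integers
        existing[key] = para_id
    return existing
```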

ℹ️ Review info
⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: add211be-7e01-485e-bdb6-38a3661a6bd8

📥 Commits

Reviewing files that changed from the base of the PR and between ae8a56b and 9dcd7d5.

📒 Files selected for processing (2)
  • .opencode/skills/Utilities/Docx/Ooxml/Scripts/unpack.py
  • .opencode/skills/Utilities/Docx/Scripts/document.py
🚧 Files skipped from review as they are similar to previous changes (1)
  • .opencode/skills/Utilities/Docx/Ooxml/Scripts/unpack.py

document.py:
- _load_existing_comments: wrap int(comment_id) in try/except ValueError and
  skip non-negative-integer IDs (mirrors _get_next_comment_id pattern)
- Document.__init__: add fast-fail guard after self.word_path assignment —
  raise ValueError if 'word' directory is missing or not a directory
- __getitem__: normalize cache key using resolve()+relative_to() so aliased
  paths (e.g. 'word/../word/document.xml') reuse the same editor instance;
  traversal detection is now a side-effect of the same normalization step

@coderabbitai coderabbitai bot left a comment

Actionable comments posted: 1

🧹 Nitpick comments (1)
.opencode/skills/Utilities/Docx/Scripts/document.py (1)

640-648: Protect temp workspace from constructor-failure leaks.

At Line 641, a temp directory is created before operations at Lines 643 and 647 that can raise. If initialization fails, cleanup is non-deterministic (__del__ timing), so temp dirs can be left behind.

Proposed patch
-        self.temp_dir = tempfile.mkdtemp(prefix="docx_")
-        self.unpacked_path = Path(self.temp_dir) / "unpacked"
-        shutil.copytree(self.original_path, self.unpacked_path)
-
-        # Pack original directory into temporary .docx for validation baseline (outside unpacked dir)
-        self.original_docx = Path(self.temp_dir) / "original.docx"
-        pack_document(self.original_path, self.original_docx, validate=False)
+        self.temp_dir = tempfile.mkdtemp(prefix="docx_")
+        try:
+            self.unpacked_path = Path(self.temp_dir) / "unpacked"
+            shutil.copytree(self.original_path, self.unpacked_path)
+
+            # Pack original directory into temporary .docx for validation baseline (outside unpacked dir)
+            self.original_docx = Path(self.temp_dir) / "original.docx"
+            pack_document(self.original_path, self.original_docx, validate=False)
+        except Exception:
+            shutil.rmtree(self.temp_dir, ignore_errors=True)
+            raise
🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Docx/Scripts/document.py around lines 640 - 648,
The constructor creates a temp dir (tempfile.mkdtemp) and assigns self.temp_dir
before running operations that can raise (shutil.copytree and pack_document),
risking leaked temp files; change initialization so you create the temp dir into
a local variable first, perform shutil.copytree(self.original_path,
local_unpacked) and pack_document(self.original_path, local_original_docx,
validate=False), and only after those succeed assign self.temp_dir,
self.unpacked_path and self.original_docx; alternatively wrap the steps in a
try/except/finally that removes the temp dir (shutil.rmtree) on any exception to
guarantee cleanup. Ensure references in the fix match symbols temp_dir,
unpacked_path, original_docx, shutil.copytree, pack_document and
tempfile.mkdtemp.
🤖 Prompt for all review comments with AI agents
Verify each finding against the current code and only fix it if needed.

Inline comments:
In @.opencode/skills/Utilities/Docx/Scripts/document.py:
- Around line 710-715: The exception raised in the traversal guard loses the
original cause; when catching the ValueError from
file_path.resolve().relative_to(self.unpacked_path.resolve()) (the code
computing normalized_key), capture the original exception (e.g., except
ValueError as err) and re-raise the new ValueError(f"Path traversal detected:
{xml_path}") from err so the original traceback is preserved for debugging.

---

Nitpick comments:
In @.opencode/skills/Utilities/Docx/Scripts/document.py:
- Around line 640-648: The constructor creates a temp dir (tempfile.mkdtemp) and
assigns self.temp_dir before running operations that can raise (shutil.copytree
and pack_document), risking leaked temp files; change initialization so you
create the temp dir into a local variable first, perform
shutil.copytree(self.original_path, local_unpacked) and
pack_document(self.original_path, local_original_docx, validate=False), and only
after those succeed assign self.temp_dir, self.unpacked_path and
self.original_docx; alternatively wrap the steps in a try/except/finally that
removes the temp dir (shutil.rmtree) on any exception to guarantee cleanup.
Ensure references in the fix match symbols temp_dir, unpacked_path,
original_docx, shutil.copytree, pack_document and tempfile.mkdtemp.

ℹ️ Review info
⚙️ Run configuration

Configuration used: Path: .coderabbit.yaml

Review profile: CHILL

Plan: Pro

Run ID: cf370a97-45c9-4a45-84f6-0d14b9e58919

📥 Commits

Reviewing files that changed from the base of the PR and between 9dcd7d5 and 0e8dee8.

📒 Files selected for processing (1)
  • .opencode/skills/Utilities/Docx/Scripts/document.py

Comment on lines +710 to +715
try:
normalized_key = str(
file_path.resolve().relative_to(self.unpacked_path.resolve())
)
except ValueError:
raise ValueError(f"Path traversal detected: {xml_path}")

⚠️ Potential issue | 🟡 Minor

🧩 Analysis chain

🏁 Script executed:

#!/bin/bash
# Verify missing exception chaining in __getitem__
rg -n -C2 'except ValueError:\s*\n\s*raise ValueError\(f"Path traversal detected: \{xml_path\}"\)' .opencode/skills/Utilities/Docx/Scripts/document.py

Repository: Steffen025/pai-opencode

Length of output: 258


🏁 Script executed:

sed -n '705,720p' .opencode/skills/Utilities/Docx/Scripts/document.py

Repository: Steffen025/pai-opencode

Length of output: 872


Preserve original exception cause in traversal guard.

At line 715, the raised ValueError drops the original exception context from .relative_to(). Chain it with from err to preserve debugging information.

-        except ValueError:
-            raise ValueError(f"Path traversal detected: {xml_path}")
+        except ValueError as err:
+            raise ValueError(f"Path traversal detected: {xml_path}") from err
🧰 Tools
🪛 Ruff (0.15.5)

[warning] 715-715: Within an except clause, raise exceptions with raise ... from err or raise ... from None to distinguish them from errors in exception handling

(B904)

🤖 Prompt for AI Agents
Verify each finding against the current code and only fix it if needed.

In @.opencode/skills/Utilities/Docx/Scripts/document.py around lines 710 - 715,
The exception raised in the traversal guard loses the original cause; when
catching the ValueError from
file_path.resolve().relative_to(self.unpacked_path.resolve()) (the code
computing normalized_key), capture the original exception (e.g., except
ValueError as err) and re-raise the new ValueError(f"Path traversal detected:
{xml_path}") from err so the original traceback is preserved for debugging.

@Steffen025 Steffen025 merged commit 6d565ea into main Mar 16, 2026
4 checks passed
@Steffen025 Steffen025 deleted the release/v3.0-pr07-utilities-reorg-teil1 branch March 16, 2026 10:47