docs: design for bundled-prompts overhaul

Externalize all prose prompts to editable files, collapse system+agent, add an inline roadblock protocol detected by StreamAnalyzer. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>
2026-06-04 13:20:24 +02:00
parent 75aa42b877
commit ad7d74820a
1 changed files with 231 additions and 0 deletions
--- a/docs/superpowers/specs/2026-06-04-bundled-prompts-overhaul-design.md
+++ b/docs/superpowers/specs/2026-06-04-bundled-prompts-overhaul-design.md
@@ -0,0 +1,231 @@
 # Bundled Prompts Overhaul — Design
 Date: 2026-06-04
 ## Goal
 Replace ClaudeDo's bundled prompts with a clean, professional baseline and make
 every prose prompt a user-editable file with a bundled default. Add a roadblock
 protocol so an autonomous run can flag problems mid-task without aborting.
 The execution-side defaults (`system.md`) ship as a moderate, **project-agnostic**
 engineering baseline — ClaudeDo users run tasks against their *own* repos, so no
 ClaudeDo-specific rules belong there. Everything is in English (tighter
 tokenization, more reliable instruction-following); the only German output is the
 weekly report, which a human reads.
 ## File layout
 All prompts live under `~/.todo-app/prompts/` as editable files with bundled
 defaults seeded by `PromptFiles.EnsureExists` (which never overwrites a file the
 user already has). The `system` + `agent` prompts collapse into one `system.md`;
 the old `agent`/manual distinction was removed when tags were retired.
 | File | Replaces | Placeholders |
 |---|---|---|
 | `system.md` | system + agent (merged) | — |
 | `planning-system.md` | planning system prompt | — |
 | `planning-initial.md` | "analyze & break down" kickoff | `{title}`, `{description}` |
 | `retry.md` | "try again and fix" prompt | — |
 | `daily-prep.md` | daily-prep prompt | `{date}`, `{maxTasks}` |
 | `weekly-report.md` | weekly-report instructions | `{start}`, `{end}` |
 The task-execution prompt (title + description + `## Sub-Tasks` checkboxes) stays
 assembled in code — it is data-shaped, not prose.
 ### Templating
 `PromptFiles` gains `Render(PromptKind kind, IReadOnlyDictionary<string,string> values)`
 that replaces **only** the known named tokens for that kind. Any other `{...}` in
 the file (e.g. the literal `{Wochentag}` / `{dd.MM.yyyy}` in the German report
 rules) passes through untouched. Daily-prep tool names are inlined as literals —
 `--allowedTools` already carries the real names, and inlining keeps the file from
 silently breaking if a user edits a placeholder.
 ### Migration
 `EnsureExists` keeps its current semantics: it seeds a default only when the file
 is missing, never overwriting user edits. The old `planning.md` and `agent.md`
 become inert — `TaskRunner` stops reading `agent.md`, and the planning system
 prompt now reads `planning-system.md`. Old files are harmless to leave or delete.
 `PromptKind` changes: `Agent` is removed; `Planning` maps to `planning-system.md`;
 new kinds `PlanningInitial`, `Retry`, `DailyPrep`, `WeeklyReport` are added.
 ## Roadblock protocol
 An autonomous run has no human watching, so it must not silently stop or block on
 a question. Instead the agent emits an inline marker whenever it hits a true
 blocker, **any number of times**, and keeps working on whatever it still can.
 - **Prompt side** (`system.md`): instruct the agent to write
  `CLAUDEDO_BLOCKED: <one short sentence>` on its own line whenever something
  genuinely prevents progress (missing credentials, contradictory requirements, a
  destructive action it won't take unasked) — then continue with the rest of the
  task. Reserved for true blockers, not routine decisions it can make itself.
 - **Detection** (`StreamAnalyzer`): as `assistant` messages stream, scan their
  text content for lines matching `^CLAUDEDO_BLOCKED:` and collect each reason
  into an ordered list (`Blocks`). This is live and cumulative — multiple problems
  across one run are all captured, not just the last.
 - **Result wiring** (`StreamResult` → `RunResult` → run record): carry the
  collected `Blocks`. Strip the marker lines from the displayed result text.
 - **Routing**: a run that finishes with blocks still goes to `WaitingForReview`
  (standalone tasks) — it is "done as far as the agent could get". The review card
  shows a ⚠ roadblock hint listing the collected problems. The user answers them
  via the existing reject-rerun feedback path, which resumes the session with the
  answers as the next-turn prompt — so the agent continues with the problems
  resolved rather than restarting.
 ## The prompts
 ### `system.md`
 ```markdown
 # Working Agreement
 You are completing one well-defined task autonomously in a git repository.
 ## Scope
 - Do exactly what the task asks — no unrequested refactors, renames, dependency
  changes, or "while I'm here" cleanup.
 - If intent is ambiguous, state the assumption you're making and proceed with the
  most reasonable reading. Stop only if you genuinely cannot move forward.
 - Prefer three similar lines over a premature abstraction. Don't build for
  hypothetical future needs.
 ## Working in the repo
 - Read a file before editing it. Match the conventions already in this codebase —
  they override generic defaults.
 - Prefer editing existing files to creating new ones. Don't write comments that
  just restate the code.
 - Validate only at real boundaries (user input, external APIs).
 ## Finishing
 - Before claiming done, verify: run the build and relevant tests, confirm they
  pass, and report what you ran. If you couldn't verify something, say so plainly.
 - Make focused commits using the repository's existing commit-message convention.
 ## Safety
 - Never force-push, hard-reset, or delete branches/files beyond the task's scope
  without being asked.
 - Don't introduce injection/XSS/secret-leak issues. Never commit credentials.
 ## You are running unattended
 You run autonomously with no human watching. There is no one to answer mid-task
 questions, so never stop to ask — make the most reasonable decision, note the
 assumption, and continue.
 ## When you are blocked
 If something genuinely prevents you from completing part of the task (missing
 credentials, contradictory requirements, a destructive action you won't take
 unasked), do NOT silently give up. Write this marker on its own line, then keep
 working on whatever else you can:
 CLAUDEDO_BLOCKED: <one short sentence describing what blocked you>
 Emit it as many times as needed — once per distinct blocker. Use it only for true
 blockers, not for routine decisions you can make yourself.
 ```
 ### `planning-system.md`
 ```markdown
 You are the planning assistant for ClaudeDo. Your job is to break a task into
 smaller, independently executable subtasks — the session ends by creating those
 subtasks.
 Start every session by invoking the `superpowers:brainstorming` skill (Skill
 tool) and follow it end to end: clarifying questions one at a time, then 2–3
 approaches with a recommendation, then a short design. Do not create any subtasks
 until the user has approved the design.
 You can ONLY shape this task's plan — you cannot edit files or touch other tasks.
 The tools available to you are: CreateChildTask, ListChildTasks, UpdateChildTask,
 DeleteChildTask, UpdatePlanningTask, and Finalize. Use nothing else.
 Once the design is approved, create the child tasks with CreateChildTask, then
 call Finalize. Keep each subtask concrete and self-contained with a clear
 done-state, ordered so dependencies come first.
 ```
 ### `planning-initial.md`
 ```markdown
 # Task to plan: {title}
 {description}
 ```
 ### `retry.md`
 ```markdown
 The task did not complete on the previous attempt — you may have run out of
 turns, hit an error, or stopped before finishing.
 Review the work already done in this session and the current state of the
 repository, identify what is still incomplete or broken, and finish the task.
 Don't restart from scratch or repeat a failed approach. Verify the result
 (build + tests) before you stop.
 ```
 Self-contained — no error injection. The runner appends the captured process
 output **only when it is a genuine error** (i.e. not the generic
 `"Claude exited with code N and no result."` fallback), since real session errors
 are already in the resumed context.
 ### `daily-prep.md`
 ```markdown
 You are preparing my workday for {date}.
 1. Call mcp__claudedo__get_daily_prep_candidates.
 2. Keep tasks already marked MyDay (currentMyDay) — never remove them.
 3. Fill MyDay to at most {maxTasks} open tasks TOTAL (currentMyDay counts). Never exceed it.
 4. Estimate each candidate's effort and pick a feasible mix — not only big items.
   Prioritize isStarred, due (scheduledFor), and older tasks.
 5. Place related tasks next to each other using consecutive sortOrder values.
 6. Apply via mcp__claudedo__set_my_day(taskId, true, sortOrder). Never mark anything
   outside the candidate list.
 If there are no candidates, do nothing.
 ```
 ### `weekly-report.md`
 ```markdown
 You are generating a concise weekly standup report for a software developer,
 covering {start} to {end}.
 Rules:
 - Write the ENTIRE report in German.
 - Group by day. One "## {Wochentag}, {dd.MM.yyyy}" section per day that has
  activity (German weekday names). Omit days with no activity.
 - Within each day: 3–5 first-person, past-tense bullets ("- Habe X umgesetzt",
  "- Y behoben"). Merge related small work into one bullet.
 - Drop trivia: typo fixes, pure exploration, false starts, tooling/log noise.
 - Blend the developer's own notes and the derived activity into ONE deduplicated
  bullet list per day. The notes are authoritative — never omit or contradict them.
 - Name the project/repo when it adds clarity.
 - Output ONLY the dated sections. No preamble, no intro, no closing remarks.
 Two sections follow below: an activity log derived from Claude session history,
 and the developer's own notes. Base the report on both; the notes are
 authoritative where they conflict with the derived activity.
 ```
 ## Touch points
 - `src/ClaudeDo.Data/PromptFiles.cs` — new `PromptKind` members, new defaults,
  `Render` helper.
 - `src/ClaudeDo.Worker/Runner/TaskRunner.cs` — stop reading `agent.md`; use
  `retry.md`; conditional stderr append on retry; carry/route `Blocks`.
 - `src/ClaudeDo.Worker/Runner/StreamAnalyzer.cs` — scan assistant text for
  `CLAUDEDO_BLOCKED:` markers, collect `Blocks`, strip from result.
 - `src/ClaudeDo.Worker/Runner/ClaudeProcess.cs` / `RunResult` — carry `Blocks`.
 - `src/ClaudeDo.Worker/Planning/PlanningSessionManager.cs` — read
  `planning-system.md` and `planning-initial.md` via `PromptFiles.Render`.
 - `src/ClaudeDo.Worker/Prime/DailyPrepPrompt.cs` — read `daily-prep.md`.
 - `src/ClaudeDo.Worker/Report/WeekReportPromptBuilder.cs` — read `weekly-report.md`.
 - UI — review card shows the ⚠ roadblock hint with collected problems.
 - `src/ClaudeDo.Ui/.../FilesSettingsTabViewModel.cs` — expose the new prompt files.
 - Tests — `PromptFiles` render/seed; `StreamAnalyzer` marker collection; planning/
  prep/report builders read from files.
 ## Out of scope
 - The in-code task-execution assembly (title/description/subtasks) is unchanged.
 - `ResultSchema` / `--output-schema` remains untouched.
 - No change to commit-message templating.
 ```