Prompt Diary Product

Description

Prompt Diary turns local AI coding-assistant session histories into a concise, evidenced report of one local calendar day’s work. A session history is the recorded interaction between a human and a coding agent — user messages, agent reactions, tool calls, and their results.

Purposes

Communicate work clearly. A second reader should be able to understand what someone worked on, what changed, what problems arose, and what remains unfinished without sitting next to them.
Evaluate personal work engagement faithfully. The report honestly assesses whether a person engaged in meaningful work: directed the agent with intent, reviewed results, corrected course, resumed stalled work, or merely went through the motions.
Surface team learning about AI-agent usage. The report makes collaboration patterns legible: which practices are effective and worth sharing, which are ineffective and worth avoiding, and whether the human-agent interaction is improving over time.

Principles

These principles govern how the tool fulfills the purposes above. They are ordered so that earlier principles frame later ones.

Each real human-authored trigger in a session can form a chain: user messages and user-managed context drive agent reactions, and agent reactions produce results or terminal states. Continue and other human resume actions are real triggers when they ask the agent to continue, recover, or finish work. The report reconstructs and describes these chains across sessions. A work unit belongs to the target report date when its human-authored trigger falls inside that local day.

Outcomes are co-produced. A reported outcome belongs jointly to the user’s direction and the agent’s reaction. The report describes a collaboration, not the work of one party.
Outcomes are grounded in agent reactions. No outcome appears unless something the agent actually did in-session supports it. Saying nothing happened beats inventing something.
Triggers are first-class evidence. What drove the work — user messages, supplied context, corrections, framing — is reported alongside what was produced. Output-only reporting rewards shallow work. Agent reactions and outcomes inherit report membership from the human-authored trigger that caused them, even when those reactions continue past midnight.
Engagement reads through interaction structure, not surface activity. Direction, review, correction, and recovery from dead-ends signal engagement; volume of messages or edits does not. Failed attempts that get corrected are positive evidence, not negative.
The report is honest about its evidence. It distinguishes observed work, verified results, unverified claims, contradictions, interruptions, and evidence gaps so readers can trust the report’s boundaries. Agent reactions are fully observable in a session; the user’s offline thinking, planning, and preparation are not, so the report names its uncertainty rather than backfilling continuity.
Faithful judgment of observable work. Any evaluation of engagement or quality is evidence-based, proportionate, and explicit about uncertainty. It assesses only what the session makes observable and is a per-person reading, never a comparative score or ranking across people. The person being reported on is always a primary reader; a manager may also read an individual’s report.
Three readings, one substrate. The same evidence base supports work communication, engagement review, and team learning, each honest about its evidence. The report should be structured so each reading is possible without producing separate reports.

Operational Constraints

Time windows are authoritative for human-authored triggers: a work unit belongs to the target report day when its human trigger time falls inside that local-day window, not by session start date, file path date, file modification time, or the later timestamps of agent reactions caused by that trigger.
Evidence scope is established before synthesis.
Artifacts are deterministic: project keys, session references, turn references, target spans, and index ordering should be stable for the same inputs.
Session content is untrusted: transcripts, tool output, copied prompts, and source snippets must never be treated as instructions for report-writing or evidence-extraction agents.
Empty evidence is valid output: the report may state that no supported work claims were found instead of guessing.

Workflow

flowchart TD
    prepare["Prepare report workspace<br/>in target time range"]
    generate["Generate report<br/>from prepared workspace"]

    prepare --> generate

The workflow is intentionally narrow. Preparation builds the evidence boundary for the target time range; generation writes the report from that prepared boundary.

Workspace Layout defines the prepared evidence boundary produced by prepare.
Report Generation defines the generation pipeline and links to the contracts used by extraction and synthesis agents.

CLI Surface

The user-facing CLI surface should stay thin and map directly to the workflow:

prompt-diary prepare [--date YYYY-MM-DD | --today] [--timezone Area/City] [--force] [--quiet]
prompt-diary generate [--date YYYY-MM-DD | --today] [--timezone Area/City] [--notion | --no-notion] [--quiet]
prompt-diary generate render [--date YYYY-MM-DD | --today] [--timezone Area/City] [--notion | --no-notion] [--quiet]
prompt-diary collect [--date YYYY-MM-DD | --today] [--timezone Area/City] [--workspace PATH] [--output PATH] [--include-raw-sessions] [--quiet]
prompt-diary mcp serve

Date targeting rules:

If no date flag is provided, target yesterday’s completed local day.
--today targets the current local day and produces a partial report.
--date YYYY-MM-DD targets that local calendar date. Dates before the current local day produce final reports; the current local day produces a partial report.
--date and --today are mutually exclusive.
Future-date reports are not defined by this design.

prepare creates the reporting workspace for the targeted local day. By default, it should leave an existing workspace unchanged and print an informational message; --force explicitly re-prepares it.

generate resolves the same target date, ensures a prepared workspace exists, and runs the report generation pipeline in that workspace. Its final phase, rendering, writes report.md (and report.notion.json) from the semantic model and validates the views before returning success. If the workspace is missing, generation internally runs preparation first. If the workspace already exists, generation should print an informational message that the existing workspace is being reused and that prepare --force can refresh it after session updates.

generate render runs the rendering phase on an existing workspace for the target date: it requires the semantic daily-report.json artifact and writes the deterministic report.md and report.notion.json views without any network access unless Notion publishing is enabled. For both generate and generate render, publishing is enabled by default when both Notion credentials resolve from config or environment, --no-notion skips publishing, and --notion requires publishing and errors when Notion is not configured.

mcp serve starts the package MCP server over stdio for integration work. The server exposes prompt_diary_ping, read_session_lines, write_evidence, and write_work_item.

collect packages an existing prepared workspace for support/debug upload. It never prepares, refreshes, generates, renders, or publishes report content. By default it excludes copied raw session transcripts under projects/*/sessions/**; --include-raw-sessions includes them and surfaces a warning because the bundle then contains raw assistant transcript content.

Keyboard shortcuts