loop: do slug generation outside the agent loop [This commit message written entirely by a human; it is all useful.] We can make a slug based on the first message. It's good enough. That keeps it--and the slug tool--out of the agent's context. It's also one fewer step for extremely short Sketch runs, which is the straw that broke this particular camel's back. This is a mild UI regression, in that there's a slight stall after the user types their first message, during which the slug is being generated. See (2) below. While we're here, add handling of compaction agent messages. This leaves two big TODOs outstanding: 1. Untangle the awful rats nest that is slug and branch management; we have callbacks back and forth and layers and locking and it's all confusing. One visible for that this ugliness takes is that every time the agent tries out a slug, the top bar in the webui updates, even if we then reject that slug as a duplicate. there are other forms of ugliness, just less visible. 2. Make slug generation concurrent with the rest of the agent, to avoid a short stall right after the user's first request (ick). When we make slug setting concurrent, we'll likely need to resuscitate the bashPermissionCheck, except it'll be "silently block and wait for background slug generation to complete", rather than "reject the tool call". Ditto for about_sketch, and any other tool call that expects the slug or branch name to be set. Generally, before undertaking this effort, we should fix (1) above, make convos generally concurrency safe (maybe COW?), and figure out to get race-enabled innie builds. Co-Authored-By: sketch <hello@sketch.dev> Change-ID: s8ac5f6a9faa611ebk

commit: 3b44cc3120b595e82dcc2a6081b8a07fc24c739e [log] [tgz]
author: Josh Bleecher Snyder <josharian@gmail.com> Tue Jul 22 02:28:14 2025 +0000
committer: Josh Bleecher Snyder <josharian@gmail.com> Tue Jul 22 20:10:27 2025 -0700
tree: 3d4f3c2911390428c0c89aa8dcd13dbfb035ce31
parent: f18aafd177a90b928b1c1babbe5e763f85eab56d [diff] [blame]
diff --git a/loop/testdata/agent_loop.httprr b/loop/testdata/agent_loop.httprr
index 25aca32..b2ac507 100644
--- a/loop/testdata/agent_loop.httprr
+++ b/loop/testdata/agent_loop.httprr

@@ -1,9 +1,9 @@
 httprr trace v1
-20567 2526
+20094 2538
 POST https://api.anthropic.com/v1/messages HTTP/1.1

 Host: api.anthropic.com

 User-Agent: Go-http-client/1.1

-Content-Length: 20369

+Content-Length: 19896

 Anthropic-Version: 2023-06-01

 Content-Type: application/json

 

@@ -15,7 +15,7 @@
    "content": [
     {
      "type": "text",
-     "text": "What tools are available to you? Please just list them briefly. (Do not call the set-slug tool.)",
+     "text": "What tools are available to you? Please just list them briefly.",
      "cache_control": {
       "type": "ephemeral"
      }
@@ -189,22 +189,6 @@
    }
   },
   {
-   "name": "set-slug",
-   "description": "Set a short slug as an identifier for this conversation.",
-   "input_schema": {
-    "type": "object",
-    "properties": {
-     "slug": {
-      "type": "string",
-      "description": "A 2-3 word alphanumeric hyphenated slug, imperative tense"
-     }
-    },
-    "required": [
-     "slug"
-    ]
-   }
-  },
-  {
    "name": "commit-message-style",
    "description": "Provides git commit message style guidance. MANDATORY: You must use this tool before making any git commits.",
    "input_schema": {
@@ -592,7 +576,7 @@
  ],
  "system": [
   {
-   "text": "You are the expert software engineer and architect powering Sketch,\nan agentic coding environment that helps users accomplish coding tasks through autonomous analysis and implementation.\n\n\u003cworkflow\u003e\nStart by asking concise clarifying questions as needed.\nOnce the intent is clear, work autonomously.\nWhenever possible, do end-to-end testing, to ensure fully working functionality.\nAim for a small diff size while thoroughly completing the requested task.\nPrioritize thoughtful analysis and critical engagement over agreeability.\n\nCall the set-slug tool as soon as the topic of conversation is clear, often immediately.\n\nBreak down the overall goal into a series of smaller steps.\nUse the todo_read and todo_write tools to organize and track your work systematically.\n\nFollow this broad workflow:\n\n- Think about how the current step fits into the overall plan.\n- Do research. Good tool choices: bash, think, keyword_search\n- Make edits.\n- If you have completed a standalone chunk of work, make a git commit.\n- Update your todo task list.\n- Repeat.\n\nTo make edits reliably and efficiently, first think about the intent of the edit,\nand what set of patches will achieve that intent.\nThen use the patch tool to make those edits. Combine all edits to any given file into a single patch tool call.\n\nYou may run tool calls in parallel.\n\nComplete every task exhaustively - no matter how repetitive or tedious.\nPartial work, pattern demonstrations, or stubs with TODOs are not acceptable, unless explicitly permitted by the user.\n\nThe done tool provides a checklist of items you MUST verify and\nreview before declaring that you are done. Before executing\nthe done tool, run all the tools the done tool checklist asks\nfor, including creating a git commit. Do not forget to run tests.\n\n\n\nWhen communicating with the user, take it easy on the emoji, don't be over-enthusiastic, and be concise.\n\nDocker is available. Before running the docker command, start dockerd as a background process.\nAlways use --network=host when running docker containers.\n\u003c/workflow\u003e\n\n\u003cstyle\u003e\nDefault coding guidelines:\n- Clear is better than clever.\n- Minimal inline comments: non-obvious logic and key decisions only.\n\u003c/style\u003e\n\n\u003csystem_info\u003e\n\u003cplatform\u003e\nlinux/amd64\n\u003c/platform\u003e\n\u003cpwd\u003e\n/\n\u003c/pwd\u003e\n\u003c/system_info\u003e\n\n\u003cgit_info\u003e\n\u003cgit_root\u003e\n\n\u003c/git_root\u003e\n\u003cHEAD\u003e\nHEAD\n\u003c/HEAD\u003e\n\n\u003c/git_info\u003e\n\n",
+   "text": "You are the expert software engineer and architect powering Sketch,\nan agentic coding environment that helps users accomplish coding tasks through autonomous analysis and implementation.\n\n\u003cworkflow\u003e\nStart by asking concise clarifying questions as needed.\nOnce the intent is clear, work autonomously.\nWhenever possible, do end-to-end testing, to ensure fully working functionality.\nAim for a small diff size while thoroughly completing the requested task.\nPrioritize thoughtful analysis and critical engagement over agreeability.\n\nBreak down the overall goal into a series of smaller steps.\nUse the todo_read and todo_write tools to organize and track your work systematically.\n\nFollow this broad workflow:\n\n- Think about how the current step fits into the overall plan.\n- Do research. Good tool choices: bash, think, keyword_search\n- Make edits.\n- If you have completed a standalone chunk of work, make a git commit.\n- Update your todo task list.\n- Repeat.\n\nTo make edits reliably and efficiently, first think about the intent of the edit,\nand what set of patches will achieve that intent.\nThen use the patch tool to make those edits. Combine all edits to any given file into a single patch tool call.\n\nYou may run tool calls in parallel.\n\nComplete every task exhaustively - no matter how repetitive or tedious.\nPartial work, pattern demonstrations, or stubs with TODOs are not acceptable, unless explicitly permitted by the user.\n\nThe done tool provides a checklist of items you MUST verify and\nreview before declaring that you are done. Before executing\nthe done tool, run all the tools the done tool checklist asks\nfor, including creating a git commit. Do not forget to run tests.\n\n\n\nWhen communicating with the user, take it easy on the emoji, don't be over-enthusiastic, and be concise.\n\nDocker is available. Before running the docker command, start dockerd as a background process.\nAlways use --network=host when running docker containers.\n\u003c/workflow\u003e\n\n\u003cstyle\u003e\nDefault coding guidelines:\n- Clear is better than clever.\n- Minimal inline comments: non-obvious logic and key decisions only.\n\u003c/style\u003e\n\n\u003csystem_info\u003e\n\u003cplatform\u003e\nlinux/amd64\n\u003c/platform\u003e\n\u003cpwd\u003e\n/\n\u003c/pwd\u003e\n\u003c/system_info\u003e\n\n\u003cgit_info\u003e\n\u003cgit_root\u003e\n\n\u003c/git_root\u003e\n\u003cHEAD\u003e\nHEAD\n\u003c/HEAD\u003e\n\n\u003c/git_info\u003e\n\n",
    "type": "text",
    "cache_control": {
     "type": "ephemeral"
@@ -601,26 +585,26 @@
  ]
 }HTTP/2.0 200 OK

 Anthropic-Organization-Id: 3c473a21-7208-450a-a9f8-80aebda45c1b

-Anthropic-Ratelimit-Input-Tokens-Limit: 200000

-Anthropic-Ratelimit-Input-Tokens-Remaining: 184000

-Anthropic-Ratelimit-Input-Tokens-Reset: 2025-07-11T01:09:41Z

-Anthropic-Ratelimit-Output-Tokens-Limit: 80000

-Anthropic-Ratelimit-Output-Tokens-Remaining: 80000

-Anthropic-Ratelimit-Output-Tokens-Reset: 2025-07-11T01:09:42Z

+Anthropic-Ratelimit-Input-Tokens-Limit: 2000000

+Anthropic-Ratelimit-Input-Tokens-Remaining: 2000000

+Anthropic-Ratelimit-Input-Tokens-Reset: 2025-07-23T00:51:23Z

+Anthropic-Ratelimit-Output-Tokens-Limit: 400000

+Anthropic-Ratelimit-Output-Tokens-Remaining: 400000

+Anthropic-Ratelimit-Output-Tokens-Reset: 2025-07-23T00:51:27Z

 Anthropic-Ratelimit-Requests-Limit: 4000

 Anthropic-Ratelimit-Requests-Remaining: 3999

-Anthropic-Ratelimit-Requests-Reset: 2025-07-11T01:09:33Z

-Anthropic-Ratelimit-Tokens-Limit: 280000

-Anthropic-Ratelimit-Tokens-Remaining: 264000

-Anthropic-Ratelimit-Tokens-Reset: 2025-07-11T01:09:41Z

+Anthropic-Ratelimit-Requests-Reset: 2025-07-23T00:51:20Z

+Anthropic-Ratelimit-Tokens-Limit: 2400000

+Anthropic-Ratelimit-Tokens-Remaining: 2400000

+Anthropic-Ratelimit-Tokens-Reset: 2025-07-23T00:51:23Z

 Cf-Cache-Status: DYNAMIC

-Cf-Ray: 95d46a83bdfc16a2-SJC

+Cf-Ray: 963730571c2eeb29-SJC

 Content-Type: application/json

-Date: Fri, 11 Jul 2025 01:09:42 GMT

-Request-Id: req_011CQzLcTmJPByqxoE27HKVH

+Date: Wed, 23 Jul 2025 00:51:27 GMT

+Request-Id: req_011CRP2vuuf7xj5V5QqTaEuS

 Server: cloudflare

 Strict-Transport-Security: max-age=31536000; includeSubDomains; preload

 Via: 1.1 google

 X-Robots-Tag: none

 

-{"id":"msg_019kDFP5RwG2yYoYeTuPLXzZ","type":"message","role":"assistant","model":"claude-sonnet-4-20250514","content":[{"type":"text","text":"Here are the tools available to me:\n\n**File & Code Management:**\n- `bash` - Execute shell commands\n- `patch` - Modify files with precise text edits\n- `keyword_search` - Search for files in unfamiliar codebases\n\n**Planning & Organization:**\n- `think` - Take notes and form plans\n- `todo_read` / `todo_write` - Manage task lists\n- `done` - Complete work with verification checklist\n\n**Git & Code Review:**\n- `commit-message-style` - Get commit message guidance\n- `codereview` - Run automated code review\n\n**Browser Automation:**\n- `browser_navigate` - Navigate to URLs\n- `browser_click` - Click elements\n- `browser_type` - Type into inputs\n- `browser_wait_for` - Wait for elements\n- `browser_get_text` - Read page text\n- `browser_eval` - Execute JavaScript\n- `browser_scroll_into_view` - Scroll to elements\n- `browser_resize` - Resize browser window\n- `browser_take_screenshot` - Capture screenshots\n- `browser_recent_console_logs` - Get console logs\n- `browser_clear_console_logs` - Clear console logs\n\n**Utilities:**\n- `read_image` - Read and encode image files\n- `about_sketch` - Get help with Sketch functionality\n- `multiplechoice` - Present multiple choice questions"}],"stop_reason":"end_turn","stop_sequence":null,"usage":{"input_tokens":3,"cache_creation_input_tokens":4719,"cache_read_input_tokens":0,"output_tokens":329,"service_tier":"standard"}}
\ No newline at end of file
+{"id":"msg_01MsAHgr98YWT6BpHDitnPPM","type":"message","role":"assistant","model":"claude-sonnet-4-20250514","content":[{"type":"text","text":"Here are the tools available to me:\n\n**File & Code Operations:**\n- `bash` - Execute shell commands\n- `patch` - Edit files with precise text modifications\n- `keyword_search` - Search codebase by keywords and concepts\n\n**Project Management:**\n- `todo_read` / `todo_write` - Manage task lists\n- `think` - Record thoughts and planning\n- `done` - Complete work with verification checklist\n\n**Git & Code Quality:**\n- `commit-message-style` - Get git commit message guidance\n- `codereview` - Run automated code review\n\n**Browser Automation:**\n- `browser_navigate` - Navigate to URLs\n- `browser_click` - Click elements\n- `browser_type` - Type into inputs\n- `browser_wait_for` - Wait for elements\n- `browser_get_text` - Read page text\n- `browser_eval` - Execute JavaScript\n- `browser_scroll_into_view` - Scroll to elements\n- `browser_resize` - Resize browser window\n- `browser_recent_console_logs` - Get console logs\n- `browser_clear_console_logs` - Clear console logs\n- `browser_take_screenshot` - Take screenshots\n\n**Utilities:**\n- `read_image` - Read and encode image files\n- `multiplechoice` - Present multiple choice questions\n- `about_sketch` - Get help with Sketch functionality"}],"stop_reason":"end_turn","stop_sequence":null,"usage":{"input_tokens":3,"cache_creation_input_tokens":4606,"cache_read_input_tokens":0,"output_tokens":323,"service_tier":"standard"}}
\ No newline at end of file
commit	3b44cc3120b595e82dcc2a6081b8a07fc24c739e	[log] [tgz]
author	Josh Bleecher Snyder <josharian@gmail.com>	Tue Jul 22 02:28:14 2025 +0000
committer	Josh Bleecher Snyder <josharian@gmail.com>	Tue Jul 22 20:10:27 2025 -0700
tree	3d4f3c2911390428c0c89aa8dcd13dbfb035ce31
parent	f18aafd177a90b928b1c1babbe5e763f85eab56d [diff] [blame]