loop: improve update_tests.sh reliability
Encouraging the agent to call the title tool immediately
has its drawbacks!
diff --git a/loop/testdata/agent_loop.httprr b/loop/testdata/agent_loop.httprr
index 1bd6c8b..d0535d3 100644
--- a/loop/testdata/agent_loop.httprr
+++ b/loop/testdata/agent_loop.httprr
@@ -1,9 +1,9 @@
httprr trace v1
-14412 1940
+14442 2108
POST https://api.anthropic.com/v1/messages HTTP/1.1
Host: api.anthropic.com
User-Agent: Go-http-client/1.1
-Content-Length: 14214
+Content-Length: 14244
Anthropic-Version: 2023-06-01
Content-Type: application/json
@@ -15,7 +15,7 @@
"content": [
{
"type": "text",
- "text": "What tools are available to you? Please just list them briefly.",
+ "text": "What tools are available to you? Please just list them briefly. (Do not call the title tool.)",
"cache_control": {
"type": "ephemeral"
}
@@ -448,24 +448,24 @@
Anthropic-Organization-Id: 3c473a21-7208-450a-a9f8-80aebda45c1b
Anthropic-Ratelimit-Input-Tokens-Limit: 200000
Anthropic-Ratelimit-Input-Tokens-Remaining: 200000
-Anthropic-Ratelimit-Input-Tokens-Reset: 2025-05-07T21:04:03Z
+Anthropic-Ratelimit-Input-Tokens-Reset: 2025-05-08T01:38:36Z
Anthropic-Ratelimit-Output-Tokens-Limit: 80000
Anthropic-Ratelimit-Output-Tokens-Remaining: 80000
-Anthropic-Ratelimit-Output-Tokens-Reset: 2025-05-07T21:04:06Z
+Anthropic-Ratelimit-Output-Tokens-Reset: 2025-05-08T01:38:39Z
Anthropic-Ratelimit-Requests-Limit: 4000
Anthropic-Ratelimit-Requests-Remaining: 3999
-Anthropic-Ratelimit-Requests-Reset: 2025-05-07T21:04:02Z
+Anthropic-Ratelimit-Requests-Reset: 2025-05-08T01:38:28Z
Anthropic-Ratelimit-Tokens-Limit: 280000
Anthropic-Ratelimit-Tokens-Remaining: 280000
-Anthropic-Ratelimit-Tokens-Reset: 2025-05-07T21:04:03Z
+Anthropic-Ratelimit-Tokens-Reset: 2025-05-08T01:38:36Z
Cf-Cache-Status: DYNAMIC
-Cf-Ray: 93c3aadcc98122de-SJC
+Cf-Ray: 93c53cdd7943cf6d-SJC
Content-Type: application/json
-Date: Wed, 07 May 2025 21:04:06 GMT
-Request-Id: req_011CNtr7wwfjVqgXeQpBjTz9
+Date: Thu, 08 May 2025 01:38:39 GMT
+Request-Id: req_011CNuD3jHzhgZkGcjWJdPL5
Server: cloudflare
Strict-Transport-Security: max-age=31536000; includeSubDomains; preload
Via: 1.1 google
X-Robots-Tag: none
-{"id":"msg_01CFPaREtZTNs6Ui1suT3SAG","type":"message","role":"assistant","model":"claude-3-7-sonnet-20250219","content":[{"type":"text","text":"I have access to the following tools:\n\n1. bash - Execute shell commands\n2. keyword_search - Search for files with specific terms\n3. think - Record thoughts and planning\n4. title - Set conversation title\n5. precommit - Create a git branch for tracking work\n6. done - Conclude task and complete checklist\n7. codereview - Run automated code review\n8. multiplechoice - Present options to the user\n9. Browser tools (navigate, click, type, etc.)\n10. patch - Modify files with precise text edits\n\nThese tools allow me to navigate codebases, execute commands, modify files, and interact with web interfaces to help you with your tasks."}],"stop_reason":"end_turn","stop_sequence":null,"usage":{"input_tokens":4,"cache_creation_input_tokens":3359,"cache_read_input_tokens":0,"output_tokens":155}}
\ No newline at end of file
+{"id":"msg_01HE6EfBKA7eRMNJBt73orhe","type":"message","role":"assistant","model":"claude-3-7-sonnet-20250219","content":[{"type":"text","text":"I can provide a brief list of the tools available to me:\n\n1. bash - Execute shell commands\n2. keyword_search - Search for files based on keywords \n3. think - Record thoughts and plans without external effects\n4. title - Set the conversation title\n5. precommit - Create git branches for tracking work\n6. done - Mark completion of tasks with a checklist\n7. codereview - Run automated code review\n8. multiplechoice - Present multiple choice options\n9. browser_* tools - A set of tools for browser interaction:\n - browser_navigate, browser_click, browser_type, browser_wait_for, \n - browser_get_text, browser_eval, browser_screenshot, browser_scroll_into_view\n10. patch - Make precise text edits to files\n\nThese tools allow me to help with various coding tasks, particularly in Go development."}],"stop_reason":"end_turn","stop_sequence":null,"usage":{"input_tokens":4,"cache_creation_input_tokens":3367,"cache_read_input_tokens":0,"output_tokens":199}}
\ No newline at end of file