The Agentic ToolbenchChapter 30 of 73April 26, 2026
Project Level Analytics Summary
I wish GPT/Codex gave me a project-level analytics summary. Not rate limits. Not token usage.
Something more like:
- How many prompts did I actually write to create this thing?
- How many separate threads did I start?
- How many times did I have to course-correct?
- Which parts took the most back-and-forth?
- Where did the model nail it, and where did I have to keep pushing?

I was working on a project all day and realized I now have hundreds of threads. But that number does not really tell the story, because each thread has multiple prompts, fixes, refinements, and “no, that’s not what I meant” moments.
I would love to see a recap of the actual collaboration.