💪 It did not disappoint. 👉TL:DR - It made all AI use cheaper in my environment, made me more efficient, and it increased my memory system usage by almost half. It should be no secret to anyone here: I don't count tokens. 🪙 I'm on a Pro Max plan and I go where the work, and the passion takes me. But I have used Fable before, and I know about the token burn! So I came up with a plan! I pointed Fable at my own setup and pulled the report. 😅 I'm glad I did. Turns out it could've been cheaper and cleaner the whole time. I didn't ask Fable what it thinks. I pointed it at the real thing. My hooks, my handoff files, my subagent config, the actual token counts on disk. Two questions: where does the money go? & how to we spend less? 💡The answers. A silent forgotten tax on every message, A safety hook was injecting around 800 tokens into every prompt I sent. Repeated rules I already load once at the top. And a file that runs at every session start called itself " 450 tokens" in its own header. It was not... it was 7,000. My actual face when I saw this--->🤬 a few moments later--->😆 (Apparently, it's not only AI that is bad at counting sometimes!) My own file lied to me, and I'd read past that number a hundred times with a smile on my face! My long sessions. The ones I'm proudest of. Turns out that hour six is my most expensive and also least impactful at the same time. 💩 I know that models follow instructions worse at higher context. And because the quality was still there, the marathon I read as momentum was being billed to me at top rates for the weakest output of the day. 🙄 My helper agents were all running on the flagship (Most expensive model). These are the agents Claude spins up to send off to do tasks to get more accomplished in a shorter amount of time! This I knew about, and it was by choice, it's my environment and I never hit my 5 hour or weekly cap, so I did not care about this for myself, bigger = better right? Use Fable 5 on UltraCode in my environment, and we find out that logic was wrong....