Tag

Posts tagged ai

4 posts. Browse all tags

  1. Prompt Caching: The 30% You're Leaving on the Table

    Most developers optimize model choice, context size, and output quality, but never notice the 20-35% of their bill that disappears with one config flag. Here's what prompt caching actually is, how five major providers implement it differently, and what happens when your tooling silently breaks it.

  2. Bigger Context Windows Don't Make Better AI

    The arms race to 1M context windows assumes more context is better. Research from Chroma proved it makes things worse - and labs have been expanding context windows ever since.