So much of agent traffic is repetitive. By adding in caching layers, you can both improve your user experience and decrease your bills in one fell swoop.
Caching
Caching is one of the simplest ways to reduce cost and improve responsiveness, but the details change across agents, APIs, data systems, and content delivery. This hub collects practical caching articles and examples.
Articles
Serving images from S3 worked for years, until it didn't. Here's how I moved image optimization out of my workflow and made performance the default.
I've been building APIs for years, but recently discovered the power of inferring data from user sessions for a better developer experience.
The location of your serverless app matters more than you might think. You could add significant latency and cost if you don't take it into consideration.
I built a serverless app that asks ChatGPT to write workouts for me every day. I couldn't be happier with the results.
A huge part of serverless API design is handling retries or accidental resubmits. Without it, data integrity goes out the window.
Whenever we talk about serverless, there's always the one person who brings up cold starts.
Designing applications for the right amount of scale is a huge architectural task. With serverless part of it is handled for you, but some of it you're on your own.