Feranmi's Thoughts

Sign in Subscribe

Topic

AI

A collection of 4 issues

LLM Cost Control in Production: Multi-Level Caching for AI Products

There's a moment every AI product builder hits, usually around week three of production traffic, where the OpenAI dashboard stops being exciting and starts being alarming. The spend curve is vertical. The unit economics don't work. And the painful realisation sets in that "call the

Two-Tier LLM Pipelines: Cost Firewalls for Production AI

The first time you check your OpenAI bill after a real traffic spike, something changes in you permanently. It's not the number itself it's the realisation that every engineering decision you made in development, every "just call the API" shortcut, every missing cache, is

Grounding AI in High-Stakes Domains: When the LLM Must Never Produce the Number

A few months ago I shipped two products within weeks of each other. One computes your Nigerian income tax from a bank statement. The other decides whether a small business gets a BNPL loan. Different domains, different users, different stakes — but the backend architecture converged on the same rule: the

Building a Reusable AI SDK

Recently, I’ve found myself building AI enabled applications, like the AI content optimization platform “https://buffbyteai.xyz/” and a Anaemia detection using computer models https://nailtechapp.netlify.app/, some of my major problems is managing the api keys for my LLM providers, dynamic prompts, variable prefill, AI persona’s,