Category: Uncategorized
-
PTU Spillover and Token Tracking in Microsoft Foundry
Why spillover matters for cost attribution, chargeback, and observability in provisioned AI deployments. In this post, I’ll walk through how PTU spillover works in Microsoft Foundry, why it complicates token tracking, and what you…
-
Token-Level Chargeback for Azure OpenAI Batch API Using APIM, Event Hub, and Durable Functions
Batch APIs are fantastic for high-volume, asynchronous workloads—but they come with a challenge every architect eventually runs into: How do you track and charge back token consumption when thousands of completions are bundled into…