Azure Functions pricing: what to include in a realistic estimate

Serverless bills are rarely "just compute". A solid Azure Functions estimate separates measurable drivers: invocations, duration, data transfer, and logs. If you do one thing, do this: model baseline + peak instead of one average month.

0) Pick the right unit of analysis (what triggers what)

Before you do any math, decide what actually drives executions. Many teams accidentally use "API requests" when the real driver is "queue messages" or "cron jobs".

  • HTTP-triggered: driver is request volume (RPS, requests/month).
  • Queue/topic/event-triggered: driver is message/event rate (events/s, events/day).
  • Timer/cron-triggered: driver is schedule count (runs/day).
  • Fan-out patterns: one trigger can create N downstream invocations (the hidden multiplier).
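The fan-out multiplier above is easy to miss, so it helps to write it down explicitly. A minimal sketch, where every number is an illustrative assumption rather than a measured value:

```python
# Hypothetical fan-out: one HTTP request enqueues work that triggers
# several downstream function invocations.
http_requests_per_month = 10_000_000   # assumed driver volume
fanout_per_request = 3                 # each request spawns 3 queue messages

downstream_invocations = http_requests_per_month * fanout_per_request
total_invocations = http_requests_per_month + downstream_invocations

print(total_invocations)  # 40,000,000 invocations, not 10M
```

Picking "API requests" as the driver here would undercount executions by 4x.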

1) Invocation volume (requests/month)

Convert your driver into monthly invocations. For HTTP triggers, convert RPS to requests/month. For event-driven triggers, convert events/s to events/month, then multiply by the fan-out factor from step 0.


  • Keep a separate line for retries. An average of one retry per request doubles both invocations and downstream dependency calls.
  • Include non-user traffic (health checks, bots, cron pings) if it hits your functions.
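The RPS-to-monthly conversion, including the retry and non-user-traffic lines above, can be sketched as follows. The 730-hours-per-month figure is a common billing convention; the input rates are assumptions for illustration:

```python
SECONDS_PER_MONTH = 730 * 3600  # ~730 hours/month, a common billing convention

def monthly_invocations(rps: float, retry_rate: float = 0.0,
                        non_user_rps: float = 0.0) -> float:
    """Convert a steady request rate into monthly invocations.

    retry_rate is extra executions per original request
    (0.05 means 5% of requests are retried once).
    """
    base = (rps + non_user_rps) * SECONDS_PER_MONTH
    return base * (1 + retry_rate)

# Assumed inputs: 50 RPS user traffic, 2 RPS health checks, 5% retries
print(round(monthly_invocations(50, retry_rate=0.05, non_user_rps=2)))
```

Keeping retries and non-user traffic as explicit parameters makes it obvious which line item to revisit when measured data arrives.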

2) Duration and memory (the compute multiplier)

Duration is the multiplier that turns invocations into compute. Model at least two percentiles (p50 and p95) so you can capture normal behavior and slow-path behavior. Slow paths are usually caused by upstream latency (DB, APIs), cold starts, and throttling.

  • Track work per invocation: CPU-bound work scales differently than I/O-bound waiting.
  • If you batch messages, model both message rate and messages per invocation.
  • If you use durable/workflow functions, treat orchestration and activity calls as separate invocation buckets.
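One way to model the p50/p95 split is to treat the workload as two buckets: a fast path and a slow tail. The 90/10 split, durations, and memory figure below are illustrative assumptions, and the GB-seconds metric is a consumption-style compute measure (memory times execution time):

```python
def gb_seconds(invocations: float, duration_s: float, memory_gb: float) -> float:
    """Consumption-style compute metric: memory * execution time, summed."""
    return invocations * duration_s * memory_gb

# Assumed 90/10 split between the fast path (near p50) and the slow tail
fast = gb_seconds(invocations=9_000_000, duration_s=0.2, memory_gb=0.5)
slow = gb_seconds(invocations=1_000_000, duration_s=2.0, memory_gb=0.5)
total = fast + slow

print(total)  # GB-s per month
```

Note that the 10% slow tail contributes more compute than the 90% fast path, which is exactly why a single average duration misleads.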

3) Networking and payload size (egress is often the surprise)

Outbound data transfer can become a major driver when functions call external APIs, return large responses, or stream data. Estimate response size and outbound volume separately from compute so you can see what really dominates.


  • Model external calls as their own line item (API calls, DB calls) because retries amplify them.
  • Split traffic by destination if needed: same-region, cross-region, and internet have different billing behavior.
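Estimating egress separately from compute is simple arithmetic; the point is keeping it as its own line. A sketch with assumed numbers (10M responses averaging 50 KB):

```python
def egress_gb(invocations: float, avg_response_bytes: float) -> float:
    """Outbound transfer volume in GB (binary GB, 1024**3 bytes)."""
    return invocations * avg_response_bytes / 1024**3

# Assumed: 10M invocations/month, 50 KB average response
print(round(egress_gb(10_000_000, 50 * 1024), 1))
```

Run the same function per destination bucket (same-region, cross-region, internet) when the split matters, since each bills differently.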

4) Logs and observability (ingestion + retention + scan)

Logging scales with volume. If each invocation logs a few KB and you have millions of invocations, log ingestion can exceed compute. Model ingestion, retention storage, and any query/scan cost separately.


  • Separate platform logs (access/diagnostics) from application logs (structured logs).
  • Keep a peak scenario: incident logging often increases volume by 2-10x.
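The baseline and incident scenarios above can be modeled with one function. The per-invocation log size and the 5x incident multiplier are assumptions for illustration; sample real payloads before relying on them:

```python
def log_gb_per_month(invocations: float, bytes_per_invocation: float,
                     incident_multiplier: float = 1.0) -> float:
    """Monthly log ingestion in GB; incident_multiplier models 2-10x spikes."""
    return invocations * bytes_per_invocation * incident_multiplier / 1024**3

baseline = log_gb_per_month(10_000_000, 4 * 1024)        # assumed ~4 KB/invocation
incident = log_gb_per_month(10_000_000, 4 * 1024, 5.0)   # assumed 5x incident spike

print(round(baseline, 1), round(incident, 1))
```

Retention storage and query/scan cost then multiply off these GB figures, so getting ingestion right comes first.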

Worked estimate template (copy/paste)

  • Invocations/month = baseline + peak (include retries and fan-out)
  • Compute (GB-s) = invocations/month * avg duration in seconds (model p50 and p95 separately) * memory in GB
  • Egress GB/month = invocations/month * avg response size in GB + external API egress
  • Log GB/month = invocations/month * avg log bytes/invocation (sample real payloads), converted to GB
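The template above, filled in end to end with assumed inputs (every number below is illustrative, not a benchmark):

```python
# Worked example combining the four template lines; all inputs are assumptions.
inv = 40_000_000                      # invocations/month incl. retries + fan-out
compute_gbs = inv * 0.3 * 0.5         # avg duration 0.3 s, 0.5 GB memory
egress = inv * 20 * 1024 / 1024**3    # avg response 20 KB
logs = inv * 2 * 1024 / 1024**3       # avg 2 KB of logs per invocation

print(f"compute={compute_gbs:,.0f} GB-s, egress={egress:.1f} GB, logs={logs:.1f} GB")
```

Run it twice, once with baseline inputs and once with peak inputs, to get the two scenarios the introduction asks for.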

Common pitfalls

  • Using one average duration instead of modeling the slow path (p95 matters).
  • Ignoring retries/timeouts, which multiply invocations, duration, and dependency calls.
  • Putting secret fetches or config calls on the hot path (see Key Vault guide).
  • Returning large payloads without modeling transfer/egress separately.
  • Verbose logs per invocation (log ingestion dominates at scale).

How to validate

  • Measure invocations and duration over a representative window (baseline + peak).
  • Validate retry rates and timeout behavior during incident windows.
  • Sample real log payload size and multiply by invocation volume.
  • Validate response sizes/top endpoints by bytes (avoid blended averages).

FAQ

What usually drives Azure Functions cost?
Invocation count and execution duration are the core drivers, but networking and logging can dominate for chatty or high-volume workloads.
How do I estimate quickly?
Estimate monthly invocations and execution time, then add separate lines for outbound transfer and log ingestion/retention.
Why do Functions bills spike during incidents?
Retries, timeouts, and upstream latency multiply invocations and duration. Verbose logging during incidents can add a second spike.
How do I validate the estimate?
Measure a representative window, then validate retries/timeouts, cold start behavior, and log bytes per invocation.

Last updated: 2026-01-27