Log retention storage cost: steady-state GB-month explained

Reviewed by CloudCostKit Editorial Team. Last updated: 2026-01-27. Editorial policy and methodology.

Start with a calculator if you need a first-pass estimate, then use this guide to validate the assumptions and catch the billing traps.


Log bills usually separate ingestion (GB ingested) from retention/storage (GB-month). Retention is the lever that quietly turns a stable ingestion rate into a growing long-term cost.

The steady-state model (the key idea)

If you retain logs for N days and ingest roughly GB/day, then once you've been logging longer than N days, the stored dataset is roughly:

  • Stored GB ≈ GB/day x N

Storage cost is then stored GB x $/GB-month (using your provider billing model).

Retention tiers (hot vs cold)

A common pattern is to keep a short "hot" window for interactive troubleshooting and a longer "cold" window for compliance or rare investigations:

  • Hot: 7-14 days at full fidelity (fast queries)
  • Cold: 30-180 days archived or stored cheaply (slower retrieval)

Worked example (sanity-check your model)

Suppose you ingest 10 GB/day and retain logs for 30 days. Once you reach steady state, stored logs are roughly 300 GB. If storage is priced per GB-month, you can estimate monthly storage cost as 300 GB x ($/GB-month), then add ingestion and any query/scan costs separately.

  • If you increase retention from 30 days to 90 days at the same ingest rate, stored GB roughly triples.
  • If you reduce ingest (sampling, filtering, structured logs), retention cost falls linearly too.

Do not forget query/scan behavior

Storage is only one part of "retention cost". If your team frequently searches or scans logs (especially during incidents), scan/search can become a second spike. Keep scan volume and search frequency as explicit inputs.

Tools: Log scan/search cost, Log cost overview.

Fast levers (reduce cost without losing signal)

  • Shorten retention for low-value debug logs; keep longer retention for sparse high-value security/audit logs.
  • Sample noisy logs and reduce per-request log bytes (structured fields, fewer stack traces).
  • Move cold retention to cheaper tiers/object storage if your provider supports it.
  • Set defaults in IaC to prevent retention drift across new log groups/services.

Common pitfalls

  • Retention drift: default policies slowly expand across more log groups.
  • High-volume debug logs: long retention on verbose logs dominates storage.
  • Mixing units: GB vs GiB across dashboards and pricing tables.

Tools

Sources


Related guides

Log costs explained: ingestion, retention, scan, and incident spikes
Estimate log costs from ingestion volume, retention, and query or scan fees, with practical guidance for noisy sources, retention tiers, and incident-time query spikes.
Azure Log Analytics pricing: ingestion, retention, and query costs
A practical model for Log Analytics-style costs: GB ingested, retention storage, and query/scan behavior. Includes a method to estimate log GB from event rate and payload size, plus a validation checklist for high-volume sources.
CloudFront logs cost: estimate storage, retention, and queries
How to estimate CloudFront log costs: log volume (GB/day), retention (GB-month), and downstream query/scan costs (Athena/SIEM). Includes practical cost-control levers.
Cloud Logging pricing (GCP): ingestion, retention, and query scans
A practical model for Cloud Logging costs: GB ingested, retention storage (GB-month), and query/scan behavior. Includes a fast method to estimate GB/day from events/sec × bytes/event and a checklist to find dominant sources.
Estimate ECR storage (GB-month) from images and retention
How to estimate container registry storage cost: average image size, push frequency, retention window, multi-arch duplication, and a workflow to validate your estimate.
Estimate RDS backup storage (GB-month) from retention and churn
A practical method to estimate RDS backup storage (GB-month): start from daily changed data, retention days, and sanity-check with snapshot sizes. Includes common mistakes that inflate backup cost.

Related calculators


FAQ

Why does retention increase costs so much?
Storage is billed per GB-month. If ingestion stays flat, longer retention multiplies the steady-state stored dataset size.
What is the steady-state model for retained logs?
If you retain N days of logs and ingest GB/day consistently, the retained dataset is roughly GB/day x N once you've been logging longer than N days.
How do I reduce storage cost without losing signal?
Shorten hot retention for low-value logs, keep longer retention for sparse high-value logs, and archive cold logs to cheaper object storage if needed.

Last updated: 2026-01-27. Reviewed against CloudCostKit methodology and current provider documentation. See the Editorial Policy .