Rate limiting
AssessTechniques
A control technique that limits how often requests can be made in a given time window.
Why it's here
Placed in Assess: 3 article(s) of evidence from 2 source(s), led by open-source activity, with 1 in the last 30 days. Confidence 49%.
Evidence (3)
- 4InfoQ·6/8/2026open_sourceValkey Strategies for Faster Caching and Real-Time Workloads
Viktor Vedmich explains how engineering leaders can improve application performance with Valkey, highlighting its 100% API compatibility with Redis. He also covers caching patterns such as lazy loading and the use of Valkey data structures for real-time analytics, rate limiting, and session storage to help mitigate thundering herd issues.
- 6OpenAI Blog·2/13/2026framework_updateOpenAI scales access to Codex and Sora beyond simple rate limits
OpenAI describes a real-time access system that combines rate limits, usage tracking, and credits to provide continuous access to Codex and Sora. The approach is intended to manage demand more flexibly than fixed rate limits alone while keeping usage controlled.
- 7OpenAI Blog·1/22/2026open_sourceHow OpenAI scaled PostgreSQL for ChatGPT
OpenAI describes how it scaled PostgreSQL to handle millions of queries per second for ChatGPT. The post highlights the use of replicas, caching, rate limiting, and workload isolation to improve performance and reliability at very large scale.