How It Works Features Blog Platform Book a Demo
Case Study
Oracle Cloud Infrastructure

Scaling to 95B Daily Requests on Oracle Cloud Infrastructure

Partner Oracle
Category Infrastructure
Scale 95B Daily Requests
Published June 11, 2026
+40% Performance improvement
95B Daily ad requests
~50ms Typical response time

Infrastructure limits at the edge of scale

Quantive's platform sits in the critical path of every ad request our publishers make. Every bid, floor calculation, and demand decision has to happen in the few milliseconds before an impression renders. As our network grew toward tens of billions of daily requests, our infrastructure started showing its limits.

Latency spikes during peak hours, uneven scaling across regions, and rising compute costs were all making it harder to deliver the consistent, real-time performance our optimization engine depends on. We needed an infrastructure foundation that could scale elastically with traffic without sacrificing speed or reliability.

Built for latency-sensitive, high-throughput workloads

After evaluating several cloud providers, we migrated our core serving infrastructure to Oracle Cloud Infrastructure (OCI). A few factors made the difference for a workload like ours:

01

High-Performance Compute & Networking

Kept tail latency predictable even under heavy, bursty load. critical for real-time bidding at scale.

02

Generous, Low-Cost Egress

Bandwidth costs matter enormously when you're moving billions of requests a day. OCI's pricing model made the economics work.

03

Flat, Non-Oversubscribed Network

Reduced jitter between our services and bidding partners, improving consistency across every auction.

04

Regional Flexibility

Let us place compute closer to our publishers and demand partners, cutting round-trip times for globally distributed traffic.

A complete re-architecture with zero downtime

We treated the migration as an opportunity to re-architect from the ground up. The work happened in three phases:

01

Re-Architecting for Elasticity

We containerized our serving stack and rebuilt autoscaling around real-time request volume rather than fixed capacity.

02

Region-Aware Routing

Requests are now routed to the nearest healthy region, cutting round-trip times for globally distributed traffic.

03

Zero-Downtime Cutover

We shifted traffic gradually with shadow testing and progressive rollout, so publishers saw no disruption during the transition.

Immediate, measurable impact across every metric

By moving to OCI and re-architecting around it, we achieved a 40% improvement in overall platform performance. Our infrastructure now comfortably scales to 95 billion ad requests per day, with headroom to spare during peak events.

+40% Overall platform performance improvement
95B Daily ad requests handled
~50ms Typical response time
Most importantly for our publishers, response times stayed fast and consistent: under 100ms at the 99th percentile, with typical responses around 50ms.

Speed and scale translate directly into revenue

Faster responses mean we participate in more auctions before they time out. Reliable scaling means consistent performance during the traffic surges that come with big events. And a more efficient infrastructure means we can keep investing in the AI that drives our optimization, rather than in raw compute overhead.

The OCI migration is a foundation, not a finish line. It gives us the room to keep growing our network while holding the line on the sub-100ms performance our publishers count on.

"

"Migrating to OCI let us stop worrying about whether the infrastructure could keep up, and focus entirely on making smarter monetization decisions. Faster responses, massive scale, and lower cost - all at once."

— Quantive Engineering Team
Back to Blog