Case Study

How We Helped an EdTech Platform Scale 50x

EdTech
Industry
Education / EdTech - Not-for-Profit
Services
Cloud & DevOps, Architecture Design, Backend Development, Frontend Development, Quality Assurance
Company Size & Location
Not-for-Profit Organisation & India
Technology Stack
PHP (CodeIgniter, Laravel) · Kotlin · MySQL (AWS RDS) · Redis (ElastiCache) · Amazon ALB · AWS S3 · Amazon CloudFront · AWS SES · Twilio · Cloudflare
Team
Solution Architect · DevOps Engineer · Backend Developers · Frontend Developers · QA Engineer
Timeline
Project-Based Engagement
01

Client Vision

QUEST Alliance is a not-for-profit focused on research-led innovation in teaching and learning. Its digital learning platform, the Quest App available on web and mobile, and supported by Accenture helps young people across the country build 21st-century employability skills, with the goal of reaching 1 million learners and helping them develop the skills, critical awareness, and confidence to seize opportunities and improve their communities.

With a major onboarding push of 250,000 users on the horizon, QUEST needed to be certain the platform could withstand the load. The catch: the existing application could handle only about 200 concurrent users and needed to scale to at least 10,000 - a 50x jump.

02

Challenge

Scaling an application 50x isn’t about adding servers - it’s about finding what breaks first. The Quest App was about to go from a few hundred concurrent users to tens of thousands, and the existing architecture wasn’t built for it.

A 50x Load Jump  

The application could handle only ~200 concurrent users but had to support at least 10,000, ahead of onboarding 250,000 new users.

Unknown Bottlenecks  

The existing system broke at higher loads for reasons that first had to be diagnosed across the database, web-server configuration, and infrastructure.

Availability During Releases  

Scaling couldn’t come at the cost of downtime; releases needed to be safe and low-risk.

DNS & CDN Interference  

Route 53 and Cloudflare introduced DNS issues, and Cloudflare caused load-testing timeouts at the 50x target.

03

Solution

Focaloid started with a technical analysis to find the bottlenecks, then implemented a targeted set of infrastructure, database, and web-server changes to carry the platform to 10,000+ concurrent users.

Diagnosed the Bottlenecks  

Ran a technical analysis of the existing system to identify exactly what was causing it to break under load.

Scaled the Database  

Upgraded the Amazon RDS instance from R5.Large to R5.Xlarge and updated database indexing for query performance.

Horizontal Auto-Scaling + Load Balancing  

Implemented horizontal auto-scaling (2 → 4 medium servers based on load) behind an Application Load Balancer, with round-robin instance switching and sticky sessions enabled.

Tuned the Web Servers  

Updated mpm_prefork.conf and MaxRequestWorkers across all servers to handle far more concurrent requests.

Offloaded File Storage  

Moved uploaded and generated files to Amazon S3.

04

Our Approach

Focaloid worked diagnosis-first, then changed only what the load demanded, and validated everything under stress.

Analyze  

Diagnosed the bottlenecks causing failures at higher loads, across database, web servers, and infrastructure.

Re-Architect for Scale  

Upgraded the database, added horizontal auto-scaling and load balancing, tuned web-server concurrency, and offloaded files to S3.

Deploy Safely  

Implemented Blue-Green deployment via ALB to cut downtime and release risk.

Load-Test & Resolve  

Ran load testing to the 50x target, identified and fixed the Cloudflare-related DNS and timeout issues, and validated capacity.

05

Result / Impact

For the Client

  • 50x scale achieved from ~200 to 10,000+ concurrent users
  • 10,000+ hits/second handled in load testing after the changes
  • Optimized cloud cost versus running 3–4 instances in parallel, through auto-scaling
  • Ready to onboard 250,000 users, toward the 1 million-learner goal

For Reliability & Performance

  • Improved availability and reduced downtime via Blue-Green deployment
  • Elastic capacity scaling 2 → 4 servers automatically based on load
  • Faster database performance through instance upgrade and indexing
  • Higher concurrency from tuned web-server configuration

For the Mission

  • A platform that can scale with QUEST’s reach toward 1 million learners
  • Lower cloud cost, freeing budget for the mission
  • Confidence to onboard large user cohorts without performance risk

06

Why It Matters

For a non-profit working to put employability skills in the hands of a million young people, a platform that buckles at a few hundred users is a hard ceiling on impact. Scaling it 50x - safely, and without simply throwing money at always-on servers is what turns an ambition into a reachable goal. By diagnosing the real bottlenecks and re-architecting for elastic scale, tuned concurrency, and zero-downtime releases, Focaloid took the Quest App from ~200 to 10,000+ concurrent users while keeping cloud costs in check. That’s headroom the mission can actually grow into.

Let's build

Need your platform to handle the load that’s coming?

We diagnose the real bottlenecks and re-architect for elastic scale auto-scaling, load balancing, database tuning, and zero-downtime Blue-Green deployments - so you can grow without breaking or overspending.