The AWS Developers Podcast
Episode 201
Mar 25, 26 • 01:13:58
With Alexander Lisachenko, Software Architect, inDrive, and Artem Gab, Senior Engineering Manager, Runtime Systems, inDrive
Join us for a fascinating conversation with Alexander 'Sasha' Lisachenko (Software Architect) and Artem Gab (Senior Engineering Manager) from inDrive, one of the global leaders in mobility operating in 49 countries and processing over 8 million rides per day. Sasha and Artem take us through their four-year transformation journey from a monolithic bare-metal setup in a single data center to a fully cloud-native microservices architecture on AWS. They share the hard-earned lessons from their migration, including critical challenges with Redis cluster architecture, the discovery of single-threaded CPU bottlenecks, and how they solved hot key problems using Uber's H3 hexagon-based geospatial indexing. We dive deep into their migration from Redis to Valkey on ElastiCache, achieving 15-20% cost optimization and improved memory efficiency, and their innovative approach to auto-scaling ElastiCache clusters across multiple dimensions. Along the way, they reveal how TLS termination on master nodes created unexpected bottlenecks, how connection storms can cascade when Redis slows down, and why engine CPU utilization is the one metric you should never ignore. This is a story of resilience, technical problem-solving, and the reality of large-scale cloud transformations — complete with rollbacks, late-night incidents, and the eventual triumph of a fully elastic, geo-distributed platform serving riders and drivers across the globe.
Links
Here are the links to the tools, technologies, or articles we mentioned in this episode.
Redis in Action — Josiah L. Carlson (Manning)
AWS Well-Architected Framework — ElastiCache Lens
Brendan Gregg's Blog — Performance Analysis & Observability
Uber H3 — Hexagonal Hierarchical Spatial Index
inDrive Website
AWS ElastiCache Documentation
Valkey Project
AWS Well-Architected Framework