Apache Aurora has been managing Twitter's production services on Mesos for three years, and now powers the majority of compute at the company. In that time, our team has adapted Aurora's scheduler in many ways to be robust, usable, and performant. In this talk, Bill will discuss robustness features critical to Aurora's success, and explain the features we're interested in building in the scheduler.
This talk will be useful for anyone involved in writing or maintaining services, as well as Mesos framework developers. Audience members will learn about how the Aurora scheduler works, and gain an understanding of the features that make it resilient enough to power twitter.com.
This talk aims to share design decisions, mistakes made, and features developed over several years of writing and maintaining a business-critical Mesos scheduler. This will help promote our open-source software as well as share our experience with other Mesos framework developers.
Survey this Session