Anyscale

Engineering Manager (Infrastructure Reliability)

Job Description

Posted on: February 27, 2023

Anyscale is looking for a strong Engineering Manager to lead the Infrastructure Reliability team.

The mission of this team is to keep all user-facing services and Anyscale production systems running smoothly. We want to delight our users by ensuring the reliability of our systems, and this team plays a critical role in achieving that. Anyscale runs on top of cloud components, and the team will be responsible for developing a unified perspective on how these components are used across the company. This includes processes for provisioning, negotiating prices, managing costs, and identifying opportunities across the company for teams to reduce waste. The team will also be responsible for managing incident handling and for improving the maturity of our incident response process.

In this position, you will set the vision for, recruit, and enable a high-performing team. This team is critical to everything that Anyscale and Ray achieve and will play a pivotal role in the success of the company.

Responsibilities

  • Drive the strategy of the Infrastructure Reliability group at Anyscale
  • Hiring / Team Building: Recruit new employees and coach/mentor engineers
  • Focus on building and maintaining a culture of collaboration within the team
  • Communicate your work to a broader audience through talks, tutorials, and blog posts
  • Help us to build and shape a world-class company

Lead and guide the team in:

  • Developing a unified perspective on how cloud components are used across the company
  • Building systems that allow us to understand what’s happening in production so that when there is an issue we can identify it quickly. This involves helping to build common observability infrastructure for metrics, logging, monitoring, and tracing
  • Building tools to measure whether we are meeting service level objectives (SLOs), and defining what the SLOs should be across the organization

Job Requirements

  • Experience building infrastructure reliability teams
  • Solid engineering management experience leading productive, high-performing teams
  • Experience managing through other managers is a plus
  • Record of helping teams scale quickly while maintaining a good culture
  • A great track record of execution
  • A results-oriented mindset and excellent prioritization skills
  • Effective communication
