Anyscale

ML Tech Lead

Job Description

Posted on: November 8, 2023

We're looking for passionate, motivated people who are excited to build infrastructure and tooling for next-generation machine learning applications. We're hiring exceptional Software Engineers for the distributed training team at Anyscale, which is responsible for building and maintaining open source machine learning libraries widely adopted across the industry.


We are particularly looking for candidates at the senior, staff, or higher level who can help set and execute a vision for the future of machine learning training infrastructure. We are open to both individual contributors and people who are primarily technical but have prior experience managing a small team.

Responsibilities

  • Build performant, scalable, fault-tolerant distributed machine learning libraries that power the next generation of machine learning platforms around the world
  • Solve difficult architectural problems and turn the solutions into reality
  • Work with a team of leading experts in the areas of distributed systems and machine learning
  • Work with engineering managers and product managers to lead and grow an extremely talented team of software engineers
  • Work closely with the open source community (ML researchers, ML engineers, data scientists) to scope and build new abstractions for scalable machine learning
  • Work closely with end users and iterate on the product with them
  • Help us build and shape a world-class company

Job Requirements

  • 7+ years of experience building, scaling, and maintaining software systems in production environments
  • Solid fundamentals in algorithms, data structures, and system design
  • Experience with machine learning frameworks and libraries (PyTorch, TensorFlow)
  • Experience designing fault-tolerant distributed systems
  • Strong architectural skills

Bonus points!

  • Experience working with a cloud technology stack (AWS, GCP, Kubernetes)
  • Experience building machine learning training pipelines or inference services in a production setting
  • Experience managing and maintaining open source machine learning libraries
  • Experience managing small teams in pursuit of an ambitious technical goal
  • Experience using Ray
