Deep Learning Performance Engineer

Job Description

Posted on: October 20, 2023

As a Deep Learning Performance Engineer, you will help build systems and optimizations that push the boundaries of performance across cutting-edge ML models. This role is critical to Anyscale because it allows us to offer market-leading performance and pricing for AI infrastructure.


  • Iterate quickly with product teams to ship the latest optimizations to the Anyscale platform, Anyscale Endpoints, and various open source offerings
  • Work closely with research teams on LLM engines such as vLLM and TensorRT-LLM
  • Follow the state of the art in the open source and research communities, implementing and extending best practices

Job Requirements

  • Prior experience working on GPUs / CUDA
  • Solid understanding of operating systems and/or networking fundamentals, and experience with optimizations in those areas
  • Familiarity with deep learning and deep learning frameworks (e.g. PyTorch)

Bonus points!

  • ML Systems knowledge
  • Experience training deep learning models
  • Contributions to deep learning frameworks (PyTorch, TensorFlow)
  • Contributions to deep learning compilers (Triton, TVM, MLIR)
  • Experience using Ray