Amazon

Sr. ML Kernel Performance Engineer, AWS Neuron, Annapurna Labs

📍 Location
toronto, on
⏰ Job Type
Full-time
📅 Posted
May 29, 2026
Apply Now

Job Description

The Annapurna Labs team at Amazon builds Neuron, the software development kit used to accelerate deep learning and GenAI workloads on Amazon’s custom machine learning accelerators, Inferentia and Trainium.

The Acceleration Kernel Library team is at the forefront of maximizing performance for Amazon's custom ML accelerators. Working at the hardware-software boundary, our engineers craft high-performance kernels for ML functions, ensuring every FLOP counts in delivering optimal performance for our customers' demanding workloads. We combine deep hardware knowledge with ML expertise to push the boundaries of what's possible in AI acceleration.

The Amazon Neuron SDK, developed by the Annapurna Labs team at Amazon, is the backbone for accelerating deep learning and GenAI workloads on Amazon's Inferentia and Trainium ML accelerators. This comprehensive toolkit includes an ML compiler, runtime, and application framework that seamlessly integrates with popular ML framework...

Ready to Apply?

Take the next step in your career - we're hiring now!

Apply for this Position