NVIDIA

NVIDIA Senior Engineer in GPU Inference

📍 Location
toronto, on
⏰ Job Type
Full-time
📅 Posted
June 04, 2026
Apply Now

Job Description

Advance your career with NVIDIA as a Senior Engineer, focusing on GPU inference systems for AI. Drive optimization and collaboration while enhancing performance across large-scale models.
In this crucial role, you will architect high-performance inference stacks and fine-tune NVIDIA's GPU solutions to achieve top productivity. Your expertise will significantly contribute to hitting industry benchmarks and implementing advanced GPU kernels within a multi-cloud environment.
Key Responsibilities:
• Develop and optimize vLLM features with cutting-edge GPU technology
• Benchmark and profile GPU kernels for enhanced efficiency
• Create robust tools for inference benchmarking methods
• Spearhead orchestration of large-scale inference deployments
• Publish innovative research to elevate machine learning systems
Requirements:
• Extensive background in computer science with advanced degree options
• Proficient in Python, C/C++, and GPU programming languages
• Str...

Ready to Apply?

Take the next step in your career - we're hiring now!

Apply for this Position