Deep Learning Architect, LLM Inference

About this role

NVIDIA is seeking a Deep Learning Architect specializing in LLM Inference. This role will focus on designing and optimizing high-performance inference solutions for large language models, likely involving NVIDIA's own hardware and software stacks. You'll be at the forefront of making advanced AI models deployable and efficient.