NVIDIA
Santa Clara, CA

Deep Learning Architect, LLM Inference

OnsitePosted todayWebsiteLinkedIn

About this role

NVIDIA is seeking a Deep Learning Architect specializing in LLM Inference. This role will focus on designing and optimizing high-performance inference solutions for large language models, likely involving NVIDIA's own hardware and software stacks. You'll be at the forefront of making advanced AI models deployable and efficient.