NVIDIA
Santa Clara, CA
Deep Learning Architect, LLM Inference
About this role
NVIDIA is seeking a Deep Learning Architect specializing in LLM inference. The role focuses on designing and optimizing high-performance inference solutions for large language models, likely on NVIDIA's own hardware and software stacks. You'll be at the forefront of making advanced AI models deployable and efficient.