NVIDIA
Santa Clara, CA, US; New York, NY, US; Remote, US

Developer Technology Engineer, CPU Performance - New College Grad 2026

Hybrid$124,000 – $241,500/yrPosted 2 days agoWebsiteLinkedIn

Skip the busywork

ApplyBolt rewrites your resume for this exact role and hits submit. You just pick the jobs.

Resume tailored to this roleApplied in secondsTrack every application
Download the app

About this role

What you will be doing:

  • In this role, you will research and develop techniques to accelerate large scale applications running on NVIDIA’s family of advanced CPU platforms.
  • Work directly with other technical experts in their fields (industry and academia) to perform in-depth analysis and optimization of complex data intensive and compute intensive workloads to ensure the best possible performance on modern hardware architecture focused on CPU performance.
  • Publish and present discovered optimization techniques in developer blogs or relevant conferences to engage and educate the Developer community.
  • Influence the design of next-generation hardware architectures, software, and programming models in collaboration with research, hardware, system software, libraries, and tools teams at NVIDIA

What we need to see:

  • Pursuing or recently completed a BS, MS or PhD in Computer Science, Computer Engineering, or related field (or equivalent experience)
  • Relevant work or research experience.
  • Knowledge of modern CPU architectures (ARM, x86) and system/OS
  • Experience with CPU architecture fundamentals, especially memory subsystem (cache DRAM, storage.)
  • Hands-on experience with low-level parallel and system programming, SIMD vectorization, CPU intrinsics and concurrent data structures.
  • Programming fluency in modern C/C++ with a deep understanding of algorithms, concurrency, and other optimization techniques.
  • Good communication and organization skills, with a logical approach to problem solving, and prioritization skills.

Ways to stand out from the crowd:

  • Experience optimizing the performance of distributed database systems and frameworks (e.g. production database or Spark).
  • Background with video transcoding, compression, storage systems, networking, and distributed computer architectures.
  • Knowledge of ARM SoCs (AMBA CHI, SMMU) and GPU architectures.

The DevTech Engineer plays a crucial role in the success of NVIDIA and our customers. DevTechs work with external technologists to investigate performance of their applications, design parallel algorithms and implement optimizations in an accelerated computing environment. As recognized experts in the field we publish our findings in developer blogs or at relevant conferences and workshops. With insight to our customers, the industry, and academia we are important representatives of NVIDIA as a technology leader. Within NVIDIA we contribute valuable application expertise that influences next generation hardware and software products. As critical problem solvers, we deepen our expertise, expand our knowledge, and work across domains and organizations. Whether you are a leading industry luminary or early in your career, the DevTech Team provides ample opportunity for growth in the exciting field of accelerated computing!

NVIDIA is widely considered to be one of technology’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. Does contributing to and innovating in HPC, Accelerated Computing and Artificial Intelligence excite you? If so, we want to hear from you.