We are looking for an enthusiastic software engineer to join our AI networking acceleration team, to work on a groundbreaking open-source library, using hardware offloads, GPU Kernels and RDMA network cards. Our product is a performance-oriented low-level infrastructure, crafted to change the way inference works. We thrive as a team in a deeply strong environment, and we're passionate about innovation.
Responsibilities
* Developing a highly optimized inference framework
* Running on the world’s largest supercomputers and data centers
* Work in a dynamic and challenging environment on innovative, next-generation products focused on performance, scalability, and features
What We Need To See (Qualifications)
* B.Sc. or equivalent experience in Computer Science or Software Engineering
* 8 years’ experience in modern C++ / C / Rust development
* 3 years’ experience in Linux environment and familiarity with development tools
* Deep knowledge of the TCP/IP network stack
* Understanding of computer architecture and operating systems concepts
Ways To Stand Out
* Hands-on experience with LLM inference stacks
* Expertise in distributed storage technologies
* Background in Linux internals and low-level software optimizations (benchmarking, bottleneck research, performance tuning)
* Experience in programming CUDA kernels is an advantage
* Background in parallel programming / high-performance computing / RDMA technology
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
#J-18808-Ljbffr