Member of Technical Staff, AI Pretraining Platform
Join to apply for the Member of Technical Staff, AI Pretraining Platform role at Microsoft Innovation Center
Member of Technical Staff, AI Pretraining Platform
2 weeks ago Be among the first 25 applicants
Get AI-powered advice on this job and more exclusive features.
Overview
Help build the world’s most advanced training platform at Microsoft AI
We are on a mission to create the leading pretraining platform to develop the world’s most capable AI frontier models. This platform will span one of the world’s foremost GPU clusters, pushing the boundaries of scale, performance, and reliability.
The AI Pretraining Platform team at Microsoft AI is responsible for all aspects of infrastructure including scalability, benchmarking, kernel development, performance optimizations, communications, and fault tolerance to support our model pre-training operations. We are an interdisciplinary team of engineers and scientists, learning from each other, and collaborating to create the best models, methods, and products. We work closely with the teams that transform pre-trained models into the models that power the consumer Copilot experience.
About
We are looking for outstanding individuals excited about contributing to the next generation of systems that will transform the field. We are seeking candidates who:
* Are passionate about infrastructure enabling large-scale AI model training
* Thrive in a highly collaborative, fast-paced environment
* Have a high degree of craftsmanship and pay close attention to details
* Demonstrate a proactive attitude and enthusiasm for exploring new methods and technologies
* Can effectively manage multiple responsibilities and adjust to shifting priorities
Responsibilities
* Design and develop Python and CUDA/HIP C++ code that enable distributed training of multimodal LLMs ingesting text, audio, images, or video data
* Build and maintain cutting-edge infrastructure to store and process petabytes of data
* Partner with pretraining and post-training teams to improve data recipes through rigorous experimentation
* Collaborate with product teams and other engineers and researchers across Microsoft AI to identify gaps in current models
* Embody our culture and values
Qualifications
* Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, data modeling, or data engineering
* OR Master's Degree in the same fields AND experience in related work
* OR equivalent experience
* Experience with HPC and/or parallel programming
* Experience in pretraining
* Experience working with GPU clusters
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration without regard to various protected characteristics. For accommodations during the application process, please use the provided form.
#J-18808-Ljbffr