Responsibilities
Design and develop Python and CUDA/HIP C++ code that enables distributed training of multimodal LLMs ingesting text, audio, image, or video data.
Build and maintain cutting-edge infrastructure capable of storing and processing petabytes of data needed to power models.
Partner with pretraining and post-training teams to improve data recipes through rigorous experimentation.
Collaborate with the product team, engineers, and researchers across Microsoft AI to identify gaps in current models.
Embody our culture and values.
Qualifications
Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, data modeling, or data engineering work.
OR Master's Degree in the same fields AND experience in business analytics, data science, software development, or data engineering work.
OR equivalent experience.
Experience with HPC (High Performance Computing) and/or parallel programming.
Experience with model pretraining.
Experience working with GPU clusters.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations, and ordinances.
If you need assistance and/or a reasonable accommodation due to a disability during the application or recruiting process, please send a request via the Accommodation request form.
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.