Responsibilities
* Develop algorithms, model architectures, data mixtures, and scaling laws for large-scale training using a rigorous data-driven approach grounded in meticulous ablations.
* Drive algorithmic implementations, conduct experiments, and oversee flagship training runs on our in-house large-scale distributed stack.
* Collaborate closely with teams on infrastructure, data, post-training, and multimodality.
* Embody our culture and values.
Qualifications
Minimum Qualifications:
* Bachelor's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, data modeling or data engineering work.
* OR Master's Degree in Computer Science, Math, Software Engineering, Computer Engineering, or related field AND experience in business analytics, data science, software development, or data engineering work.
* OR equivalent experience.
* Experience working with frameworks or libraries for model pre-training, such as TensorFlow, PyTorch, or Hugging Face Transformers.
Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable laws, regulations and ordinances.
If you need assistance and/or a reasonable accommodation due to a disability during the application or the recruiting process, please send a request via the Accommodation request form.
Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.
#J-18808-Ljbffr