Software engineer reinforcement learning

Zürich

Promote Project

Software Engineer

EUR 75’000 per year

Listing online since: 11 July

Description

Software Engineer Reinforcement Learning

Location

Zürich, Zürich, Switzerland

Salary

CHF 50’000 - 90’000 per year

Description

Employment Type: 6 Month Contract

We are looking for a Software Engineer with a focus on data preparation and AI model training. You will work on assembling, annotating, and cleaning training data, while contributing to reward modeling and supervised fine-tuning tasks.

You might thrive in this role if you:

* Have a deep understanding of machine learning and machine learning applications.
* Have working knowledge of, and experience with, tuning large language models (multimodal) and building evaluations.
* Are willing to dive into large codebases to debug.
* Thrive in a dynamic and technically complex environment.
* Have a track record of delivering novel, outside-the-box solutions under real-world constraints.

Responsibilities

* Data Assembly & Annotation: Gather and annotate training data for AI models, ensuring it meets the quality requirements for reward modeling and supervised fine-tuning.
* Data Cleaning & Processing: Conduct data cleaning and preprocessing to ensure models receive high-quality input.
* Model Training: Participate in the training and fine-tuning of models, ensuring that they meet performance and accuracy standards.
* Collaboration: Work with AI engineers, data scientists, and other team members to keep workflows and data handling efficient.
* Continuous Improvement: Support iterative improvements to models based on performance monitoring and feedback.

Requirements

* Experience: At least 3 years of experience working in a software engineering role focused on AI/ML tasks.
* Data Expertise: Hands-on experience assembling, annotating, and cleaning training data for machine learning models.
* Technical Skills: Proficiency in Python and experience with AI frameworks like TensorFlow or PyTorch.
* Model Training: Familiarity with model training, reward modeling, and supervised fine-tuning techniques.
* Attention to Detail: Strong focus on data quality and attention to detail when handling large datasets.

Bonus Points

* Experience working with reward modeling for AI systems.
* Familiarity with data labeling tools and techniques for supervised fine-tuning.
* Knowledge of cloud platforms for AI/ML workloads.

Please mention the word WARM and tag RNDQuMjA1LjUuNTM= when applying to show you read the job post completely.

Job type:

Remote job

Tags

* software
* python
* training
* support
* cloud
* assembly
* engineer
* engineering
