Jobs
Meine Anzeigen
Meine Job-Alerts
Anmelden
Einen Job finden Tipps & Tricks Firmen
Suchen

Deep learning engineer - llm and vlm model compression

Zürich
NVIDIA
Model
EUR 30’000 - EUR 80’000 pro Jahr
Inserat online seit: 1 April
Beschreibung

We are looking for DL engineers passionate about building deep learning frameworks for large language (LLM) and vision language (VLM) model compression that push the boundaries of AI efficiency. In this role, you’ll collaborate with world‑class teams across NVIDIA to advance both the software and hardware stack that powers modern AI.


What You’ll Be Doing

* Design and implement a deep learning framework for compressing large language and vision‑language models to deliver highly optimized, high‑performance AI systems used worldwide.
* Develop and integrate new algorithms for pruning, NAS, and distillation in collaboration with NVIDIA researchers and engineers.
* Experiment with compressing the latest LLMs and VLMs, analyzing their performance and behavior across diverse workloads.
* Collaborate with researchers and engineers across NVIDIA, providing guidance on improving the design, usability and performance of workloads.
* Lead best‑practices for building, testing, and releasing DL software.


What We Need To See

* 8+ years of experience in Deep Learning and software development.
* BSc, MS or PhD degree in Computer Science, Computer Architecture or related technical field.
* Hands‑on experience with LLM or VLM model training or inference.
* Excellent Python programming skills.
* Extensive knowledge of at least one DL Framework (PyTorch, TensorFlow, JAX, MxNet) with practical experience in PyTorch required.
* Strong problem‑solving and analytical skills.
* Algorithms and DL fundamentals.


Ways To Stand Out From The Crowd

* Experience applying and implementing model compression techniques such as pruning, NAS, distillation, and quantization.
* Experience building deep learning frameworks for training, inference, model compression, or related topic.
* GPU programming experience (CUDA or OpenCL) is a plus but not required.
* First‑author publication in a top‑tier deep learning or AI conference.

We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. For Poland: The base salary range is 292,500 PLN - 507,000 PLN for Level 4, and 375,000 PLN - 650,000 PLN for Level 5.

#J-18808-Ljbffr

Bewerben
E-Mail Alert anlegen
Alert aktiviert
Speichern
Speichern
Ähnlicher Job
Payroll specialist | 70-100% | 6-9 monate befristet | hybrid working model | fribourg oder zürich
Zürich
Befristet
SMG Swiss Marketplace Group
Model
Ähnlicher Job
Director private clients - autoscout24 & motoscout24 | 100% | hybrid working model | zurich switzerland
Zürich
SMG Swiss Marketplace Group
Model
Ähnlicher Job
Digital growth & content creator automotive | 100% | hybrid working model | zürich
Zürich
SMG Swiss Marketplace Group
Model
Ähnliche Jobs
Kunst und Kultur Jobs in Zürich
Jobs Zürich
Jobs Zürich (Bezirk)
Jobs Zürich (Kanton)
Home > Stellenanzeigen > Kunst und Kultur Jobs > Model Jobs > Model Jobs in Zürich > Deep Learning Engineer - LLM and VLM Model Compression

Jobijoba

  • Karriere & Bewerbung
  • Bewertungen Unternehmen

Stellenanzeigen finden

  • Stellenanzeigen nach Job-Titel
  • Stellenanzeigen nach Berufsfeld
  • Stellenanzeigen nach Firma
  • Stellenanzeigen nach Ort

Kontakt / Partner

  • Kontakt
  • Veröffentlichen Sie Ihre Angebote auf Jobijoba

Impressum - Allgemeine Nutzungsbedingungen - Datenschutzerklärung - Meine Cookies verwalten - Barrierefreiheit: Nicht konform

© 2026 Jobijoba - Alle Rechte vorbehalten

Bewerben
E-Mail Alert anlegen
Alert aktiviert
Speichern
Speichern