Site Reliability Engineer - Database and Observability (f/m/d) Lausanne, Switzerland or remote in EU/UK, Switzerland | Posted on 06/20/2025
City Lausanne, Switzerland or remote in EU/UK
Job Description Exoscale is the leading Swiss/European cloud service provider.
With services covering the full cloud infrastructure spectrum - from fast deploying virtual machines to S3 compatible object storage - Exoscale provides a simple and scalable experience to let its clients focus on their core business.
Join a dynamic working environment with a cutting-edge distributed team based in Lausanne. Exoscale strives to create an environment with great working conditions and welcomes diverse applicants.
As part of its ongoing efforts to grow its infrastructure footprint, Exoscale is hiring a Site Reliability Engineer .
The site reliability engineer plays a critical role in ensuring the constant availability of the Exoscale platform. The engineering team works on designing, developing, operating, and supporting products.
With an expanding customer base and new products, site reliability engineers build and maintain a wide range of technologies. As users of Exoscale itself, site reliability engineers also actively participate in product improvements.
This position focuses on designing, developing, and maintaining Exoscale's core platform and security components.
Some of the challenges you will be working on:
Maintain and optimize our persistent data infrastructure, including MariaDB, Cassandra, FoundationDB, and Kafka.
Enhance and evolve our observability stack to improve system visibility and performance monitoring.
Participate in automation and orchestration efforts to streamline operations and reduce manual intervention.
Improve processes to ensure scalability, reliability, and high availability of our infrastructure.
Join the on-call rotation after completing training.
Ideal candidates are:
Experienced with Linux and systems administration.
Proficient in MariaDB and managing large-scale database deployments.
Proficient in Go programming language and familiar with distributed systems principles.
Familiar with Prometheus and the broader observability ecosystem.
Interested in Kafka, Cassandra, and/or FoundationDB, or eager to learn.
Skilled in configuration management and managing large-scale infrastructure.
Passionate about automation and workflow optimization.
Team players who thrive in distributed environments.
Curious, autonomous, and eager to learn new technologies daily.
Strong communicators in English, both written and spoken.
What we offer:
Flexible working hours and remote work options.
Autonomous working conditions with creative freedom.
Modern office environment with good public transport access.
Team events, training, and further education opportunities.
Candidates willing to learn new topics are encouraged to apply.
#J-18808-Ljbffr