We are looking for a talented and motivated Senior Cloud Operations and Automation Engineer to join a dynamic Cloud & Infrastructure team within a leading Swiss technology-driven company.
In this role, you will play a key part in ensuring the reliability, performance, and security of large-scale AWS-based environments, while driving automation, monitoring, and continuous improvement initiatives.
You will act as a technical expert and escalation point for cloud operations, collaborate closely with engineering and application teams, and help the organization evolve toward Site Reliability Engineering (SRE) best practices.
Your Responsibilities
Cloud Operations & Reliability
Lead and perform complex operational support across multi-account AWS environments, balancing stability and agility.
Troubleshoot and resolve high-severity incidents, coordinating with Cloud Engineering and Application teams.
Ensure 24/7 system reliability through proactive monitoring, maintenance, and on-call participation.
Automation & Infrastructure-as-Code
Maintain and evolve infrastructure-as-code using Terraform and GitLab CI/CD pipelines.
Design and implement automation frameworks to streamline operations and reduce manual workload.
Continuously improve deployment and release processes to increase efficiency and reduce downtime.
Monitoring & Observability
Enhance and maintain Datadog dashboards, alerts, and automation rules for proactive incident detection.
Lead improvements in monitoring strategy, observability, and alert management.
Security & Compliance
Ensure compliance with internal security and operational standards across all cloud deployments.
Collaborate with Security, Compliance, and Governance teams to maintain best-in-class infrastructure practices.
Collaboration & Mentoring
Act as a technical mentor for junior engineers and provide guidance on automation, monitoring, and best practices.
Build strong partnerships with business and application teams to ensure high-quality service delivery.
Contribute to documentation, process standardization, and continuous improvement initiatives.
Your Profile
Experience & Technical Skills
Minimum 5 years’ experience in cloud operations, infrastructure, or DevOps environments.
Advanced expertise in AWS services (EC2, S3, IAM, Lambda, RDS, VPC, etc.).
Solid hands-on experience with Terraform, GitLab CI/CD, and Datadog (monitoring, alerting, automation).
Proficiency in Windows Server and Linux system administration.
Working knowledge of SQL databases and network/storage management.
Proven track record in incident response, root cause analysis, and process automation.
Soft Skills
Strong analytical mindset and ability to manage complex, high-pressure situations.
Excellent communication skills and a collaborative approach to working with cross-functional teams.
Customer-focused, proactive, and improvement-oriented mindset.
Experience mentoring or coaching peers is an asset.
Languages: fluent English (working language); French is a plus.
Why Join
Opportunity to shape and optimize critical cloud and automation frameworks.
Work with modern technologies and a culture that promotes innovation and ownership.
Join a highly skilled, collaborative technical team with strong values and impact.
Competitive package, flexible work model, and long-term growth prospects.