Job description:
Support the Observability team in defining, evaluating, and developing a robust observability stack for our customers' deployments.
Interact with the development and infrastructure teams to define solutions for collecting data to be included in the observability stack.
Design and maintain customized solutions developed in-house to improve observability.
Skills required:
Cloud observability (OCI/Azure)
Grafana
Prometheus/Alertmanager
Loki
OpenTelemetry Collector and concepts
Development skills (Java/Python/Go)
IaC and IT automation/configuration tools (Terraform/Ansible)
Kubernetes
Knowledge of observability pillars/signals (metrics, logs, traces)
PagerDuty and event management tools