Site Reliability Engineer
New Today
Overview
WeDo has partnered with a SME experiencing growth and is seeking a Site Reliability Engineer (SRE) to join the existing platform team. The role supports the continued expansion of service levels, focusing on ownership of the Monitoring, Observability and Security pillar.
Responsibilities
- Own the monitoring, observability and security pillar to improve reliability and resilience.
- Maintain and enhance monitoring and alerting solutions (CheckMK, Prometheus, Grafana, etc.).
- Work with Kubernetes (K8s) experience, ideally EKS, to manage and scale environments.
- Implement and manage Infrastructure as Code using Terraform.
Qualifications
- Mid-Senior level experience in a platform/SRE role.
- Experience moving infrastructure to AWS (the platform team has focused on moving most infra to AWS).
- Proficiency with monitoring, observability, and security practices.
Details
- Location: Fully Remote (UK Based; one day social meet up in London)
- Employment type: Full-time
- Seniority level: Mid-Senior level
- Job function/Industry: Information Technology; IT System Custom Software Development
- Process: 2-stage interview; start-to-finish likely around 2 weeks
- Location:
- United Kingdom
- Salary:
- £100,000 - £125,000
- Job Type:
- FullTime
- Category:
- Engineering