Site Reliability Engineer

New Today

Overview

WeDo has partnered with a SME experiencing growth and is seeking a Site Reliability Engineer (SRE) to join the existing platform team. The role supports the continued expansion of service levels, focusing on ownership of the Monitoring, Observability and Security pillar.

Responsibilities

  • Own the monitoring, observability and security pillar to improve reliability and resilience.
  • Maintain and enhance monitoring and alerting solutions (CheckMK, Prometheus, Grafana, etc.).
  • Work with Kubernetes (K8s) experience, ideally EKS, to manage and scale environments.
  • Implement and manage Infrastructure as Code using Terraform.

Qualifications

  • Mid-Senior level experience in a platform/SRE role.
  • Experience moving infrastructure to AWS (the platform team has focused on moving most infra to AWS).
  • Proficiency with monitoring, observability, and security practices.

Details

  • Location: Fully Remote (UK Based; one day social meet up in London)
  • Employment type: Full-time
  • Seniority level: Mid-Senior level
  • Job function/Industry: Information Technology; IT System Custom Software Development
  • Process: 2-stage interview; start-to-finish likely around 2 weeks
#J-18808-Ljbffr
Location:
United Kingdom
Salary:
£100,000 - £125,000
Job Type:
FullTime
Category:
Engineering