Senior Site Reliability Engineer

New Today

Overview

Location: Remote / Hybrid – UK
Sector: E-Commerce & Retail Platforms

A global retail brand is scaling its e-commerce and digital customer platforms, handling millions of daily transactions and peak seasonal traffic. To support this growth, they are hiring a Site Reliability Engineer with deep expertise in observability, cloud scalability, and performance tuning.

What you’ll do

  • Build and maintain highly scalable cloud infrastructure for large-scale e-commerce platforms.
  • Develop monitoring and observability frameworks to ensure fast response to performance bottlenecks.
  • Optimise CDN, caching, and APIs for high-traffic shopping events (e.g., Black Friday).
  • Drive automation and CI/CD pipelines to accelerate feature delivery without compromising stability.
  • Partner with software engineering teams to ensure always-on shopping experiences.

What we’re looking for

  • Proven track record in high-scale distributed systems (retail, e-commerce, digital platforms).
  • Expertise in observability stacks (Grafana, Prometheus, Datadog, NewRelic, Elastic).
  • Strong cloud skills (AWS/GCP/Azure) including Kubernetes and serverless.
  • Solid coding skills for automation (Python, Go, JavaScript, Bash).
  • Experience optimising performance in high-traffic digital platforms.

This is your chance to build reliability at retail scale, where seconds of downtime mean millions in lost revenue.

Highly competitive salary plus bonus % paid yearly.

Venquis is acting as an Employment Agency in relation to this vacancy.

#J-18808-Ljbffr
Location:
London
Job Type:
PartTime
Category:
Engineering

We found some similar jobs based on your search