Observability Platform Engineer (SRE Focus)
The Mission: We're building a world-class Observability function, and we're looking for someone who lives for uptime, meaningful alerts, and elegant dashboards. If you've ever been on-call, silenced a noisy monitor, or traced a ghost bug across microservices outside core hours, we want to hear from you. This isn't a generic Platform Engineer role. You'll be laser-focused on observability, reliability, and developer empowerment, working closely with teams to make sure we know not just when things break, but why.
Responsibilities
- Designing and scaling on-call systems that engineers don't dread being part of
- Building out Datadog monitoring, alerting, dashboards, and log pipelines for our Kubernetes-based environments
- Defining and managing SLOs, SLIs, and error budgets — and helping teams stick to them
- Creating scorecards and software catalogs so engineers know what's healthy, what's broken, and who owns what
- Training and enabling dev teams to own their own observability, alerts, and incident response
- Introducing chaos engineering practices
- Driving a culture of reliability, with incident reviews, shared learnings, and transparency
You Might Be a Fit If You...
- Have production experience with observability tools (especially Datadog) in cloud-native environments
- Have set up monitoring and alerting across Kubernetes services
- Have built or scaled on-call systems in startups or large-scale environments
- Know how to reduce alert fatigue and love a good MTTR chart
- Have experience with infrastructure as code (Terraform preferred)
- Believe that great developer experience includes clear visibility and ownership
- Are curious about — or already practicing — chaos engineering
Bonus Points
- Experience with OpenTelemetry, Fluent Bit, or similar
- Familiarity with service catalog tooling (e.g., Backstage)
- Comfort running or facilitating game days or failure drills
- Prior involvement in setting up scorecards for service health
What This Role Isn't
- This is not a traditional platform or infra role
- You won't be spending your days tweaking CI/CD pipelines or setting up VPCs
- We're looking for someone obsessed with how systems behave in production, not just how they're deployed
The Stack
- Cloud: AWS (EKS, Lambda, etc.)
- Observability: Datadog, OpenTelemetry
- Infra as Code: Terraform
- Orchestration: Kubernetes (EKS)
- Logging: Fluent Bit, FireLens
- Catalogs/Scorecards: Backstage (or custom)
Apply Now
If this sounds like your kind of role, we'd love to hear from you. Drop us a message with your CV and a note about the coolest monitoring setup or incident resolution you've ever worked on.
Benefits
Why join YouLend?
- Award-Winning Workplace: YouLend has been recognised as one of the Best Places to Work 2024 & 2025 by the Sunday Times for being a supportive, diverse, and rewarding workplace
- Award-Winning Fintech: YouLend has been recognised as a Top 250 Fintech Worldwide company by CNBC
We offer a comprehensive benefits package that includes:
- Stock Options
- Private Medical insurance via Vitality
- EAP with Health Assured
- Enhanced Maternity and Paternity Leave
- Modern and sophisticated office space in Central London
- Free Gym in the office building in Holborn
- Subsidised Lunch via Feedr
- Deliveroo Allowance when working late in the office
- Monthly In-Office Masseuse
- Team and Company Socials
- Football Power League / Squash Club
We champion diversity and equal opportunity employment practices. Our hiring decisions are based on qualifications, merit, and business requirements, free from discrimination based on race, gender, age, disability, religion, nationality, or any other protected basis under applicable law.
Seniority level
- Mid-Senior level
Employment type
- Full-time
Job function
- Information Technology
Industries
- IT Services and IT Consulting
- Location: London, England, United Kingdom
- Salary: £125,000 - £150,000
- Job Type: Full-time
- Category: IT & Technology