Lead Site Reliability Engineer
1 Days Old
JOB TITLE: Lead Site Reliability Engineer
LOCATION(S): Edinburgh, Halifax or London
HOURS: Full time
WORKING PATTERN: Hybrid, 40% (or two days) in one of the above offices
About this Opportunity
The Lead SRE is accountable for the reliability, scalability, and performance of cloud infrastructure and platform services supporting Risk Foundations. This role ensures that services meet defined Service Level Objectives (SLOs), manages error budgets, and leads incident and problem management across multiple feature teams! The Lead SRE supports methodologies in SRE and collaborates with product and application teams to integrate reliability engineering into delivery pipelines!
Key Responsibilities
Reliability & Performance Management: Design, implement and own the SLOs for critical platform services. Monitor system health, manage error budgets, and drive improvements in Mean Time to Failure (MTTF) and Mean Time to Recovery (MTTR).
Incident & Problem Management: Lead incident response and post-mortem analysis. Ensure root cause identification and long-term remediation strategies are implemented.
Platform Advocacy & Collaboration: Champion SRE principles across Risk Foundations Labs. Collaborate with Lab Product Owners, Engineering Leads, and application teams to embed reliability into design and delivery.
Technical Leadership: Provide technical oversight across cloud infrastructure, CI/CD pipelines, observability tooling, and automation frameworks. Guide engineers in adopting scalable and resilient solutions.
Continuous Improvement: Identify and implement improvements in deployment, monitoring, and alerting processes. Drive automation to reduce toil and improve operational efficiency.
Governance & Compliance: Ensure platform services adhere to internal risk, security, and compliance standards. Support audit and regulatory reporting requirements.
About us
If you think all banks are the same, you’d be wrong. We’re an innovative, fast-changing business that’s shaping finance as a force for good. A bank that’s empowering its people to innovate, explore possibilities and grow with purpose.
What you'll need
Proven experience embedding SRE practices within large-scale cloud environments.
Strong understanding of observability, monitoring, and incident response tooling.
Experience with infrastructure-as-code, CI/CD, and cloud-native technologies (e.g., GCP, Azure).
Ability to lead cross-functional teams and influence technical direction.
Familiarity with risk and compliance frameworks in financial services is a plus.
About working for us
Our focus is to ensure we're inclusive every day, building an organisation that reflects modern society and celebrates diversity in all its forms.
We want our people to feel that they belong and can be their best, regardless of background, identity or culture.
We were one of the first major organisations to set goals on diversity in senior roles, create a menopause health package, and a dedicated Working with Cancer initiative.
And it’s why we especially welcome applications from under-represented groups.
We’re disability confident. So, if you’d like reasonable adjustments to be made to our recruitment processes, just let us know.
We also offer a wide-ranging benefits package, which includes:
A generous pension contribution of up to 15%
An annual bonus award, subject to Group performance
Share schemes including free shares
Benefits you can adapt to your lifestyle, such as discounted shopping
30 days’ holiday, with bank holidays on top
A range of wellbeing initiatives and generous parental leave policies
Want to do amazing work, that’s interesting and makes a difference to millions of people? Join our journey.
#J-18808-Ljbffr- Location:
- City Of Edinburgh, Scotland, United Kingdom
- Salary:
- £125,000 - £150,000
- Job Type:
- FullTime
- Category:
- IT & Technology
We found some similar jobs based on your search
-
1 Days Old
Site Reliability Engineer Senior Lead
-
Slough, England, United Kingdom
-
£125,000 - £150,000
- IT & Technology
Social network you want to login/join with: Site Reliability Engineer Senior Lead, Slough col-narrow-left Client: Mars Location: Slough, United Kingdom Job Category: Other - EU work permit required: Yes col-narrow-right Job Reference: 202520eca...
More Details -
-
1 Days Old
Lead Site Reliability Engineer
-
City Of Edinburgh, Scotland, United Kingdom
-
£125,000 - £150,000
- IT & Technology
JOB TITLE: Lead Site Reliability Engineer LOCATION(S): Edinburgh, Halifax or London HOURS: Full time WORKING PATTERN: Hybrid, 40% (or two days) in one of the above offices About this Opportunity The Lead SRE is accountable for the reliability,...
More Details -
-
1 Days Old
Lead Cloud Site Reliability Engineer
-
London, England, United Kingdom
-
£150,000 - £200,000
- IT & Technology
Join to apply for the Lead Cloud Site Reliability Engineer role at LSEG Join to apply for the Lead Cloud Site Reliability Engineer role at LSEG ABOUT US: LSEG (London Stock Exchange Group) is more than a diversified global financial markets in...
More Details -
-
1 Days Old
Lead Site Reliability Engineer
-
Guildford, England, United Kingdom
-
£125,000 - £150,000
- IT & Technology
Join to apply for the Lead Site Reliability Engineer role at Boehringer Ingelheim 1 day ago Be among the first 25 applicants Join to apply for the Lead Site Reliability Engineer role at Boehringer Ingelheim Get AI-powered advice on this job and...
More Details -
-
1 Days Old
Lead Site Reliability Engineer
-
London, England, United Kingdom
-
£150,000 - £200,000
- IT & Technology
End Date Monday 28 July 2025 Salary Range £104,686 - £123,160 We support flexible working – click here for more information on flexible working options Flexible Working Options Hybrid Working, Job Share Job Description Summary . Job Description ...
More Details -