Site Reliability Engineer (SRE) for GCP Analytics Platform

New Yesterday

Site Reliability Engineer (SRE) for GCP Analytics Platform

We are seeking a Site Reliability Engineer (SRE) to join our Data & Platform Enablement Lab. As an SRE, you will play a pivotal role in shaping and supporting a best-in-class analytics platform on Google Cloud within Lloyds Banking.

Our mission is to drive cost efficiency, transparency, and accelerated time-to-value through automation, integration, and evergreening of the latest cloud-native tools and features. You will be part of a team that provides foundational capabilities for data access, storage, processing, and deployment, empowering users through self-service, automated engagement, and Golden Paths that simplify platform adoption.

Your focus will be on ensuring the reliability, scalability, and observability of our cloud infrastructure, while continuously improving service levels and operational excellence. This is a unique opportunity to contribute to a platform that underpins enterprise-wide data provisioning, enabling faster, more efficient delivery of insights and innovation across the Group.

Responsibilities:

  • Manage the development and/or operation of significant aspects of the data management system with guidance from senior colleagues.
  • Grows own capabilities by pursuing and investing in personal development opportunities and develops the capabilities of direct reports by working within existing development framework.
  • Delivers prescribed outcomes for area of responsibility by working within established knowledge management systems.
  • Analyses specified problems and issues to find the best technical and/or professional solutions.
  • Develops product specifications while designing testing procedures and standards.

Requirements:

  • Google Cloud Platform (GCP) Certification, such as Professional Cloud DevOps Engineer or Professional Cloud Architect.
  • 3 to 5 years of hands-on experience working with Google Cloud products, particularly in the context of analytics platforms or large-scale infrastructure.
  • Strong understanding of Site Reliability Engineering (SRE) principles, including SLIs/SLOs, error budgets, and incident response.
  • Experience with infrastructure as code (e.g., Terraform, Deployment Manager) and CI/CD pipelines.
  • Proficiency in monitoring, logging, and observability tools (e.g., Stackdriver, Prometheus, Grafana).

What we offer:

We provide a wide-ranging benefits package, which includes a generous pension contribution of up to 15%, an annual performance-related bonus, share schemes including free shares, benefits you can adapt to your lifestyle, such as discounted shopping, 30 days' holiday, with bank holidays on top, and a range of wellbeing initiatives and generous parental leave policies.

We are committed to creating an environment in which everyone can thrive, learn, and develop. Our ambition is to be the leading UK business for diversity, equity, and inclusion, supporting our customers, colleagues, and communities.

#J-18808-Ljbffr
Location:
Manchester
Job Type:
FullTime
Category:
IT & Technology

We found some similar jobs based on your search