Mid & Senior Site Reliability Engineers - GDS - G7

The Government Digital Service (GDS) is the digital centre of government. We are responsible for setting, leading and delivering the vision for a modern digital government.

Our priorities are to drive a modern digital government, by:
- joining up public sector services
- harnessing the power of AI for the public good
- strengthening and extending our digital and data public infrastructure
- elevating leadership and investing in talent
- funding for outcomes and procuring for growth and innovation
- committing to transparency and driving accountability

We are home to the Incubator for Artificial Intelligence (I.AI) and the world-leading GOV.UK, and are at the forefront of coordinating the UK’s geospatial strategy and activity.

Job Description

About Engineering Enablement

Reporting to the GDS Product Group CTO, the Technology Programme is boosting efficiency, strengthening security, and improving developer experience by providing infrastructure, tools and standards.

- Cloud Platform team: owns and operates a thin central platform for our AWS estate.
- Developer Experience and FinOps team: manages core engineering tooling, proactively works to enhance developer practice and experience, and ensures value from our SaaS services.
- Engineering Access Operations team: owns and operates identity and access management for our systems and acts as an intelligent customer for IT services, improving overall effectiveness.
- Business Enablement team: manages core business tooling and services, supporting business impact and agility.

About GOV.UK One Login

The GOV.UK One Login Programme represents a once-in-a-generation, once-in-a-career opportunity to simplify and widen access to all digital government services.

About GOV.UK Pay

GOV.UK Pay lets service teams across the public sector take online and over-the-phone card payments from their users quickly and easily.

As a Site Reliability Engineer you will:
- be part of a multidisciplinary team developing and supporting one of our product areas
- write infrastructure as code using Terraform or CloudFormation to ensure our infrastructure is consistent, reusable and reliable
- deploy and configure observability tools to enable our teams to identify and respond to operational issues quickly and effectively
- build CI/CD pipelines to enable the team to get code into production quickly and reliably
- provide day-to-day support for our platforms and tools to ensure they remain available, secure and robust
- participate in on-call rotations when necessary
- solve complex and interesting problems
- share your knowledge and expertise with your peers and the wider team to drive consistency and develop a culture of openness and learning

Person specification

We’re interested in people who have:
- a deep understanding of Linux operating system internals and are comfortable working with Linux virtual machines or containers
- proficiency in at least one programming language (we use Ruby, Java, TypeScript, and Python)
- strong experience of working with infrastructure technologies such as databases, web servers, DNS, CDNs, reverse proxies, message queues and load balancers
- experience of building and maintaining services in the cloud (preferably AWS)
- extensive experience of creating infrastructure as code using Terraform or CloudFormation
- experience of using container orchestration systems such as Kubernetes, ECS or serverless application design with AWS Lambda
- experience supporting large production services
- strong Git skills
- experience of creating pipelines in a CI/CD tool like GitHub Actions or AWS CodePipeline
- a strong understanding of security principles and how to keep large operational services secure