Senior Site Reliability Engineer (SRE)

New Yesterday

Location: London, England, United KingdomSalary: Not disclosedDescriptionThe OpportunityAre you interested in making a difference? To work for a tech-for-good company whose reason for being is to help all boards and leadership teams to be a powerful driver of performance and a force for good? Board Intelligence is on a mission to bring kindness and success together and to drive companies to think about what matters. We work with over 30,000 Chairs, CEOs, and board members to embed the discipline of focus into their organisations, and we’re helping a new board every day to focus on what matters. We are in it for the long term, come join us on this journey.As a Senior Site Reliability Engineer (SRE), you'll be joining a team whose mission is to ensure the availability, performance, security and reliability of our platform and core services, ensuring that they meet the needs of our internal and external users. You will take the lead on projects across the entire breadth of our tech stack, from planning all the way through to delivery and maintenance - you will bring others on the team with you on the journey too and not just go it alone. You will be responsible for visibility and monitoring of those systems, for building tooling and automation to reduce TOIL and for responding to incidents as part of our 24/7 SRE on-call team.Reliability Engineering at Board IntelligenceThe SRE team:Strives to provide the highest standards of Availability, Scalability, Performance and Security for our Software as a Service environments across multiple cloud vendors and our own private cloud physical infrastructure hosted at datacenters in the UK.Provides enabling infrastructure, pipelines and tooling to support product development.Works closely with security, product development and commercial teams to ensure the future suitability of our infrastructureAgrees and sets standards and methodologies for engineering workProactively monitors our platform and responds to incidents as part of a 24 / 7 rotaKey responsibilities of the roleWe're looking for a great Senior SRE to be a hands on individual contributor to key technical projects and to help us build a first-class SRE function. This role will involve:Hands on work with technical projects, taking direction from the team PrincipalsImplement and maintain monitoring solutions / metric-driven alerting, logging and tracingTroubleshoot in complex environmentsEstablish and measure SLIs and SLOs with engineering teams and continuously improve relationships and ways of working with other engineering teamsParticipate in periodic 24x7 paid on-call dutiesHolds, or is eligible to obtain HMG Security Clearance at the SC levelBuild and manage systems, infrastructure and applications using infrastructure as code and automation (Terraform, Ansible, K8s, Helm, Go)Pair programming, knowledge sharing and running appropriate training sessions for the teamWriting well-defined tickets (and supporting documentation when required) as well as keeping them up-to-dateStrong communication skills with the ability and openness to work across a range of varied stakeholders and confidence to check and challenge when required.Cares about evolving SRE best practices (through a security lens) and is driven to find the right ways of working with the teamAppreciation of architecture decisions and trade offsIs self-driven and constantly striving to improve everything with automation and monitoringIs able and willing to travel to our physical datacenters in the U.K should the need ariseDemonstrates and promotes positive attitudes and behaviours: collaboration, learning, sharing, respect and kindness #J-18808-Ljbffr
Location:
London, England, United Kingdom
Job Type:
FullTime

We found some similar jobs based on your search