Site Reliability Engineer
New Today
Job Description
Company Description
WALT Labs, a leading managed service provider, is dedicated to empowering businesses by harnessing the power of cloud technology. Our team specializes in delivering customized solutions tailored to meet the unique needs of our clients, driving growth and operational efficiency across industries. From supporting small businesses with seamless data migration to enabling large corporations to manage complex infrastructure projects, we provide exceptional service while staying at the forefront of cloud technology advancements.
Role Description
This is a full-time on-site role 3 days a week minimum in Kings Cross London. We are seeking a skilled Site Reliability Engineer with a strong focus on Google Cloud Platform (GCP) to join our dynamic team. In this role, you’ll be responsible for maintaining cloud infrastructure, managing incidents, and ensuring seamless operations for our clients. You’ll use tools like incident.io and JIRA to manage and resolve support requests efficiently.
Qualifications
- 8-10 years of experience managing applications and infrastructure performance.
- Proven experience with Google Cloud Platform (GCP) services.
- Familiarity with incident.io for incident tracking and management (of equivalent)
- Proficiency in using JIRA for task management and support workflows.
- Strong experience working with observability tools (Grafana)
- Strong troubleshooting and problem-solving skills in cloud environments.
- Understanding of cloud security and performance optimisation best practices.
- Knowledge of scripting or automation tools (e.g., Python, Terraform) is a plus.
- Excellent communication and customer service skills.
- Certifications in GCP (Professional certifications) are highly desirable.
- Ability to work under pressure and prioritise tasks effectively.
- Bachelor’s degree in Computer Science, Information Technology, or related field (or equivalent experience).
Responsibilities
- Provide technical support and resolve issues related to Google Cloud Platform (GCP) services and AWS.
- Manage and respond to cloud incidents using incident.io, ensuring timely resolution.
- Use JIRA to log, track, and prioritize support tickets and workflow tasks.
- Monitor and maintain cloud infrastructure for performance, reliability, and security.
- Collaborate with teams to identify and implement solutions to technical challenges.
- Assist in deploying, configuring, and optimising GCP resources.
- Create and maintain documentation for troubleshooting processes and best practices.
- Proactively identify opportunities to improve cloud environments and support processes.
- Support clients and stakeholders by providing clear communication and updates during incident resolution.
- Stay up-to-date with the latest GCP developments and contribute to team knowledge sharing.
Benefits
- 20 holiday days + bank holidays (earn 1.5 days every 3 years)
- Private health insurance
- Location:
- London
- Job Type:
- FullTime
- Category:
- Technology
We found some similar jobs based on your search
-
New Today
Site Reliability Engineer, Cloud Security
-
City Of London, England, United Kingdom
-
£80,000 - £100,000
- IT & Technology
About the Role Miro's Cloud Security team is a hybrid function, blending security and cloud engineering expertise. Embedded directly within our Cloud Engineering & Operations organization, we are at the core of Miro’s infrastructure to promote best ...
More Details -
-
New Today
Site Reliability Engineer (SRE)
-
City Of London, England, United Kingdom
-
£100,000 - £125,000
- IT & Technology
This role offers a hybrid work offering to be present in our London office twice per week. Reward Gateway|Edenred is a leading digital platform for services and payments for people at work, connecting 52 million users and 2 million partner merchants ...
More Details -
-
New Today
Senior Site Reliability Engineer - Azure
-
London, England, United Kingdom
-
£125,000 - £150,000
- Engineering
Overview Role Overview: We are seeking a highly skilled and motivated Senior Site Reliability Engineer (SRE) to join our engineering team to support critical application deployments in a follow-the-sun environment. In this role, you will leverage yo...
More Details -
-
New Today
Site Reliability Engineer
-
City Of London
- Engineer, Reliability Engineer, Reliability, Engineering, Site
Job Description Our client is a technology-driven trading firm with 10 years of experience in global markets. Their cutting-edge technology enables rapid response and efficient opportunity capture. The company fosters innovation and encourages hands...
More Details -
-
New Today
Site Reliability Engineer
-
London
- Technology
Site Reliability Engineer is a full-time on-site role 3 days a week minimum in Kings Cross London. You’ll be responsible for maintaining cloud infrastructure, managing incidents, and ensuring seamless operations for our clients. Qualifications 8-10 years of experience managing applications and infrastructure performance.
More Details -
-
New Yesterday
Site Reliability Engineer (Python Development)
-
London
- Technology
Job Description Job Title: Site Reliability Engineer (Python Development) Do you want to join a global leader within the fintech space? Our client is seeking a Site Reliability Engineer to join their global team. In this role, the successful ...
More Details -