Site Reliability Engineer
Are you passionate about building reliable, scalable, and high-performing systems? Do you thrive on solving complex infrastructure challenges while driving automation and observability best practices? If so, we want to hear from you!
At Thredd, we’re looking for a Site Reliability Engineer to act as a North Star for this evolving discipline. As our first engineer in this role, you’ll have the unique opportunity to shape our SRE strategy, establish best practices, and set the standard for service reliability and performance.
The Impact You’ll Have as a Site Reliability Engineer
- Design and oversee the implementation of complex, secure, and scalable network solutions that support global transaction processing.
- Lead network innovation by identifying opportunities to adopt emerging technologies and drive efficiency.
- Coordinate and prioritise network‑related initiatives across teams, balancing operational needs with strategic growth.
- Mentor and support engineers within the team, fostering technical excellence and a customer‑focused mindset.
- Drive performance and reporting, delivering insights and data that help optimise system health and uptime.
- Collaborate with stakeholders, vendors, and service providers to ensure seamless integration and service quality.
- Develop and enforce quality assurance protocols and documentation standards across our network landscape.
- Own strategic network planning, ensuring infrastructure evolves in step with our product and market expansion.
What You’ll Be Doing as a Site Reliability Engineer
- Building and maintaining the infrastructure, tooling, and technical foundation of Thredd.
- Ensuring high service uptime and reliability so product teams can innovate effectively.
- Playing a key role in shaping the core technology layers that drive our platform’s success.
What You’ll Bring to the Site Reliability Engineer Position
- Proven experience implementing SRE principles at scale, including deep knowledge of SLI/SLO/SLA differences.
- A product engineering background with strong coding skills in Python or similar.
- Experience with incident management frameworks and evolving them for efficiency.
- Expertise in cloud platforms (AWS preferred) and container orchestration (Docker, Kubernetes, ECS).
- Solid understanding of microservices, service mesh, and modern architectural concepts.
- A collaborative mindset – you thrive on helping others and driving company-wide impact.
Nice to have
- Experience working in regulated industries (e.g., PCI compliance).
- Background in capacity planning, performance, and load testing.
- Sysadmin skills for troubleshooting disk, network, and infrastructure issues.
Where you’ll work
Our working model varies depending on the specific role and team requirements. We strive to provide flexibility whilst ensuring that each position is best supported for optimal collaboration and performance.
This Site Reliability Engineer position requires you to be in the London office (Holborn) one day per week.
About us
Thredd is the trusted next-gen payments partner for innovators looking to modernise their payments offering. Certified by Mastercard, Visa and Diners & Discover, we process billions of debit, prepaid, and credit transactions annually, supporting consumer and corporate fintechs, digital banks, and embedded finance providers across the globe. Our unique offering is our client-centric approach, combining hands-on support with modern, reliable, and scalable technology.
Our assured solution accelerates the development and delivery of consumer and corporate payments components embedded within digital banks, as well as for expense management, B2B payments, crypto, lending, credit, Buy Now Pay Later, FX, remittance, and open banking innovators.
Since 2007, Thredd has enabled market leaders through our highly reliable, secure, and scalable platform and supported many of our clients' growth journeys, from early-stage startups through to globally recognised unicorns, including Monzo, Revolut, and Starling.
Diversity and Inclusion at Thredd
Here at Thredd, we are committed to building a diverse and inclusive workplace where everyone feels valued, respected and empowered. We welcome applications from people of all backgrounds, experiences and identities. If you require any adjustments during the recruitment process, please let us know and we would be happy to support you.
Our Values
Our values-driven culture is what unites our teams globally, and our teams are what drive our success:
- Own it and deliver – Taking responsibility for your own performance and being successful in your own role
- Collaborate purposefully – Building trusted relationships with colleagues, supporting activities and being successful together
- Think differently – Asking questions to check understanding and sharing your ideas to support continuous improvement
- Act courageously – Stepping out of your comfort zone and embracing change to help you learn and grow
- Location: London, England, United Kingdom
- Salary: £150,000 - £200,000
- Job Type: Full-time
- Category: Engineering