Site Reliability Engineer
New Yesterday
Join to apply for the Site Reliability Engineer role at Gizmo
Join to apply for the Site Reliability Engineer role at Gizmo
Get AI-powered advice on this job and more exclusive features.
Gizmo is an AI startup on a mission to make learning so easy that anyone can learn anything. We're building Duolingo for anything - a platform that uses gamification and social mechanics to make learning fun.
With over 1 million monthly active users and $4M in annual recurring revenue, we’re already one of the fastest-growing startups in the UK. Backed by leading investors, we recently raised $16M in Series A funding to accelerate our vision of helping 1 billion people learn.
About the Role
Reporting to the founders, you will own capacity, performance and reliability for Gizmo’s full-stack platform as daily traffic climbs from hundreds of thousands to millions of users. You’ll write code across the stack, but your charter is classic SRE: defend SLOs, eliminate toil, and raise the ceiling on scale before it becomes a hard limit.
Responsibilities
- Define SLIs/SLOs for latency, availability and error rate; codify error budgets and partner with product teams on trade-offs.
- Perform load-testing, capacity modelling and up-front scalability design for PostgreSQL, OpenSearch, Redis, Hasura and CF Workers; produce data-driven scaling plans.
- Extend metrics, structured logging and tracing; establish alert rules that page only on user-visible impact; build actionable runbooks.
- Join the on-call rotation, lead blameless post-mortems, drive remediation work to closure and track MTTR/MTBF improvements.
- Automate repetitive ops on Kubernetes and CI/CD; keep “toil” under 50% of your time by pushing fixes into code.
- Coach full-stack engineers on query optimisation, schema design and back-pressure techniques; document patterns and anti-patterns by creating an SRE playbook.
Qualifications
- Hands-on scale experience: you have run relational stores at 100 k+ TPS or 1 M+ concurrent users (e.g., multi-tenant PostgreSQL, sharded MySQL).
- Strong backend fundamentals around concurrency, caching, indexing and distributed systems trade-offs.
- Proven track record of setting SLOs, building dashboards (Prometheus/Grafana, OpenTelemetry, etc.) and tuning alerts.
- Comfort with Kubernetes, IaC and cloud-native patterns; can debug from network to application layer.
- Start-up bias for action: you prioritise high-leverage fixes, ship iteratively and own outcomes end-to-end.
- Collaborative and feedback-driven; you welcome post-mortem culture and continuous improvement.
- Driven by impact - you prioritise work that moves the needle!
- Nice-to-haves: experience with Hasura internals, Cloudflare Workers edge optimisation, or running OpenSearch clusters at scale.
- Highly competitive salary.
- You'll own a piece of what you're building - equity included.
- Hybrid working model with 4 days in our East London office, ideally located between Shoreditch High Street, Old Street, and Liverpool Street stations.
- The opportunity to become one of the earliest employees in one of the UK’s fastest-growing startups.
- Private health insurance.
Seniority level
Seniority level
Mid-Senior level
Employment type
Employment type
Full-time
Job function
Industries
Education, Software Development, and Computer Games
Referrals increase your chances of interviewing at Gizmo by 2x
Get notified about new Site Reliability Engineer jobs in London Area, United Kingdom.
London, England, United Kingdom 2 weeks ago
London, England, United Kingdom 2 weeks ago
London, England, United Kingdom 2 months ago
Colchester, England, United Kingdom 2 weeks ago
Isleworth, England, United Kingdom 2 weeks ago
London, England, United Kingdom 1 day ago
London, England, United Kingdom 1 month ago
London, England, United Kingdom 1 day ago
Site Reliability Engineer at High Growth B2C Startup
London, England, United Kingdom 1 week ago
London, England, United Kingdom 1 day ago
Site Reliability Engineer, ML Infrastructure, Large Models SRE
London, England, United Kingdom 1 week ago
London, England, United Kingdom 6 days ago
London, England, United Kingdom 2 weeks ago
London, England, United Kingdom 2 months ago
Greater London, England, United Kingdom 6 days ago
London, England, United Kingdom 2 weeks ago
London, England, United Kingdom 1 week ago
Greater London, England, United Kingdom 3 days ago
South Croydon, England, United Kingdom 5 days ago
Tottenham, England, United Kingdom 1 month ago
London, England, United Kingdom 4 days ago
We’re unlocking community knowledge in a new way. Experts add insights directly into each article, started with the help of AI.
#J-18808-Ljbffr- Location:
- London, England, United Kingdom
- Salary:
- £150,000 - £200,000
- Job Type:
- FullTime
- Category:
- Engineering
We found some similar jobs based on your search
-
New Yesterday
Azure - Site Reliability Engineer
-
Northampton
- Engineering
Job Description GTIS Public Cloud Engineering is a global team of circa 100 colleagues based in the UK, India, and the US. We are accountable for strategic engineering and delivery of Public Cloud services within Enterprise Technology. Our team ...
More Details -
-
New Yesterday
Azure - Site Reliability Engineer
-
Chester
- Engineering
Job Description GTIS Public Cloud Engineering is a global team of circa 100 colleagues based in the UK, India, and the US. We are accountable for strategic engineering and delivery of Public Cloud services within Enterprise Technology. Our team ...
More Details -
-
New Yesterday
Site Reliability Engineer II
-
Belfast, Northern Ireland, United Kingdom
-
£100,000 - £125,000
- Engineering
CME Group Belfast, Northern Ireland, United Kingdom Join or sign in to find your next job Join to apply for the Site Reliability Engineer II role at CME Group CME Group Belfast, Northern Ireland, United Kingdom 1 day ago Be among the first 25 ap...
More Details -
-
New Yesterday
Junior Site Reliability Engineer
-
London, England, United Kingdom
-
£150,000 - £200,000
- Engineering
Social network you want to login/join with: Junior Site Reliability Engineer, London col-narrow-left Client: Cutover Location: London, United Kingdom Job Category: Other - EU work permit required: Yes col-narrow-right Job Reference: 1d2bc5d8da1...
More Details -
-
New Yesterday
Senior Site Reliability Engineer - Databases (Remote, UK)
-
United Kingdom
-
£80,000 - £100,000
- Engineering
Social network you want to login/join with: Senior Site Reliability Engineer - Databases (Remote, UK), United Kingdom (Remote) col-narrow-left Client: Location: Job Category: Other - EU work permit required: Yes col-narrow-right Job Reference: ...
More Details -
-
New Yesterday
Senior Site Reliability Engineer- Central Platforms
-
United Kingdom
-
£80,000 - £100,000
- Engineering
Social network you want to login/join with: Senior Site Reliability Engineer- Central Platforms col-narrow-left Client: SS&C Technologies Holdings Location: United Kingdom Job Category: Other - EU work permit required: Yes col-narrow-right Job R...
More Details -