Principal Site Reliability Engineer
Overview
Founded in 2012, Playson is a leading iGaming supplier recognized worldwide. We provide our customers with a high-end, microservice-based platform as a service designed to process billions of financial transactions per day. Our setup is cross-regional, and we are working to push latency as close to zero as possible. We invest heavily in delivering the best game experience and a smooth connection, regardless of the internet coverage and bandwidth available to game clients.
Responsibilities
- Manage day-to-day alerts, system checks, and issue escalation as necessary.
- Provide 24x7 on-call support for critical SaaS events.
- Document issues and remediation steps.
- Proactively create monitors within the EKS/K8s ecosystem.
- Deploy to EKS/K8s cluster using Terraform and Helm/Flux.
- Enhance infrastructure health by implementing checks and scripts to address known issues.
- Maintain and develop deployment code.
- Implement/integrate new technologies into our Cloud Infrastructure.
- Collaborate with other teams to provide top-notch support and assistance.
- Prioritize customer focus in planning deployments/updates, ensuring minimal impact.
- Conduct RCA and take necessary corrective actions to prevent issue recurrence.
- Assign alert-related actions to the appropriate team after investigation.
- Handle support requests for environment-specific actions.
Qualifications
- Proficiency in Kubernetes (deployment, scaling, troubleshooting).
- Experience with GitOps and configuration management tools like FluxCD/ArgoCD.
- Strong experience with issue processing (RCA, Postmortems).
- Familiarity with AWS, Terraform, Docker, CI/CD.
- Experience with monitoring tools such as Datadog, Prometheus, and Grafana, and with logging solutions such as the ELK Stack (Elasticsearch, Logstash, Kibana) or AWS CloudWatch.
- Strong understanding of networking concepts and protocols.
- Proficiency in at least one scripting or programming language (e.g., Python, Node.js, Go).
- Proficiency in Git or other version control systems.
- Familiarity with incident response and management tools like PagerDuty, Opsgenie, or VictorOps.
- Ownership, proactiveness, persistence, and passion for maintaining a high-traffic online platform.
Benefits
- Quarterly Bonuses based on transparent and systematic evaluation.
- Flexible Work Schedule.
- Remote Work Option for Enhanced Flexibility.
- Comprehensive Medical Insurance for you and your significant other.
- Financial Support for Life Events.
- Unlimited Paid Vacation.
- Unlimited Paid Sick Leave.
- Reimbursement for professional development courses and training.
If you're ready to embrace ambitious goals and thrive in a dynamic environment, apply now and become part of Playson’s exciting journey in the iGaming world!
Seniority level
- Mid-Senior level
Employment type
- Full-time
Job function
- Engineering and Information Technology
Industries
- Software Development
- Location: London
- Job Type: Full-time
- Category: Engineering