Observability Engineer
New Yesterday
The ideal candidate will be passionate about improving access to infrastructure performance, automating operational intelligence, and reducing mean time to resolution (MTTR) through intelligent alerting and root cause analysis. Knowledge Skills and Abilities, Key Responsibilities:
Own and evolve the enterprise observability strategy across all infrastructure tracks
Design, implement, and support event management and impact analysis workflows using platforms such as BMC Helix Operations Manager
Integrate and correlate data from multiple sources (e.g., 20+ monitoring systems) into a unified monitoring and alerting framework.
Apply AIOps principles to reduce alert noise, detect anomalies, and predict/prevent potential outages
Collaborate with infrastructure, application, and service desk teams to define meaningful service-level metrics and dashboards
Maintain and extend the configuration of monitoring tools, event enrichment, suppression rules, and correlation logic
Develop and support automation for observability platform configuration using Infrastructure as Code
Define best practices for monitoring new platforms and services in collaboration with engineering and operations teams
Support the integration of observability data with ITSM platforms (e.g., Ivanti Neurons ITSM) to streamline incident and change processes
Ensure observability platforms are reliable, secure, well-documented, and continuously aligned with business requirements
Knowledge, Skills and Abilities
Specialist Knowledge:
Demonstrable experience in observability engineering, infrastructure monitoring, or event management roles
Experience with traditional and modern observability stacks such as SCOM, Solarwinds, Prometheus, Grafana and Elastic Stack (ELK)
Hands-on experience with BMC Helix Operations Manager, TrueSight, or similar enterprise monitoring platforms
Solid understanding of AIOps concepts, including event correlation, noise reduction, anomaly detection, and root cause analysis
Strong proficiency with scripting (e.g., Python, PowerShell, Bash) for automation and data handling
Solid understanding of networking fundamentals
Excellent problem-solving skills with the ability to diagnose complex issues using observability tools and logs
Exposure to cloud-native monitoring for platforms such as Azure Monitor, AWS CloudWatch, or Google Cloud Operations
Experience with implementing self-healing alerts/systems based on tools such as VMWare VCF Operations, Syslog Splunk and VMWare Loginsight
Proficiency with observability of Kubernetes clusters
Educational Background:
Bachelor’s degree in computer science; information technology or a related field.
Professional Experience:
Minimum of 3 years of experience in Infrastructure Observability Engineering.
Competencies
Problem-solving
Ability to improve business processes
Able to use initiative
Strategic planning
Key Relationships
Outsourced Event & Impact Management Team
Outsourced Monitoring Administration Teams
Engineering Teams (Platform, Windows, Networks, SQL Server & Oracle)
Vendor management
Change, Incident & Problem Manager
Outsourced IT management
Department
Trafigura Group IT provides shared services across the Trafigura group of companies, offering services at scale where it makes economic sense.
Reporting Structure
The engineer will report to the Platform Architect and will join a team of six other engineers who work in a collaborative team covering the Storage, Linux and Virtualisation towers.
Equal Opportunity Employer
We are an Equal Opportunity Employer and take pride in a diverse workforce. We do not discriminate in recruitment, hiring, training, promotion or other employment practices for reasons of race, colour, religion, gender, sexual orientation, national origin, age, marital or veteran status, medical condition or handicap, disability, or any other legally protected status.
#J-18808-Ljbffr- Location:
- London, England, United Kingdom
- Salary:
- £125,000 - £150,000
- Category:
- IT & Technology
We found some similar jobs based on your search
-
New Yesterday
Splunk ITSI Expert / Observability Engineer (Level 4)
-
London
-
£300 - £380 /day
- Engineering
We are seeking a highly experienced Splunk ITSI Expert with 10+ years in observability to enhance our monitoring and analytics capabilities. Key Responsibilities: Design and implement advanced monitoring strategies using Splunk IT Service Intellige...
More Details -
-
New Yesterday
Observability and Automation Engineer
-
Belfast, Northern Ireland, United Kingdom
-
£80,000 - £100,000
- Engineering
Social network you want to login/join with: Observability and Automation Engineer, Belfast col-narrow-left Client: Location: Belfast, United Kingdom Job Category: Other - EU work permit required: Yes col-narrow-right Job Reference: dd40b2ad4baf...
More Details -
-
New Yesterday
Senior Platform Engineer, Observability
-
London, England, United Kingdom
-
£125,000 - £150,000
- IT & Technology
At Forter, you’ll have the chance to make a direct impact on the developer experience across the company while working with cutting-edge observability technologies and practices. We value innovation , collaboration , and continuous improvement ,...
More Details -
-
New Yesterday
Software Engineer, Observability New York
-
London, England, United Kingdom
-
£150,000 - £200,000
- IT & Technology
About Vercel: Vercel gives developers the tools and cloud infrastructure to build, scale, and secure a faster, more personalized web. As the team behind v0, Next.js, and AI SDK, Vercel helps customers like Ramp, Supreme, PayPal, and Under Armour bu...
More Details -
-
1 Days Old
BOMS Monitoring and Observability Engineer
-
Shropshire
Role Overview As a BOMS Monitoring Engineer, you will work within the Business Outcomes & Monitoring Solutions (BOMS) team-a multi-client centre of excellence delivering operational monitoring capabilities and tooling solutions that drive business in...
More Details -
-
2 Days Old
Observability Engineer (Network Operations)
-
Belfast
- Engineering | Technician
Our client, a global leader in electronic trading solutions, is seeking an Observability Engineer to join their Production Support team. This role is pivotal in ensuring the stability, performance, and reliability of high-availability trading platfor...
More Details -