Machine Learning Infrastructure Engineer

New Yesterday

Machine Learning Infrastructure Engineer

This role is based in London with a base pay up to £150000. Actual pay based on skills and experience.

Overview

A cutting-edge AI start-up is pioneering the development of frontier 3D foundation models, pushing the boundaries of computer vision and spatial computing. Their mission is to redefine how industries such as robotics, AR/VR, gaming, and film generate and interact with 3D content. They are seeking an ML & Cloud Infrastructure Engineer to join their growing team. This is a unique opportunity to work at the forefront of AI innovation, building the infrastructure that underpins complex ML workloads and production systems. You’ll play a central role in scaling the company’s platforms and ensuring their pioneering technology reaches its full potential.

Responsibilities

  • Develop and maintain scalable, high-performance cloud-based infrastructure for ML workloads and API deployment.
  • Manage and optimize cloud platforms (AWS, Azure, GCP) and set up ML nodes for local and distributed training.
  • Install, configure, and monitor servers, ensuring system reliability.
  • Design and optimize storage solutions for large-scale ML datasets.
  • Manage containerized applications with Docker, Kubernetes, Terraform, and related tools.
  • Collaborate with ML engineers and researchers to ensure seamless orchestration of training and production environments.
  • Troubleshoot and respond to cloud/production incidents, implementing long-term solutions.

What We’re Looking For

  • At least 3 years of professional experience in a cloud-related engineering role (ML-related experience highly desirable).
  • Proven expertise in at least one major cloud platform (AWS, GCP, or Azure).
  • Experience with containerization and orchestration (Docker, Kubernetes).
  • Ability to manage and optimize large-scale cloud infrastructure.
  • Familiarity with Python (Jupyter) and ML frameworks (e.g., PyTorch).
  • Experience with cloud monitoring tools (Prometheus, Grafana).
  • Exposure to cloud-based databases (RDS, Aurora, Spanner, etc.) and data-visualisation tools.
  • Knowledge of CI/CD tools (e.g., CircleCI).

Details

  • Seniority level: Mid-Senior level
  • Employment type: Full-time
  • Job function: Engineering and Science
  • Industries: Technology, Information and Media

Get notified about new Infrastructure Engineer jobs in London Area, United Kingdom.

#J-18808-Ljbffr
Location:
United Kingdom
Job Type:
FullTime
Category:
Engineering