Data Engineer

New Yesterday

Overview

GlaxoSmithKline (GSK) is a science-led global healthcare company dedicated to helping people do more, feel better, and live longer. We are advancing vaccines and medicines through cutting-edge science, technology, and data, with a focus on collaborative R&D to pre-empt and defeat diseases. Join us in uniting science, technology, and talent to get ahead of disease together.

Position Summary

We are seeking a highly skilled Backend Engineer to contribute to our vision of using advanced applications of Machine Learning and AI to develop novel therapies and enable rapid responses to emerging diseases with personalized drugs. The ideal candidate will have a track record of shipping data products derived from complex sources and will own the process from conceptual data pipelines to production-scale deployment. You will work with modern cloud tooling to deliver reliable data pipelines and continuously improve them.

This role requires a passion for solving challenging problems aligned to Artificial Intelligence and Machine Learning applications. An educational or professional background in the biological sciences is a plus but not required; a demonstrated passion to help therapies for new and existing diseases and a pattern of continuous learning are mandatory.

Responsibilities

  • Build data pipelines using modern data engineering tools on Google Cloud: Python, Spark, SQL, BigQuery, Cloud Storage.
  • Ensure data pipelines meet the specific scientific needs of data consuming applications.
  • Deliver high-quality software implementations according to best practices, including automated test suites and documentation.
  • Develop, measure, and monitor key metrics for all tools and services and iterate to improve them.
  • Participate in code reviews and raise the standard of the team and product.
  • Liaise with other technical staff and data engineers to build an end-to-end pipeline consuming other data products.
  • 2+ years of data engineering experience with a relevant Bachelor’s degree or equivalent experience.
  • Cloud experience (Google Cloud preferred).
  • Strong skills in Python and SQL.
  • Unit testing experience (e.g., pytest).
  • Knowledge of agile practices and ability to work in agile software development environments.
  • Experience with modern software development tools (e.g., Git/GitHub, DevOps deployment tools).

PreferredQualifications

  • Experience with biological or scientific data (e.g., genomics, transcriptomics, proteomics) or pharmaceutical industry experience.
  • Bioinformatics expertise and familiarity with large-scale bioinformatics datasets.
  • Experience using Nextflow pipelines.
  • Knowledge of NLP techniques and processing unstructured data, including vector stores and approximate retrieval.
  • Familiarity with orchestration tooling (e.g., Airflow or Google Workflows).
  • Experience with AI/ML-powered applications.
  • Experience with Docker or containerized applications.

Why GSK?

Uniting science, technology and talent to get ahead of disease together.

GSK is a global biopharma company focusing on vaccines, specialty and general medicines, investing in four core therapeutic areas and leveraging new platform and data technologies to improve health outcomes.

Our success depends on our people. We strive to create a workplace where everyone can thrive, feel welcomed, valued, and included, with opportunities for growth and wellbeing.

Inclusion at GSK

GSK is an Equal Opportunity/Affirmative Action Employer. All qualified applicants will receive equal consideration for employment without regard to race, color, national origin, religion, sex, pregnancy, marital status, sexual orientation, gender identity/expression, age, disability, genetic information, military service, or any other protected status.

If you need adjustments in the recruitment process, please contact our Recruitment team (EMEA-GSKLink@allegisglobalsolutions.com) to discuss today.

#J-18808-Ljbffr
Location:
City Of London
Job Type:
FullTime
Category:
IT & Technology

We found some similar jobs based on your search