Data Engineer - GSK0JP00107167

New Today

OverviewData EngineerPay: £887.63Term: 6 months (potential extension)Location: Kings Cross, or Stevenage (depending on preference)Our pharmaceutical client see a world in which advanced applications of Machine Learning and AI will allow them to develop novel therapies to existing diseases and to quickly respond to emerging or changing diseases with personalized drugs, driving better outcomes at reduced cost with fewer side effects. It is an ambitious vision that will require the development of products and solutions at the cutting edge of Machine Learning and AI. They\'re looking for a highly skilled data engineer to help us make this vision a reality.Strong candidates will have a track record of shipping data products derived from complex sources, responsible for the process from conceptual data pipelines to production scale. We have a commitment to quality, so successful candidates will be able to use modern cloud tooling and techniques to deliver reliable data pipelines and continuously improve them.This role requires a passion for solving challenging problems aligned to exciting Artificial Intelligence and Machine Learning applications. Educational or professional background in the biological sciences is a plus but is not necessary; passion to help therapies for new and existing diseases, and a pattern of continuous learning and development is mandatory.ResponsibilitiesBuild data pipelines using modern data engineering tools on Google Cloud: Python, Spark, SQL, BigQuery, Cloud StorageEnsure data pipelines meet the specific scientific needs of data consuming applicationsResponsible for high quality software implementations according to best practices, including automated test suites and documentationDevelop, measure, and monitor key metrics for all tools and services and consistently seek to iterate on and improve themParticipate in code reviews, continuously improving personal standards as well as the wider team and productLiaise with other technical staff and data engineers in the team and across allied teams, to build an end-to-end pipeline consuming other data productsBasic qualifications2+ years of data engineering experience with a Bachelors degree in a relevant field (including computational, numerate or life sciences), or equivalent experienceCloud experience (e.g. Google Cloud preferred)Strong skills with industry experience in Python and SQLUnit testing experience (e.g. pytest)Knowledge of agile practices and able to perform in agile software development environmentsStrong experience with modern software development tools / ways of working (e.g. git/GitHub, DevOps tools for deployment)Preferred qualificationsDemonstrated experience with biological or scientific data (e.g. genomics, transcriptomics, proteomics), or pharmaceutical industry experienceBioinformatics expertise, familiarity with large scale bioinformatics datasetsExperience using Nextflow pipelinesKnowledge of NLP techniques and experience of processing unstructured data, using vector stores, and approximate retrievalFamiliarity with orchestration tooling (e.g. Airflow or Google Workflows)Experience with AI/ML powered applicationsExperience with Docker or containerized applications #J-18808-Ljbffr
Location:
City Of London, England, United Kingdom
Job Type:
FullTime

We found some similar jobs based on your search