Leave us your email address and we'll send you all the new jobs according to your preferences.

Bioinformatician

Posted 53 minutes 34 seconds ago by Nucleus Capital I. GMBH

Permanent
Full Time
Other
London, United Kingdom
Job Description
What we're looking for

This role will be based in London, with close collaboration across ML, data and scientific product. As part of the Computational Biology team, you will lead on the development of robust, scalable omics workflows that power our discovery platform, with an initial focus on plant pangenomes. You will review and evolve our existing pipelines, define how pangenomes should strengthen discovery projects, and build the computational foundations that connect public and internal omics data to downstream ML and target discovery. Working across data, ML and plant science, you will own pangenome creation and curation, drive innovation in workflows spanning gene expression and variant calling, and contribute to scientific software that is reproducible, efficient and production-ready.

Your first priorities will be to
  • Review our existing omics pipelines, focussing on pangenomes and RNA seq to begin with
  • Define and implement a strategy to improve our discovery projects using pangenomes
Core responsibilities
  • Own pangenome creation and curation
  • Own omics pipeline maintenance & curation quality
  • Drive innovation in our omics pipelines, including pangenomes, gene expression and variant calling feeding into our in house data warehouses for downstream ML and discovery applications
  • Work closely together with the data team to ensure seamless data onboarding and processing
  • Support public data QC and onboarding for discovery projects
Additional Responsibilities
  • Support in house discovery projects
  • Help maintain and drive high standards in our scientific code base
  • Optimize resource utilisation of in house Nextflow workflows
Core competencies
  • Extensive experience building and working with pangenomes in plants
  • Extensive hands on experience with Nextflow and omics pipelines
  • Strong coding proficiency in Python
  • High familiarity with most common omics file formats such as FASTQ, VCF, HAL, GFF
  • Familiarity with Linux environment and common bioinformatics command line tools such as seqkit, bcftools, samtools to query files quickly and effectively
  • Experience with different public omics data repositories, such as NCBI, ENSEMBL, JGI, Solgenomics etc
  • Ability to communicate clearly across disciplines. You will work daily with ML engineers, data engineers, and plant scientists, and need to translate real scientific workflows into robust software
  • Ability to understand requirements of cross functional teams and the ability to synthesise complex information
  • Experience working in scientific, biotech, or high integrity domains where reproducibility and auditability matter
Nice to have competencies
  • Familiarity with nf core design principles
  • Experience with RNA seq mapping for pangenomes
  • Familiarity with version control; using git, GitHub, GitLab, GitBucket or similar
Benefits
  • Competitive salary & equity options
  • 25 days annual leave & option for 2 weeks work from anywhere policy
  • Benefits package
  • Career development opportunities as the company scales
  • Ownership of ambitious, mission driven work with real world impact
  • Vibrant, innovative & supportive work environment with a committed team
  • Access to conferences, events & professional development resources
Email this Job