Platzhalter Bild

AI & Genomics Intern bei Instadeep

Instadeep · Paris, Frankreich · Onsite

Jetzt bewerben
InstaDeep, founded in 2014, is a pioneering AI company at the forefront of innovation. With strategic offices in major cities worldwide, including London, Paris, Berlin, Tunis, Kigali, Cape Town, Boston, and San Francisco, InstaDeep collaborates with giants like Google DeepMind and prestigious educational institutions like MIT, Stanford, Oxford, UCL, and Imperial College London. We are a Google Cloud Partner and a select NVIDIA Elite Service Delivery Partner. We have been listed among notable players in AI, fast-growing companies, and Europe's 1000 fastest-growing companies in 2022 by Statista and the Financial Times. Our recent acquisition by BioNTech has further solidified our commitment to leading the industry.

Join us to be a part of the AI revolution!

We are seeking a motivated AI & Genomics Intern to contribute to an applied research project at the intersection of machine learning and genomics. The project will focus on designing and evaluating methods that combine genomic and phenotypic data to improve predictive modeling, benchmarking, and variant interpretation.


We are seeking a motivated AI & Genomics Intern to contribute to an applied research project at the intersection of machine learning and genomics. The project will focus on designing and evaluating methods that combine genomic and phenotypic data to improve predictive modeling, benchmarking, and variant interpretation.

Responsibilities
  • Develop a reproducible pipeline for genomic data ingestion, cleaning, and quality control.
  • Collect, organize, and document datasets from public or internal sources.
  • Benchmark machine learning and deep learning models using JAX and PyTorch.
  • Integrate multi-modal features such as genomic variants, phenotype measurements, and metadata.
  • Evaluate models for performance, interpretability, and generalization across datasets.
  • Deliver clean code, reproducible pipelines, a technical report, and a final presentation.


  • Requirements
  • Strong programming skills in Python (pandas, numpy, scikit-learn).
  • Practical experience with deep learning frameworks (JAX and PyTorch).
  • Good understanding of genomics databases and handling large biological datasets.
  • Knowledge of genomics handling basics: FASTQ/BAM/VCF formats, SNPs, haplotypes.
  • Ability to work with autonomy, while communicating results effectively.
  • Good coding practices (Git, testing).


  • Preferred Qualifications
  • Experience with bioinformatics tools (samtools, bcftools).
  • Familiarity with workflow frameworks (Snakemake, Nextflow).
  • Statistical genetics (GWAS, mixed models, GBLUP).
  • Experience with multi-modal biological datasets.


  • What You Will Learn
  • Applying state-of-the-art AI frameworks (JAX, PyTorch) to genomic data.
  • Building end-to-end pipelines for reproducible data processing
  • Designing benchmarks to compare models and evaluate robustness.
  • Collecting and curating large-scale datasets for real-world genomics applications.


  • *Please submit your CV/Resume in English*
    Duration: 6 Months internship

    Our commitment to our people
    We empower individuals to celebrate their uniqueness here at InstaDeep. Our team comes from all walks of life, and we’re proud to continue encouraging and supporting applicants from underrepresented groups across the globe. Our commitment to creating an authentic environment comes from our ability to learn and grow from our diversity, and how better to experience this than by joining our team? We operate on a hybrid work model with guidance to work at the office 3 days per week to encourage close collaboration and innovation. We are continuing to review the situation with the well-being of InstaDeepers at the forefront of our minds.

    Right to work: Please note that you will require the legal right to work without visa sponsorship in the location you are applying for. We do not sponsor work visas.
    Jetzt bewerben

    Weitere Jobs