InstaDeep, founded in 2014, is a pioneering AI company at the forefront of innovation. With strategic offices in major cities worldwide, including London, Paris, Berlin, Tunis, Kigali, Cape Town, Boston, and San Francisco, InstaDeep collaborates with giants like Google DeepMind and prestigious educational institutions like MIT, Stanford, Oxford, UCL, and Imperial College London. We are a Google Cloud Partner and a select NVIDIA Elite Service Delivery Partner. We have been listed among notable players in AI, fast-growing companies, and Europe's 1000 fastest-growing companies in 2022 by Statista and the Financial Times. Our recent acquisition by BioNTech has further solidified our commitment to leading the industry.
Join us to be a part of the AI revolution!
We are seeking a motivated AI & Genomics Intern to contribute to an applied research project at the intersection of machine learning and genomics. The project will focus on designing and evaluating methods that combine genomic and phenotypic data to improve predictive modeling, benchmarking, and variant interpretation.
Responsibilities
Develop a reproducible pipeline for genomic data ingestion, cleaning, and quality control.
Collect, organize, and document datasets from public or internal sources.
Benchmark machine learning and deep learning models using JAX and PyTorch.
Integrate multi-modal features such as genomic variants, phenotype measurements, and metadata.
Evaluate models for performance, interpretability, and generalization across datasets.
Deliver clean code, reproducible pipelines, a technical report, and a final presentation.
Requirements
Strong programming skills in Python (pandas, numpy, scikit-learn).
Practical experience with deep learning frameworks (JAX and PyTorch).
Good understanding of genomics databases and handling large biological datasets.
Knowledge of the basics of genomic data handling: FASTQ/BAM/VCF formats, SNPs, haplotypes.
Ability to work autonomously while communicating results effectively.
Good coding practices (Git, testing).
Preferred Qualifications
Experience with bioinformatics tools (samtools, bcftools).
Familiarity with workflow frameworks (Snakemake, Nextflow).
Knowledge of statistical genetics (GWAS, mixed models, GBLUP).
Experience with multi-modal biological datasets.
What You Will Learn
Applying state-of-the-art AI frameworks (JAX, PyTorch) to genomic data.
Building end-to-end pipelines for reproducible data processing.
Designing benchmarks to compare models and evaluate robustness.
Collecting and curating large-scale datasets for real-world genomics applications.
*Please submit your CV/Resume in English*
Duration: 6-month internship
Our commitment to our people
We empower individuals to celebrate their uniqueness here at InstaDeep. Our team comes from all walks of life, and we’re proud to continue encouraging and supporting applicants from underrepresented groups across the globe. Our commitment to creating an authentic environment comes from our ability to learn and grow from our diversity, and how better to experience this than by joining our team? We operate on a hybrid work model with guidance to work at the office 3 days per week to encourage close collaboration and innovation. We are continuing to review the situation with the well-being of InstaDeepers at the forefront of our minds.
Right to work: Please note that you will require the legal right to work without visa sponsorship in the location you are applying for. We do not sponsor work visas.