InstaDeep, founded in 2014, is a pioneering AI company at the forefront of innovation. With strategic offices in major cities worldwide, including London, Paris, Berlin, Tunis, Kigali, Cape Town, Boston, and San Francisco, InstaDeep collaborates with giants like Google DeepMind and prestigious educational institutions like MIT, Stanford, Oxford, UCL, and Imperial College London. We are a Google Cloud Partner and a select NVIDIA Elite Service Delivery Partner. We have been listed among notable players in AI, fast-growing companies, and Europe's 1000 fastest-growing companies in 2022 by Statista and the Financial Times. Our recent acquisition by BioNTech has further solidified our commitment to leading the industry.
Join us to be a part of the AI revolution!
We are seeking a motivated AI & Genomics Intern to contribute to an applied research project at the intersection of machine learning and genomics. The project will focus on designing and evaluating methods that combine genomic and phenotypic data to improve predictive modeling, benchmarking, and variant interpretation.
We are seeking a motivated AI & Genomics Intern to contribute to an applied research project at the intersection of machine learning and genomics. The project will focus on designing and evaluating methods that combine genomic and phenotypic data to improve predictive modeling, benchmarking, and variant interpretation.
Responsibilities
Develop a reproducible pipeline for genomic data ingestion, cleaning, and quality control.
Collect, organize, and document datasets from public or internal sources.
Benchmark machine learning and deep learning models using JAX and PyTorch.
Integrate multi-modal features such as genomic variants, phenotype measurements, and metadata.
Evaluate models for performance, interpretability, and generalization across datasets.
Deliver clean code, reproducible pipelines, a technical report, and a final presentation.
Requirements
Strong programming skills in Python (pandas, numpy, scikit-learn).
Practical experience with deep learning frameworks (JAX and PyTorch).
Good understanding of genomics databases and handling large biological datasets.
Knowledge of genomics handling basics: FASTQ/BAM/VCF formats, SNPs, haplotypes.
Ability to work with autonomy, while communicating results effectively.
Good coding practices (Git, testing).
Preferred Qualifications
Experience with bioinformatics tools (samtools, bcftools).
Familiarity with workflow frameworks (Snakemake, Nextflow).
Statistical genetics (GWAS, mixed models, GBLUP).
Experience with multi-modal biological datasets.
What You Will Learn
Applying state-of-the-art AI frameworks (JAX, PyTorch) to genomic data.
Building end-to-end pipelines for reproducible data processing
Designing benchmarks to compare models and evaluate robustness.
Collecting and curating large-scale datasets for real-world genomics applications.
*Please submit your CV/Resume in English*
Duration: 6 Months internship
Our commitment to our people
We empower individuals to celebrate their uniqueness here at InstaDeep. Our team comes from all walks of life, and we’re proud to continue encouraging and supporting applicants from underrepresented groups across the globe. Our commitment to creating an authentic environment comes from our ability to learn and grow from our diversity, and how better to experience this than by joining our team? We operate on a hybrid work model with guidance to work at the office 3 days per week to encourage close collaboration and innovation. We are continuing to review the situation with the well-being of InstaDeepers at the forefront of our minds.
Right to work: Please note that you will require the legal right to work without visa sponsorship in the location you are applying for. We do not sponsor work visas.
Estas cookies son necesarias para que el sitio web funcione y no se pueden desactivar en nuestros sistemas. Puede configurar su navegador para bloquear estas cookies, pero entonces algunas partes del sitio web podrían no funcionar.
Seguridad
Experiencia de usuario
Cookies orientadas al público objetivo
Estas cookies son instaladas a través de nuestro sitio web por nuestros socios publicitarios. Estas empresas pueden utilizarlas para elaborar un perfil de sus intereses y mostrarle publicidad relevante en otros lugares.
Google Analytics
Anuncios Google
Utilizamos cookies
🍪
Nuestro sitio web utiliza cookies y tecnologías similares para personalizar el contenido, optimizar la experiencia del usuario e indvidualizar y evaluar la publicidad. Al hacer clic en Aceptar o activar una opción en la configuración de cookies, usted acepta esto.
Los mejores empleos remotos por correo electrónico
¡Únete a más de 5.000 personas que reciben alertas semanales con empleos remotos!