Hybrid ML Engineer at Fundamental Research Labs

Fundamental Research Labs · Menlo Park, United States of America · Hybrid

About the Role

As a Machine Learning Engineer, you’ll push the frontier of post-training for large language models, from supervised fine-tuning to advanced reinforcement learning, to train powerful agents capable of complex, end-to-end reasoning and action.

You’ll work side-by-side with our researchers to rapidly iterate on alignment strategies, implement state-of-the-art training algorithms, and run large-scale experiments that directly improve our production models.

If you love making models smarter, safer, and more capable, and want to work at the cutting edge of LLM agent training, this role is for you.

Responsibilities

  • Implement and optimize SFT, DPO, GRPO, PPO, and other RL approaches for large-scale models

  • Develop reward models and evaluation pipelines for agentic LLM behavior

  • Train and refine end-to-end reinforcement learning agents capable of multi-step reasoning and tool use

  • Integrate experimental training results into production-facing agent architectures

  • Collaborate with researchers to design, run, and analyze large-scale training experiments

  • Optimize training efficiency

Qualifications

  • Strong background in machine learning and reinforcement learning

  • Proficiency with PyTorch

  • Hands-on experience implementing post-training pipelines for large models

  • Strong software engineering fundamentals in Python

Bonus

  • Experience training agentic LLMs with end-to-end reinforcement learning

  • Prior research experience

What makes us interesting

  • Small, elite team of ex-founders, researchers from top AI labs, top CS grads, and engineers from top companies

  • True ownership: you will not be blocked by bureaucracy, and you will ship meaningful work within weeks rather than months

  • Serious momentum: we're well funded by top investors, moving fast, and focused on execution

What we do

  • Ship consumer products powered by cutting-edge AI research

  • Build infrastructure that facilitates research and product

  • Pursue cutting-edge research that will open up new forms of consumer products

The Details

  • Full-time, onsite role in Menlo Park

  • Startup hours apply

  • Generous salary, with additional benefits to be discussed during the hiring process
