Hybrid ML Engineer at Fundamental Research Labs
Fundamental Research Labs · Menlo Park, United States of America · Hybrid
- Professional
- Office in Menlo Park
About the Role
As a Machine Learning Engineer, you’ll push the frontier of post-training for large language models, from supervised fine-tuning to advanced reinforcement learning, in order to train powerful agents capable of complex, end-to-end reasoning and action.
You’ll work side-by-side with our researchers to rapidly iterate on alignment strategies, implement state-of-the-art training algorithms, and run large-scale experiments that directly improve our production models.
If you love making models smarter, safer, and more capable, and want to work at the cutting edge of LLM agent training, this role is for you.
Responsibilities
Implement and optimize SFT, DPO, GRPO, PPO, and other RL approaches for large-scale models
Develop reward models and evaluation pipelines for agentic LLM behavior
Train and refine end-to-end reinforcement learning agents capable of multi-step reasoning and tool use
Integrate experimental training results into production-facing agent architectures
Collaborate with researchers to design, run, and analyze large-scale training experiments
Optimize training efficiency
Qualifications
Strong background in machine learning and reinforcement learning
Proficiency with PyTorch
Hands-on experience implementing post-training pipelines for large models
Strong software engineering fundamentals in Python
Bonus
Experience training agentic LLMs with end-to-end reinforcement learning
Prior research experience
What makes us interesting
Small, elite team of ex-founders, researchers from top AI labs, top CS grads, and engineers from top companies
True ownership: you will not be blocked by bureaucracy, and you will ship meaningful work within weeks rather than months
Serious momentum: we're well funded by top investors, moving fast, and focused on execution
What we do
Ship consumer products powered by cutting-edge AI research
Build infrastructure that supports both research and product
Pursue cutting-edge research that will open up new forms of consumer product
The Details
Full-time, onsite role in Menlo Park
Startup hours apply
Generous salary, with additional benefits to be discussed during the hiring process