Senior, Software Engineer - ML Frameworks presso Torc Robotics
Torc Robotics · Ann Arbor, Stati Uniti d'America · Onsite
- Senior
- Ufficio in Ann Arbor
About the Company
At Torc, we have always believed that autonomous vehicle technology will transform how we travel, move freight, and do business.
A leader in autonomous driving since 2007, Torc has spent over a decade commercializing our solutions with experienced partners. Now a part of the Daimler family, we are focused solely on developing software for automated trucks to transform how the world moves freight.
Join us and catapult your career with the company that helped pioneer autonomous technology, and the first AV software company with the vision to partner directly with a truck manufacturer.
Meet the Team
Torc's virtual driver software utilizes cutting-edge deep learning techniques to perceive the vehicle's environment, predict the movements of other vehicles, and execute accurate driving decisions. The Machine Learning Frameworks Team builds the core infrastructure that powers this innovation — enabling scalable training, evaluation, and deployment of our models.
We are seeking a Senior Software Engineer to lead the design and development of our machine learning training and data infrastructure. This role is an opportunity to take ownership of foundational components of Torc’s ML platform, mentor other engineers, and drive architectural decisions that directly impact the future of autonomous trucking.
What You’ll Do
Lead the design and development of distributed training frameworks built on Ray and PyTorch Lightning, enabling scalable model training across large datasets and multi-GPU/multi-node clusters.
Architect and implement high-performance pipelines for data ingestion, transformation, and delivery to ML training and evaluation workflows.
Own and evolve shared ML libraries that serve as the foundation for all ML development at Torc.
Collaborate closely with research, perception, and planning teams to understand requirements and translate them into reusable infrastructure solutions.
Improve developer productivity by building robust internal tools, APIs, and automation for ML Ops.
Optimize system performance, GPU/CPU utilization, and cloud resource efficiency.
Champion best practices in software engineering, cloud-native ML infrastructure, and distributed systems.
Provide technical mentorship and code reviews for junior and mid-level engineers.
Participate in the on-call rotation to ensure the reliability of ML infrastructure services.
Drive adoption of new technologies, evaluate trade-offs, and align technical decisions with long-term autonomy goals.
What We’re Looking For
Bachelor’s degree in Computer Science, Data Science, Artificial Intelligence, or related field with 5+ years of professional software engineering experience.
Strong proficiency in Python, with a deep understanding of software engineering best practices (testing, CI/CD, version control, code reviews, agile workflows).
Demonstrated experience with ML frameworks such as PyTorch or TensorFlow; hands-on experience with PyTorch Lightning is a plus.
Solid experience with distributed computing and/or frameworks like Ray or Spark.
Strong knowledge of cloud services (AWS preferred), containerization (Docker), and orchestration (Kubernetes).
Proven ability to design scalable ML training pipelines and APIs.
Track record of leading projects, influencing cross-functional teams, and mentoring other engineers.
Excellent communication and collaboration skills with the ability to align technical solutions to business needs.
Bonus Points!
Hands-on experience with ML Ops workflows: dataset management, model registries, automated training/evaluation pipelines.
Expertise in GPU programming, CUDA optimization, or performance tuning for ML workloads.
Familiarity with large-scale data storage solutions (e.g., S3, Lustre, feature stores).
Experience in the autonomous vehicle, robotics, or real-time systems space.
Contributions to open-source ML infrastructure frameworks.
Why Join Us?
This role is a chance to be a technical leader in Torc’s ML infrastructure stack, directly enabling the scalability and reliability of our AI systems for autonomous trucking. You’ll shape the next generation of distributed training systems, mentor other engineers, and see your work power real-world autonomy.
Perks of Being a Full-time Torc’r
Torc cares about our team members and we strive to provide benefits and resources to support their health, work/life balance, and future. Our culture is collaborative, energetic, and team focused. Torc offers:
- A competitive compensation package that includes a bonus component and stock options
- 100% paid medical, dental, and vision premiums for full-time employees
- 401K plan with a 6% employer match
- Flexibility in schedule and generous paid vacation (available immediately after start date)
- AD+D and Life Insurance
At Torc, we’re committed to building a diverse and inclusive workplace. We celebrate the uniqueness of our Torc’rs and do not discriminate based on race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, veteran status, or disabilities.
Even if you don’t meet 100% of the qualifications listed for this opportunity, we encourage you to apply.
Job ID: 102382