
Senior Technical Program Manager, AI and ML Software
Nvidia · Santa Clara, United States Of America · Hybrid
10000 Remote & work from home jobs online
Nvidia · Santa Clara, United States Of America · Hybrid
Motorola Solutions · Washington, United States Of America · Remote
Scj · Racine, United States Of America · Hybrid
HP · Palo Alto, United States Of America · Onsite
Unilever · Hoboken, United States Of America · Onsite
Nyp · New York City, United States Of America · Onsite
Trumed · Kansas City, United States Of America · Onsite
University System of New Hampshire · Concord, United States Of America · Onsite
Penn State · University Park, United States Of America · Onsite
Penn State University · University Park, United States Of America · Onsite
Ameriprise · Minneapolis, United States Of America · Hybrid
GOODWIN UNIVERSITY EDUCATIONAL SERVICES INC · East Harford, United States Of America · Onsite
Ulster Savings · Kingston, United States Of America · Onsite
GORUCK · Jacksonville Beach, United States Of America · Onsite
Nvidia · Santa Clara, United States Of America · Hybrid
Hardware Infrastructure is seeking a Senior Technical Program Manager to own the strategy and execution of programs to support the bringup, operations and automation of GPU infrastructure. The GPU infrastructure we build and operate enables NVIDIAs most sophisticated AI and hardware researchers and engineers to invent the future of computing. This is a fast paced and evolving landscape that requires a senior TPM leader to guide engineering roadmaps to be delivered with high quality outcomes and a strong foundation of operational perfection. They will partner both internally within Hardware Infrastructure and externally with senior management and partner teams to scale the clusters operations charter. They will develop and standardize planning, reporting and execution methodologies and metrics to enable meeting the challenging objectives.
What You'll Be Doing:
Engage with cross-company partners to compose the technical strategy, build programs and coordinate execution to meet key business objectives that support scaling bringups to be seamless, fast and efficient
Nurture a culture of continuous improvement, finding new opportunities across tooling, automation and processes to scale cluster operations and management
Guide a diverse set of engineering efforts in an agile program methodology across planning, prioritization, design, dependency management, implementation and execution.
Bring data first approach to programs (metrics, OKRs, KPIs) to effectively measure program success and for identifying areas of improvement
Create effective communication channels to provide varying audience levels insights into program status, risks and opportunities.
Act as an effective technical and non-technical liaison between developers, customers and partners to drive organization alignment across a multi-functional matrixed set of leads
What We Need To See:
B.S. (or equivalent experience) in Computer Science or a related technical difficulty
10+ years of experience across software engineering and/or technical program management roles with demonstrated expertise and mastery of technical and management practices
Showed skill in infrastructure software, production application software development and large scale distributed computing
Experience leading large scale HPC and/or AI Infrastructure deployments that stretch across hardware and software
Outstanding communication and presentation abilities suited for a wide range of technical and non-technical viewers
Strong multitasking abilities with a focus on thoroughness and rapid context switching
Knowledge of agile methodologies and the best in class project management tools
Proactive and enthusiastic in identifying and implementing positive changes in software engineering and release management within a fast-paced environment
Ways To Stand Out From The Crowd:
Prior experience bringing up new datacenter capacity across cloud service providers and on-premise locations
Prior experience migrating platforms and solutions from on prem to cloud
Background in working with AI researchers and/or EDA developers
Software development, release and support methodology and devops
You will also be eligible for equity and benefits.