
Mlops Empregos à distância e ao domicílio em santa-clara ∙ Página 1
17 Empregos à distância e em escritório em casa online


Machine Learning Engineer, Senior Staff - Frameworks
d-Matrix · Santa Clara, Estados Unidos Da América · Hybrid


Senior Manager, Technical Program Management
ServiceNow · Santa Clara, Estados Unidos Da América · Hybrid

Hybrid Senior Product Manager - Observability and Resilience
NVIDIA · Santa Clara, Estados Unidos Da América · Hybrid

Hybrid Distinguished Engineer, GenAI Security
Netskope · Santa Clara, Estados Unidos Da América · Hybrid

Hybrid Solutions Architect, Generative AI Specialist
NVIDIA · Santa Clara, Estados Unidos Da América · Hybrid


Hybrid AI/ML Platform Product Manager
PDF Solutions · Santa Clara, Estados Unidos Da América · Hybrid

Hybrid Principal AI Engineer (Office of the CPO: Innovation Team)
Palo Alto Networks · Santa Clara, Estados Unidos Da América · Hybrid

Hybrid Machine Learning Engineer, Senior Staff - Model Factory
d-Matrix · Santa Clara, California, US, Estados Unidos Da América · Hybrid

Hybrid Staff Engineer, Developer and Qualification Tools
d-Matrix · Santa Clara, California, US, Estados Unidos Da América · Hybrid

Hybrid Machine Learning Engineer, AI Safety - LLM MLOps
Nvidia · Santa Clara, California, US', 'Remote, US, Estados Unidos Da América · Hybrid

Hybrid Senior Staff AI Engineer/ Tech Lead
Xpeng motors · Santa Clara, CA, Estados Unidos Da América · Hybrid

Hybrid Principal Engineer – AI/ML Analytics Platform & Cloud Security
Netskope · Santa Clara, California, Estados Unidos Da América · Hybrid

Hybrid Senior Technical Product Management - AI Infrastructure
NVIDIA · US, CA, Santa Clara, Estados Unidos Da América · Hybrid

Hybrid Senior Solutions Architect, Generative AI
NVIDIA · US, CA, Santa Clara, Estados Unidos Da América · Hybrid
Solutions Architect, DGX Cloud
NVIDIA · Santa Clara, Estados Unidos Da América · Hybrid
- Senior
- Escritório em Santa Clara
Do you want to be part of the team that brings Artificial Intelligence (AI) emerging technology to the field? We are looking for a hardworking Solution Architect (SA) to join the DGX Cloud SA Segment Team. The mission of the DGX Cloud Segment team is to guide and enable the successful adoption at scale of DGX Cloud and NVIDIA AI Enterprise Software in production.
NVIDIA DGX Cloud is an AI platform for developers, researchers, and enterprises, optimized for the demands of Generative AI. The DGX Cloud SA team is dedicated to shaping the future of DGX Cloud by actively gathering and incorporating partner feedback and product requirements. Our team will help optimize the onboarding process for NVIDIA Cloud Partners, ensuring fast time to insights and exceptional user experience. Additionally, we will collaborate with internal teams to scale expertise and knowledge through training and the creation of repeatable guides. Our focus on building reliable infrastructure, partner qualifications, and assets will streamline onboarding, ultimately increasing adoption of DGX Cloud.
What you’ll be doing:
Work closely with DGX Cloud Partners, become their trusted technical advisor, advocate for their needs, and ensure they are successful in accomplishing their business goals with the platform.
Accelerate NVIDIA Cloud Partner onboarding time, cluster manageability and reliability.
Scale knowledge, reach, and opportunities by building and educating vertical teams and communities on DGX Cloud and NVIDIA Reference Architectures.
Communicate to our Reference Architecture teams findings gathered from the field.
Provide technical education and facilitate field product feedback to improve DGX Cloud.
Enable partners to participate in the DGX Cloud Ecosystem with the goal of end-user satisfaction and increased sales.
What we need to see:
Strong foundational expertise, from a BS, MS, or Ph.D. degree in Engineering, Mathematics, Physics, Computer Science, Data Science (or equivalent experience)
5+ years of proven experience with one or more Cloud Service Providers (AWS, Azure, GCP or OCI), NVIDIA Cloud Partners (CoreWeave, Lambda Labs, Crusoe, etc) and cloud-native architectures and software.
Demonstrated experience in technical leadership, strong understanding of NVIDIA technologies, and success in working with customers.
Expertise with parallel filesystems (e.g. Lustre, GPFS, BeeGFS, WekaIO) and high-speed interconnects (InfiniBand, Omni Path, RoCE, and Gig-E).
Strong coding and debugging skills, and demonstrated expertise in one or more of the following areas: Machine Learning, Deep Learning, Slurm, Kubernetes, MPI, MLOps, LLMOps, Ansible, Terraform, and other high-performance AI cluster solutions.
Proficient in deploying GPU applications in Slurm, Kubernetes, docker, helm, registries
Linux-based configuration management and monitoring solutions, system administration, OS installation, configuration, and troubleshooting
Networking technologies (e.g. router, firewall, load balancer, DNS, VPN) for complex infrastructure configuration
Ways to stand out from the crowd:
Experience using DGX Cloud, NVIDIA AI Enterprise AI Software including Base Command Manager, NeMo, and NVIDIA's Inference Microservices.
Experience with AI application development and deployment
Background with deploying and configuring observability tooling including Grafana, Prometheus, W&B, Nagios, Zabbix
Experience with high performance or large-scale computing environments.
You will also be eligible for equity and benefits.