Platzhalter Bild

Lead Engineer (AI/ML - Training & Experimentation) bei Prodapt

Prodapt · Chennai, Indien · Onsite

Jetzt bewerben
Overview:

Join the Prodapt team in building a unified, cloud-native environment for scalable machine learning training and experimentation. You will help design, develop, and optimize robust workflows that empower data scientists and engineers to efficiently explore, train, and validate ML models at scale.

Responsibilities:

• Develop, maintain, and enhance interactive Jupyter-based notebook environments for model development, experimentation, and training.
• Build and optimize Pipelines (file-based and git-based) for orchestrating training and experiment jobs, ensuring seamless transition from research to production.
• Integrate and manage Project Namespace storage for organizing research data and experiment artifacts.
• Enable flexible compute resource allocation (CPU/GPU) for diverse ML workloads, including deep learning, NLP, and recommendation systems.
• Implement and support experiment tracking tools for managing model versions, metadata, and reproducibility.
• Develop and optimize data loading and processing workflows using technologies such as Spark, cuDF, RAPIDS, and NVTabular.
• Leverage Google Cloud Platform (GCP) for scalable compute, storage, and orchestration of training jobs.
• Ensure robust model governance, including lineage tracking, metadata management, and compliance.
• Collaborate with data scientists, ML engineers, and platform teams to deliver reliable, production-grade solutions.
• Participate in code reviews, architecture discussions, and continuous improvement of the platform.

Requirements:

Required Technical Skills

  • Proficiency in Python, especially for ML model development and data engineering.
  • Experience with Jupyter Notebooks and interactive development environments.
  • Familiarity with distributed data processing frameworks (Spark, cuDF, RAPIDS, NVTabular).
  • Experience with cloud platforms (GCP preferred) and scalable compute resource management (CPU/GPU).
  • Strong understanding of experiment tracking, model versioning, and metadata management.
  • Experience with CI/CD tools and automation for ML workflows.
  • Knowledge of model governance, reproducibility, and compliance best practices.
  • Excellent troubleshooting, debugging, and communication skills.

Preferred Qualifications

  • Experience with Pipeline orchestration (file-based and git-based).
  • Familiarity with ecosystem or large-scale financial/ML platforms.
  • Exposure to research and production environment isolation for ML workflows.
  • Experience with dashboarding and monitoring tools for training jobs and experiment outcomes.
  • Knowledge of data privacy, security, and compliance in ML environments.
Jetzt bewerben

Weitere Jobs