We are looking for an experienced AI Engineer to design, build, and optimize AI-powered applications, leveraging both traditional machine learning and large language models (LLMs). The ideal candidate will have a strong foundation in LLM fine-tuning, inference optimization, backend development, and MLOps, with the ability to deploy scalable AI systems in production environments.
ShyftLabs is a leading data and AI company, helping enterprises unlock value through AI-driven products and solutions. We specialize in data platforms, machine learning models, and AI-powered automation, offering consulting, prototyping, solution delivery, and platform scaling. Our Fortune 500 clients rely on us to transform their data into actionable insights.
Key Responsibilities:
Design and implement traditional ML and LLM-based systems and applications.
Optimize model inference for performance and cost-efficiency.
Fine-tune foundation models using methods like LoRA, QLoRA, and adapter layers.
Develop and apply prompt engineering strategies including few-shot learning, chain-of-thought, and RAG.
Build robust backend infrastructure to support AI-driven applications.
Implement and manage MLOps pipelines for full AI lifecycle management.
Design systems for continuous monitoring and evaluation of ML and LLM models.
Create automated testing frameworks to ensure model quality and performance.
Basic Qualifications:
Bachelor’s degree in Computer Science, AI, Data Science, or a related field.
4+ years of experience in AI/ML engineering, software development, or data-driven solutions.
LLM Expertise
Experience with parameter-efficient fine-tuning (LoRA, QLoRA, adapter layers).
Understanding of inference optimization techniques: quantization, pruning, caching, and serving.
Skilled in prompt engineering and design, including RAG techniques.
Familiarity with AI evaluation frameworks and metrics.
Experience designing automated evaluation and continuous monitoring systems.
Backend Engineering
Strong proficiency in Python and frameworks like FastAPI or Flask.
Experience building RESTful APIs and real-time systems.
Knowledge of vector databases and traditional databases.
Hands-on experience with cloud platforms (AWS, GCP, Azure) focusing on ML services.
MLOps & Infrastructure
Familiarity with model serving tools (vLLM, SGLang, TensorRT).
Experience with Docker and Kubernetes for deploying ML workloads.
Ability to build monitoring systems for performance tracking and alerting.
Experience building evaluation systems using custom metrics and benchmarks.
Proficient in CI/CD and automated deployment pipelines.
Experience with orchestration tools like Airflow.
Hands-on experience with LLM frameworks (Transformers, LangChain, LlamaIndex).
Familiarity with LLM-specific monitoring tools and general ML monitoring systems.
Experience with distributed training and inference on multi-GPU environments.
Knowledge of model compression techniques like distillation and quantization.
Experience deploying models for high-throughput, low-latency production use.
Research background or strong awareness of the latest developments in LLMs.
Tools & Technologies We Use
Frameworks: PyTorch, TensorFlow, Hugging Face Transformers
Serving: vLLM, TensorRT-LLM, SGlang, OpenAI API
Infrastructure: Docker, Kubernetes, AWS, GCP
Databases: PostgreSQL, Redis, Vector Databases
We are proud to offer a competitive salary alongside a strong healthcare insurance and benefits package. We pride ourselves on the growth of our employees, offering extensive learning and development resources.
Diese Cookies sind für das Funktionieren der Website erforderlich und können in unseren Systemen nicht abgeschaltet werden. Sie können Ihren Browser so einstellen, dass er diese Cookies blockiert, aber dann könnten einige Teile der Website nicht funktionieren.
Sicherheit
Benutzererfahrung
Zielgruppenorientierte Cookies
Diese Cookies werden über unsere Website von unseren Werbepartnern gesetzt. Sie können von diesen Unternehmen verwendet werden, um ein Profil Ihrer Interessen zu erstellen und Ihnen an anderer Stelle relevante Werbung zu zeigen.
Google Analytics
Google Ads
Wir benutzen Cookies
🍪
Unsere Website verwendet Cookies und ähnliche Technologien, um Inhalte zu personalisieren, das Nutzererlebnis zu optimieren und Werbung zu indvidualisieren und auszuwerten. Indem Sie auf Okay klicken oder eine Option in den Cookie-Einstellungen aktivieren, stimmen Sie dem zu.
Die besten Remote-Jobs per E-Mail
Schliess dich über 5'000+ Personen an, die wöchentlich Benachrichtigungen über Remote-Jobs erhalten!