We are seeking a highly motivated and skilled Model Development Engineer to join our rapidly growing AI team in Morrisville, NC. You will play a critical role in the training of large language models (LLMs), large vision models (LVMs), and large multimodal models (LMMs), including fine-tuning and reinforcement learning. This is a challenging yet rewarding opportunity to contribute to cutting-edge research and development in generative AI. You’ll be working with a collaborative team to push the boundaries of what’s possible with AI models and deploy them into innovative products. Responsibilities: Design, implement, and evaluate training pipelines for large generative AI models, encompassing multiple stages of post-training. Data: Design, implement, and evaluate data augmentation pipelines to increase the diversity and robustness of training datasets, improving model performance, particularly in low-data regimes. Model evaluations: Develop and implement model evaluation pipeline for LLMs Supervised Fine-tuning (SFT): Developing and executing SFT strategies for specific tasks. RLHF: Developing and leveraging RLHF algorithms in model training such as DPO and KTO Reinforcement Learning (RL): Exploring RL training strategies, sampling, reward function design and etc. to apply large scale RL to model training Quantization: Implement and evaluate model quantization techniques to reduce model size and accelerate inference speed, balancing precision loss with performance gains for deployment across diverse hardware platforms. Low-Rank Adaptation (LoRA): Utilizing techniques for efficient fine-tuning of large language models, balancing performance and resource constraints, and tailoring model performance for downstream tasks well. Experiment with various training techniques, hyperparameters, and model architectures to optimize performance and efficiency. Develop and maintain data pipelines for processing and preparing training data. Stay up-to-date with the latest advancements in large language models, training techniques, and related technologies. Collaborate with other engineers and researchers to design, implement, and deploy AI-powered products. Contribute to the development of internal tools and infrastructure for model training and evaluation. Qualifications: Ph.D degree in Computer Science, Machine Learning, or a related field. Strong programming skills in Python and experience with deep learning frameworks like Pytorch, Transformers. Solid understanding of machine learning principles, including supervised learning, unsupervised learning, and reinforcement learning. Proven experience in designing and conducting experiments, analyzing data, and drawing meaningful conclusions. Familiarity with large language models, transformer architectures, and related concepts. Experience with data processing tools and techniques (e.g., Pandas, NumPy). Experience with distributed training frameworks (e.g., DeepSpeed, Megatron-LM). Excellent communication, collaboration, and problem-solving skills. #LATC
These cookies are necessary for the website to function and cannot be turned off in our systems. You can set your browser to block these cookies, but then some parts of the website might not work.
Security
User experience
Target group oriented cookies
These cookies are set through our website by our advertising partners. They may be used by these companies to profile your interests and show you relevant advertising elsewhere.
Google Analytics
Google Ads
We use cookies
🍪
Our website uses cookies and similar technologies to personalize content, optimize the user experience and to indvidualize and evaluate advertising. By clicking Okay or activating an option in the cookie settings, you agree to this.
The best remote jobs via email
Join 5'000+ people getting weekly alerts with remote jobs!