Hybrid Staff Machine Learning Engineer - Inference bei XPENG
XPENG · Santa Clara, Vereinigte Staaten Von Amerika · Hybrid
- Professional
- Optionales Büro in Santa Clara
- Optimization of models towards deployment on customized AI accelerators
- Write kernels for customized AI accelerators
- Develop performance estimates for critical kernels
- Master in CS/CE/EE, or equivalent, in industry experience
- Strong code skill in C/C++ and Python
- Experience with CUDA programming or related AI accelerator programming.
- Experience with enabling accuracy machine learning modeling inference using low precision data formats
- Familiarity with the fundamentals of deep learning
- Have strong engineering skills to unblock yourself and are willing to pick up whatever knowledge you are missing to get the job done
- Have an understanding of ML architecture and an intuition for how to reduce model latency
- Familiarity with GPU architecture or custom silicon chip architecture
- Experience training deep learning models
- A track record of efficiently solving complex problems collaboratively on larger teams of ML engineers, compiler engineers, kernel writers etc.
- A fun, supportive and engaging environment
- Infrastructures and computational resources to support your work.
- Opportunity to work on cutting edge technologies with the top talents in the field.
- Opportunity to make significant impact on the transportation revolution by the means of advancing autonomous driving
- Competitive compensation package