Senior Python Core Engineer (Client SDK) bei ClearML
ClearML · Deutschland · Remote
Description
About the company
At ClearML, our mission is to make infrastructure management effortless across every phase of
the AI lifecycle -- from building and training AI models to large-scale production. Trusted by
more than 2,000 organizations, AI builders and IT teams use our AI infrastructure platform to
power everything from early-stage R&D to mission-critical public sector and enterprise-grade AI
pipelines.
We’re growing quickly and looking for curious, self-driven individuals who are excited to shape
the future of AI and the infrastructure that powers it. Our customers are tackling some of the
world’s most important challenges -- revolutionizing healthcare, discovering new medicines,
securing global finance, protecting national security, and preserving our planet’s ecosystems.
About the Role
At ClearML, our SDK is the bridge between data scientists and our infrastructure. We are
seeking a Senior Python Engineer to work on the client-side clearml open-source library. This
is not a standard backend role; you won't be tuning databases. Instead, you will be building the
"magic" that allows ClearML to automatically track experiments, creating wheels for PyPI, and
ensuring our code runs smoothly in diverse user environments.
You will dive deep into Python internals and build integrations with the latest AI frameworks
(PyTorch, TensorFlow, vLLM, etc.). If you love metaprogramming, understanding how imports
work under the hood, and packaging libraries for thousands of users, this role is for you.
Responsibilities
● Core SDK Development: Design and maintain the client-side Python SDK, ensuring
high performance and backward compatibility across Python versions.
● Automatic Instrumentation: Develop logic to automatically capture code execution,
parameters, and metrics using advanced Python techniques (monkey patching,
decorators, introspection).
● ML Framework Integration: Build and maintain deep integrations with leading AI
libraries (PyTorch, TensorFlow, Keras, Hugging Face, vLLM, scikit-learn).
● Packaging & Distribution: Manage the build lifecycle, creating and optimizing Python
wheels and Eggs, and managing releases to PyPI and Conda.
● Cross-Platform Reliability: Ensure the SDK functions seamlessly across Linux,
macOS, and Windows environments.
● Debug User Environments: Troubleshoot complex client-side issues related to
multiprocessing, serialization (pickle/dill), and memory management.
Requirements
Requirements
● 8+ years of professional software development experience.
● Deep Python Expertise: You understand metaclasses, descriptors, and how Python
handles imports and memory.
● Packaging Experience: Proven experience building and packaging Python libraries
(setup.py, poetry, building wheels) and publishing to public repositories (PyPI).
● Runtime Modification: Strong familiarity with monkey patching, AST manipulation, or
writing custom import hooks.
● AI Ecosystem Knowledge: Familiarity with the internals of frameworks like PyTorch,
TensorFlow, or vLLM, llama.cpp.
● Collaboration: Experience working in a remote, distributed team and contributing to
code reviews.
● Excellent communication skills and ability to work independently or as part of a team.
Jetzt bewerben