Site Reliability Engineer at Qube Research & Technologies
Qube Research & Technologies · Paris, France · Onsite
- Senior
- Office in Paris
Your future role within QRT:
- Design, build, and maintain scalable, reliable, and fault-tolerant systems.
- Monitor system performance, set SLIs/SLOs/SLAs, and ensure service availability targets are met.
- Automate operational tasks such as deployments, monitoring, and incident response.
- Troubleshoot incidents and perform root cause analysis.
- Implement observability best practices (logging, metrics, tracing) to improve system health visibility.
- Work with development teams to design services that are operable and resilient from day one.
- Continuously improve incident management processes and postmortem culture.
- Strong programming skills (Python and Rust are a plus).
- Deep knowledge of Linux systems, networking, and distributed systems, especially in containerized environments.
- Experience with monitoring and alerting tools (PromQL, Datadog, CloudWatch, etc.) and OpenTelemetry is an asset.
- Familiarity with CI/CD pipelines and Infrastructure as Code.
- Solid understanding of cloud platforms. Previous AWS experience is an asset.
- Strong troubleshooting and problem-solving mindset.