Kubernetes-container-platforms Remote- & Homeoffice Jobs in Frankreich

Remote Site Reliability Engineer H/F Site Reliability Engineer H/F

Fintecture · France · France · Remote

Details zum Jobangebot

We are building a microservice ecosystem of applications serving a multitude of payment and data services for end users.


We are seeking a passionate Expert Site Reliability Engineer (SRE) with a DevOps vision to join our dynamic team. If you have expertise in GCP, Kubernetes, Datadog, Go, and NodeJS, this opportunity is for you! This is a great opportunity to help Fintecture build a new Site Reliability Engineering team. You will be responsible for detecting and mitigating customer-impacting incidents at Fintecture and building solutions to support the reliability and availability of Fintecture services.


The ideal candidate will have experience in technical operations roles (ideally SRE) and programming skills.


Responsibilities


  • Problem Discovery: Anticipate issues from happening or discover existing issues within distributed cloud-native applications using logs, telemetry, and alerting.
  • Urgent Problem Mitigation: Take ownership of the system incident response process, mitigate urgent problems and collaborate with teammates to solve underlying issues.
  • Automation: Write code to automate mitigations and improve tools, making processes more efficient.
  • Training: Teach other how to create greate observability dashboards, how to detect and fix recurring problems, and continually promote the practice.
  • Reliability Practices: Provide and institute proven practices around reliability, remediations, and troubleshooting.
  • Tool Development: Build vital and efficient tooling to lower the barrier of entry for engineering teams to plug in and enjoy the benefits of reliability.
  • Team Management: Manage the run team and organize tickets.
  • Business Vision for Observability: Incorporate observability into a business vision, not just an infrastructure vision.


Why Join Us?

Stimulating Environment: Work with a talented team passionate about new technologies.

Innovative Projects: Participate in high-value projects in an international context.

Career Growth: Opportunities for professional development and career progression.

Company Culture: A culture focused on innovation, collaboration, and continuous improvement.


Required Skills


  • Google Cloud Platform (GCP): Expertise in managing and configuring GCP services.
  • Kubernetes: Proficiency in managing Kubernetes clusters and container orchestration.
  • Datadog: Advanced skills in monitoring and performance management with Datadog.
  • Programming: Strong skills in Go and NodeJS, enabling you to read application code and propose solutions during incidents.
  • DevOps Methodology: Deep understanding of DevOps practices and tools, including CI/CD, Infrastructure as Code (IaC), Terraform, and automation.
  • Problem-Solving: Ability to analyze and effectively resolve complex issues.
  • Team Collaboration: Excellent communication and collaboration skills with multidisciplinary teams. A customer-first mindset.


Preferred Experience


  • Experience in monitoring large-scale SaaS-type products or services.
  • Experience in a software development environment.
  • Experience in a Software, Infrastructure, Systems, and/or Site Reliability Engineering role.
  • A successful track record of troubleshooting distributed systems during service incidents while remaining level-headed.
  • A strong curiosity for the unknown and not stopping until you have a solid understanding.
  • An understanding of what makes up the incident lifecycle.


We are convinced that the diversity of our employees in all its forms is an asset for our company and its customers. That's why our jobs are open to everyone. We guarantee equal opportunities in our recruitment process and throughout your career at Fintecture.


Recruitment process :


1. Call with VP of Engineering (30 min)

2. Call with Lead DevOps (30 min)

3. Technical test live with people of the Squad Platform team (3 hours)

4. Call with CTO (30 min)

5. Call with CEO (30 min)