Kubernetes Engineer at G-Research

G-Research · London, United Kingdom · Onsite

Do you want to tackle the biggest questions in finance with near infinite compute power at your fingertips?

G-Research is a leading quantitative research and technology firm, with offices in London and Dallas.

We are proud to employ some of the best people in their field and to nurture their talent in a dynamic, flexible and highly stimulating culture where world-beating ideas are cultivated and rewarded.

This role is based in our new Soho Place office, opened in 2023, in the heart of Central London and home to our Research Lab.

The role

We are seeking a highly skilled and motivated Kubernetes Engineer to join our world-class engineering team. In this role, you’ll play a pivotal part in shaping the platforms that enable our research teams to run cutting-edge workloads at scale. You will be responsible for designing, building and operating mature Kubernetes-based infrastructure that is secure, resilient and highly automated, ensuring it can support a wide range of demanding compute requirements. 

Our environment spans everything from GPU-intensive machine learning model training to ultra-low latency workloads, requiring meticulous attention to performance, reliability and scalability. You’ll work closely with researchers and engineers to deliver platforms that not only meet immediate needs but also anticipate future growth and innovation.

Key responsibilities of the role include:

  • Designing and operating custom Kubernetes operators and controllers to automate infrastructure beyond off-the-shelf solutions

  • Guaranteeing multi-tenant security and isolation, enforcing RBAC and policies with OPA/Gatekeeper 

  • Engineering GitOps-driven CI/CD pipelines (ArgoCD, FluxCD) for safe, auditable changes

  • Driving Infrastructure as Code practices with Terraform and Helm for reliable and repeatable builds

  • Embedding observability throughout the platform using Prometheus, Grafana and OpenTelemetry to make systems measurable and reliable

  • Staying ahead of Kubernetes evolution, testing and adopting new versions and features early

  • Collaborating with teams across the business, refining requirements, challenging assumptions and prioritising developer experience

  • Contributing to an on-call rotation, taking ownership of incidents, solving problems openly and sharing responsibility in a no-blame culture

  • Capturing and sharing knowledge, writing clear runbooks and postmortems, and designing documentation

Who are we looking for?

The ideal candidate will have the following skills and experience: 

  • Strong Linux system engineering background 

  • Strong programming ability in Go or Python, ideally with experience building Kubernetes operators or controllers 

  • Strong understanding of Kubernetes internals, including CRDs, RBAC, custom controllers and scheduler extensions 

  • Hands-on experience with Helm and GitOps workflows 

  • Experience implementing multi-tenant security controls in Kubernetes, including namespace isolation, network policies and Open Policy Agent

  • Proven ability to troubleshoot complex performance and reliability issues across infrastructure and workloads 

  • Experience with observability tools such as Prometheus, Grafana and OpenTelemetry to monitor cluster metrics and health 

  • Strong communication skills, with experience collaborating with internal platform users to gather feedback and deliver improvements 

  • Experience writing and maintaining technical documentation, runbooks and post-incident reviews 

The following skills and experience are desirable:

  • Experience with Cilium or another CNI plugin

  • Experience with platforms such as OpenStack or VMware 

  • Experience with Amazon Web Services EKS  

  • Experience working with GPU-intensive workloads, such as large language models, ML training pipelines or scientific computing 

  • Experience with KubeVirt 

  • Contributions to open-source projects within the Kubernetes ecosystem 

  • Familiarity with container runtimes such as CRI-O, containerd and the NVIDIA Container Toolkit 

  • Familiarity with NVIDIA tooling for containers 

  • Understanding of SLOs, SLAs and error budgets 

We value engineers who bring curiosity, pragmatism and collaboration to their work and who are motivated to grow continuously while helping those around them do the same.

Why should you apply?

  • Highly competitive compensation plus annual discretionary bonus
  • Lunch provided (via Just Eat for Business) and dedicated barista bar
  • 30 days’ annual leave
  • 9% company pension contributions
  • Informal dress code and excellent work/life balance
  • Comprehensive healthcare and life assurance
  • Cycle-to-work scheme
  • Monthly company events

G-Research is committed to cultivating and preserving an inclusive work environment. We are an ideas-driven business and we place great value on diversity of experience and opinions.

We want to ensure that applicants receive a recruitment experience that enables them to perform at their best. If you have a disability or special need that requires accommodation, please let us know in the relevant section of the application.
