Site Reliability Engineer na Euronet Worldwide, Inc.
Euronet Worldwide, Inc. · Leawood, Estados Unidos De América · Hybrid
- Escritório em Leawood
Description
Since 1996, epay, a business segment of Euronet, has been at the center of connecting local and global brands to consumers. Our capabilities, platforms, products, and solutions cater to the changing consumer demand for content and payments in categories such as mobile, gaming, and entertainment.
We’re looking for a Site Reliability Engineer (SRE) who will own reliability, scalability, and day‑2 operations of our Kubernetes platforms—specifically SUSE Harvester (HCI), Longhorn, and clusters managed with Rancher. You’ll enable product teams to ship safely using GitOps (FluxCD) and Infrastructure as Code (Crossplane), and you’ll partner closely with networking and platform engineering to keep services fast, resilient, and observable.
What you’ll do
- Operate and harden SUSE Harvester environments: lifecycle management, upgrades, node/cluster health, HA, capacity planning, and incident response.
- Administer Longhorn storage for Kubernetes: performance tuning, disaster‑recovery design, backup/restore validation, and troubleshooting volume issues.
- Manage Kubernetes clusters (multi‑cluster, multi‑tenant) including cluster creation, upgrades, admission control, API server health, and etcd care.
- Own CNI operations with Antrea: policy design, network performance, and east‑west traffic observability.
- Run KubeVirt for VM workloads on Kubernetes: plan migrations, right‑size resources, and build reliable pipelines for VM lifecycle.
- Use Rancher to standardize cluster fleet management: provisioning (CAPI), templates, RBAC, and centralized policy/upgrade orchestration.
- Implement GitOps with FluxCD: define release pipelines, drift detection, progressive delivery, and automated rollbacks.
- Provision cloud/on‑prem resources with Crossplane: compose abstractions, manage providers, and enforce guardrails for day‑2 operations.
- Build and maintain SLOs/SLIs: availability, latency, error budgets; automate alerts and runbooks tied to service health.
- Reduce toil through automation: scripting, operators, controllers, and self‑service tooling for developers.
- Participate in on‑call rotations, post‑incident reviews, and reliability roadmaps; drive corrective actions and platform improvements.
Requirements
- 3+ years in SRE/Platform/Systems Engineering (or equivalent) supporting production Kubernetes.
- Hands‑on experience with SUSE Harvester and Longhorn or comparable HCI + distributed block storage.
- Practical knowledge of Antrea CNI, KubeVirt, and Rancher fleet management.
- Proficiency with FluxCD (GitOps patterns, Kustomize/Helm) and Crossplane (Compositions, Providers, RBAC).
- Strong Linux administration (networking, filesystems, performance), observability (logs/metrics/traces), and scripting (Bash/Python).
- Networking fundamentals (TCP/IP, L4/L7), Kubernetes networking/policies, TLS/cert management.
- Experience designing for HA, capacity planning, backup/restore, and disaster recovery.
Nice to have
- Experience with CAPI/Cluster API, RKE2/k3s, CSI drivers, and hardware lifecycle (firmware, BMC).
- Familiarity with service meshes (e.g., Istio/Linkerd), policy engines (OPA/Gatekeeper), and secrets management.
- Infrastructure automation (Terraform/Ansible) and CI/CD (GitHub Actions, GitLab CI, Azure DevOps).
- Prior ownership of SLO programs and error‑budget policies.
How you’ll succeed (first 90 days)
- Audit current Harvester/Longhorn/Rancher landscape; publish reliability baseline and SLOs.
- Stand up or upgrade GitOps pipelines with FluxCD; reduce manual changes to near zero.
- Introduce Crossplane compositions for standard infra; enable dev and devops teams with safe self‑service.
- Document and operationalize runbooks for Antrea/KubeVirt; close top reliability gaps.
Benefits
Euronet employees enjoy outstanding benefits, including:
- 401(k) Plan
- Health/Dental/Vision Insurance
- Employee Stock Purchase Plan
- Company-paid Life Insurance
- Company-paid disability insurance
- Tuition Reimbursement
- Paid Time Off
- Paid Volunteer Days
- Paid Holidays
- Casual Office Attire
- Plus many more employee perks & incentives!
We are an Equal Opportunity Employer, and all qualified applicants will receive consideration for employment without regard to race, color, religion, gender, sexual orientation, gender identity, or national origin, age, disability status, genetic information, protected veteran status, or any other characteristic protected by law.