
Site Reliability Engineer (SRE) at Reward Gateway
Reward Gateway · London, United Kingdom · Hybrid
- Professional
- Optional office in London
Reward Gateway|Edenred is a leading digital platform for services and payments for people at work, connecting 52 million users and 2 million partner merchants in 45 countries via close to 1 million corporate clients.
Due to expansion, an opportunity has become available for a Site Reliability Engineer to join our team to help us transform our existing operational workloads to an SRE approach.
Some of Your Responsibilities & Core Duties will be:
- Day-to-day operations of our complex AWS architecture
- Integrating tightly with our DevOps team members
- Following SRE practices and maintaining high standards of compliance
- Implementing a new standard of observability utilising SLI/SLO/Error Budgets
- Continually evolving our observability platforms for greater coverage
- Using a code-first approach to builds and changes to reduce toil
- Advocating a strong focus on availability, reliability and uptime
- Liaising with the Engineering teams for the constant evolution of metrics
- Working towards planned roadmap goals
- Actively taking part in the daily stand-ups and keeping sprints on track
- Keeping documentation up to date in Jira and Confluence
- Taking part in SRE Incident Management processes
- Acting as a key Incident Commander within the Incident Management process
- Ensuring a focus on cost efficiency for the platforms & services
- Working with team members to foster collaboration and ongoing communication with stakeholders
The Experience and Key Skills you will have:
- At least 4 years of experience in DevOps or SRE, with a keen interest in growing as a Site Reliability Engineer
- Experience with AWS or other cloud providers
- Enterprise infrastructure experience in HA environments
- Automation skills through Terraform, Python, Bash or similar
- Wide-reaching SRE skills and a deep understanding of SRE practices
- A strong understanding of SQL, PHP, Kubernetes, CI/CD
- Observability product experience (e.g., New Relic, Datadog)
- Managing infrastructures using SLI/SLO & Error Budgets
- Ability to work both independently and as part of a team
- Ability to work under pressure and be highly reliable
- Adaptability and flexibility to change in a fast-moving environment
- An ability to learn new tools and processes quickly and impart that knowledge
- Salary on offer ranges from £7,000 to £7,500 gross per month, depending on experience
- No bonuses or share options are currently offered
The Interview Process:
- Screening video interview with the Senior Talent Partner and Head of SRE
- Final interview with the Director of Infrastructure & Head of SRE
Be comfortable. Be you.