Senior Site Reliability Engineer (SRE) at StubHub
StubHub · New York City, United States Of America · Hybrid
- Senior
- Office in New York City
StubHub is on a mission to redefine the live event experience on a global scale. Whether someone is looking to attend their first event or their hundredth, we’re here to delight them all the way from the moment they start looking for a ticket until they step through the gate. The same goes for our sellers. From fans selling a single ticket to the promoters of a worldwide stadium tour, we want StubHub to be the safest, most convenient way to offer a ticket to the millions of fans who browse our platform around the world.
What You'll Do:
- Build out and maintain an observability platform to ensure the reliability, availability, and performance of critical systems.
- Collaborate with cross-functional teams to identify and address potential bottlenecks, optimize resource utilization, and proactively prevent system failures.
- Drive the implementation of automation tools and Infrastructure as Code (IaC) practices to streamline deployment processes, configuration management, and infrastructure provisioning.
- Help develop a center of excellence, fostering a culture of empowering teams to continuously and reliably deliver customer value
- Develop processes, tools and automation to reduce toil across engineering teams
- Ensure Systems effectively balance cost, perfomance and reliability at scale
What You've Done:
- Extensive experience (typically 5+ years) in a site reliability engineering or a related role, demonstrating a strong command of incident management, mitigation, & prevention, troubleshooting, and performance tuning.
- Experience with developing robust, mission-critical systems using one or multiple general-purpose programming languages (e.g., C/C++, Java, C# or any other OOP language)
- Experience with cloud computing (AWS, GCP, Azure)
- A strong track record of aggressively identifying and removing toil through process optimization, automation and system design
- Demonstrated ability to write and maintain code for automation, infrastructure orchestration, and reliability tooling.
- Demonstrated understanding of large scale observability platforms and tools
- Understanding of orchestration system such as Kubernetes
- Accelerated Growth Environment: An environment designed for swift skill and knowledge enhancement, where you have the autonomy to lead experiments and tests on a massive scale.
- Top Tier Compensation Package: Competitive base, equity, and upside that tracks with your impact.
- Flexible Time Off: Embrace a healthy work-life balance with unlimited Flex Time Off, providing you the flexibility to manage your schedule and recharge as needed.
- Comprehensive Benefits Package: Prioritize your well-being with a comprehensive benefits package, featuring 401k, and premium Health, Vision, and Dental Insurance options.
The anticipated gross base pay range is below for this role. Actual compensation will vary depending on factors such as a candidate’s qualifications, skills, experience, and competencies. Base annual salary is one component of StubHub’s total compensation and competitive benefits package, which includes equity, 401(k), paid time off, paid parental leave, and comprehensive health benefits.