Software Engineer, Protected Data Site Reliability Engineering at Google
Google · San Francisco, United States of America · Onsite
- Professional
- Office in San Francisco
Minimum qualifications:
- Bachelor’s degree in Computer Science, a related field, or equivalent practical experience.
- 2 years of experience with software development in one or more programming languages.
Preferred qualifications:
- Master's degree in Computer Science or Engineering.
- 2 years of experience designing, analyzing, and troubleshooting distributed systems.
About the job
Site Reliability Engineering (SRE) combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. SRE ensures that Google's services (both our internally critical and our externally visible systems) have reliability, uptime appropriate to users' needs, and a fast rate of improvement. Additionally, SREs keep an ever-watchful eye on our systems' capacity and performance. To learn more, check out our books on Site Reliability Engineering or read a career profile about why a Software Engineer chose to join SRE.
Responsibilities
- Lead key projects from the team roadmap covering services underpinning policy compliance, AI data protection, or critical user journey automation.
- Support services and features before they go live through design consulting, developing software platforms and frameworks, capacity planning, and launch reviews.
- Maintain services once they are live by measuring and monitoring performance.
- Scale systems sustainably through mechanisms like automation; evolve systems by pushing for and implementing changes that improve reliability and velocity.
- Participate regularly in the on-call rotation, including incident coordination, distributed system debugging, implementing technical mitigations and long-term fixes, and authoring blameless postmortems.