Hybrid Site Reliability Engineer (SRE) – Infrastructure & Automation na Armor
Armor · Pune, Índia · Hybrid
- Senior
- Escritório em Pune
LOCATION: IN OFFICE IN PUNE, INDIA
At Armor, we are committed to making a meaningful difference in securing cyberspace. Our vision is to be the trusted protector and de facto standard that cloud-centric customers entrust with their risk. We strive to continuously evolve to be the best partner of choice, breaking norms and tirelessly innovating to stay ahead of evolving cyber threats and reshaping how we deliver customer outcomes. We are passionate about making a positive impact in the world, and we’re looking for a highly skilled and experienced talent to join our dynamic team.
Armor has unique offerings to the market so customers can a) understand their risk b) leverage Armor to co-manage their risk or c) completely outsource their risk to Armor.
Learn more at: https://www.armor.com
Summary
We are looking for a highly skilled Site Reliability Engineer (SRE) to join our infrastructure team with expertise across Cloud Deployments, Microsoft Entra ID (Azure AD), Active Directory, Office 365, Zerto, Rubrik, VMware, and NSX-T. This hands-on, automation-heavy role focuses on system reliability, scalability, and proactive problem prevention.
You’ll be responsible for building and maintaining resilient infrastructure, automating repetitive tasks, monitoring and improving performance, and driving incident reduction strategies across hybrid cloud environments.
Essential Duties and Responsibilities (Additional duties may be assigned as required)
Identity & Access Management:
- Administer and maintain Microsoft Entra ID and on-prem Active Directory environments.
- Configure conditional access, identity protection, and secure authentication policies.
Virtualization & Networking:
- Deploy and administer VMware vSphere and NSX-T environments.
- Design and implement scalable, secure virtual network topologies.
- Assist in migrating and managing workloads across VMWare, AWS, Azure, and OCI when necessary
Productivity Platform Administration:
- Manage and optimize Active Directory and Office 365 services, including Exchange Online, SharePoint, Teams, and Intune.
Automation & Reliability Engineering:
- Write clean, maintainable code (Python, PowerShell, etc.) to automate ops tasks and monitor system health.
- Implement self-healing mechanisms and proactive detection of issues.
- Contribute to infrastructure-as-code efforts using Terraform, with a focus on modularity and DRY principles.
Monitoring & Observability:
- Define SLIs/SLOs and build custom dashboards and alerts using tools like Datadog, Prometheus, Grafana, Splunk, or equivalent.
- Collaborate with engineering and security teams to reduce toil and improve platform stability.
- Design and Implement monitoring, alerting, and incident response workflows for new production deployments
Disaster Recovery & Data Protection:
- Maintain and tune Zerto replication and Rubrik backup infrastructure.
- Ensure DR strategies meet business RTO/RPO and compliance requirements.
Incident Management & RCA:
- Lead response and resolution for high-impact infrastructure incidents.
- Conduct blameless postmortems and drive continuous improvement.
- Participate in on-call rotations and root cause analysis for incidents impacting production services
Vulnerability Patch Management
- Lead the automated vulnerability patch management program through automation
Required Skills & Qualifications
- 8+ years of experience in SRE, DevOps, Systems, or Infrastructure Engineering roles in production environments
- 8+ years of experience in Windows production environments
- 3+ years of experience with *nix production deployments
- 3+ years experience with Kubernetes
- Strong troubleshooting skills, especially in complex or hybrid environments
- Strong communication skills with a robust command of English
Cloud & Virtualization:
- Deep hands-on knowledge of VMware technologies (vSphere, ESXi, vCenter, NSX-T)
- Experience with Oracle Cloud Infrastructure (OCI) — compute, networking, IAM,
- Experience in AWS using Terraform to manage environments,
- Experience in Azure environments, particularly focused on Entra ID
- A clear understanding of Secure Landing Zone concepts
Identity & Backup:
- Expertise in Microsoft Entra ID (Azure AD), and on-prem Active Directory — administration, monitoring, troubleshooting
- Experience with Zerto and Rubrik platforms
- Cloud understanding of RTO/RPO and meeting SLAs for DR
Infrastructure as Code & Automation:
- Experience with Terraform, Ansible, or similar IaC tools
- Strong scripting skills in Python, PowerShell, Bash, or equivalent
- CI/CD and GitOps experience using tools like GitLab CI, Jenkins
- Proficient with version control systems (Git)
Monitoring & Observability:
- Experience with monitoring and alerting tools like Prometheus, Grafana, ELK, and Datadog
Networking & Security:
- Understanding of system-level networking, DNS, firewalls, and load balancing
- Familiarity with security and compliance frameworks (PCI, HIPAA, ISO, etc.)
WHY ARMOR
Join Armor if you want to be part of a company that is redefining cybersecurity. Here, you will have the opportunity to shape the future, disrupt the status quo, and be a part of a team that celebrates energy, passion, and fresh thinking. We are not looking for someone who simply fills a role – we want talent who will help us write the next chapter of our growth story.
Armor Core Values:
- Commitment to Growth: A growth mindset that encourages continuous learning and improvement with adaptability in the face of challenges.
- Integrity Always: Sustain trust through transparency + honesty in all actions and interactions regardless of circumstances.
- Empathy In Action: Active understanding, compassion and support to the needs of others through genuine connection.
- Immediate Impact: Taking initiative with swift, informed actions to deliver positive outcomes.
- Follow-Through: Dedication to delivering finished results with attention to quality and detail to achieve the desired outcomes.
WORK ENVIRONMENT
The work environment characteristics described here are representative of those an employee encounters while performing the essential functions of this job. The noise level in the work environment is usually low to moderate. The work environment can be either in an office setting or remotely from anywhere.
Equal opportunity employer - it is the policy of the company to comply with all employment laws and to afford equal employment opportunity to individuals in all aspects of employment, including in selection for job opportunities, without regard to race, color, religion, sex, national origin, age, disability, genetic information, veteran status, or any other consideration protected by federal, state or local laws.