- Senior
- Escritório em Bangalore
Description
- Deploy, manage, optimize, and troubleshoot large-scale Kubernetes clusters in multi-cloud (AWS, Azure, GCP) and hybrid environments (OpenStack, VMware vSphere).
- Implement cluster autoscaling and resource management strategies with tools such as Karpenter.
- Architect, implement, and manage infrastructure in multi-cloud (AWS, GCP, Azure) and hybrid environments.
- Optimize cloud resource usage leveraging AWS Cost Explorer, Savings Plans, and similar tools on other cloud providers.
- Develop and maintain comprehensive monitoring, logging, tracing, and alerting solutions using Prometheus, Grafana, CloudWatch, Datadog, or similar tools.
- Conduct root cause analysis (RCA) and implement proactive improvements to maximize system uptime, reliability, and performance.
- Design, implement, and maintain robust CI/CD pipelines using Jenkins, GitLab CI/CD, GitHub Actions, or Tekton.
- Promote and implement DevSecOps best practices across teams to automate testing, security scanning, and deployments.
- Integrate comprehensive security practices throughout the software lifecycle (DevSecOps), including vulnerability scanning and secure coding practices.
- Manage secrets securely using Vault, AWS Secrets Manager, Azure Key Vault, or similar tools.
- Ensure adherence to compliance standards and regulatory requirements.
- Implement and enforce governance policies and frameworks to optimize infrastructure usage, reduce costs, and enhance operational efficiency.
- Regularly review and optimize cloud expenditure, performance, and scaling strategies.
- Collaborate closely with architects, developers, QA, product teams, and management stakeholders.
- Clearly communicate complex infrastructure concepts and strategies to diverse stakeholders.
- Bachelor's degree in Computer Science, Information Technology, or related technical discipline (Master’s preferred).
- 14+ years of experience as a Site Reliability Engineer, DevOps Engineer, Platform Engineer, or similar role.
- Extensive expertise in Kubernetes, container orchestration, and related ecosystem.
- Hands-on experience with cloud platforms (AWS, Azure, GCP), OpenStack, VMware vSphere, and hybrid environments.
- Proficiency in scripting and automation languages (Python, Bash, Go, or similar).
- Solid experience with infrastructure as code (Terraform, CloudFormation, Pulumi).
- Strong knowledge of CI/CD tools and pipeline design (Jenkins, GitLab CI/CD, GitHub Actions, Tekton).
- Exceptional troubleshooting and problem-solving skills, coupled with a proactive and continuous learning mindset.
- Certifications in Kubernetes (CKA/CKAD/CKS), AWS (Solutions Architect, DevOps Engineer), Azure, or GCP.
- Familiarity with multi-cloud management tools and strategies.
- Background in software development or software infrastructure management.