Site Reliability Engineering Professional at IBM

IBM · Bengaluru, India · Hybrid

Introduction

We, at the IBM Chief Information Office (CIO), are a vibrant and forward‑thinking community of business, strategy, and technology professionals. We deliver industry‑leading consulting, world‑class application and business process services, and embrace agile values and principles in everything we do. The CIO organization plays a pivotal role in the digital reinvention of IBM, modernizing and transforming mission‑critical applications that support IBMers across the globe. Our focus is on providing value‑driven, asset‑powered, end‑to‑end solutions that elevate business outcomes and user experiences.

Our team operates as a cross‑functional organization dedicated to delivering enterprise‑grade IT applications and infrastructure services to IBM Business Units and employees worldwide. Our mission is simple yet powerful: to create a highly productive, secure, and seamless environment for every IBMer.

We achieve this by embodying our core principles:

• Science - Making informed, data‑driven decisions.

• Ownership - Empowering autonomous teams that take end‑to‑end accountability.

• Speed - Iterating rapidly to deliver impactful solutions with agility.

• Openness - Ensuring transparent communication and fostering a culture of trust and collaboration.

We engineer the systems that power the CIO business and continuously innovate to transform it. As we advance, we remain committed to deepening our Agile practices while rigorously maintaining and enhancing the security and resilience of our enterprise IT infrastructure.

Your role and responsibilities

As a Site Reliability Engineering (SRE) Professional, you will leverage deep operational expertise to ensure the resiliency, reliability, security, and scalability of core IT infrastructure and hosting platforms. You will apply your engineering mindset, supported by strong scripting, automation, and coding capabilities, to continuously improve system performance and operational excellence.

Your primary responsibilities include:

• Develop deep expertise across hosting platforms, tools, and technologies, becoming a trusted subject‑matter expert for critical systems.

• Drive efficiency and automation across hosting environments while upholding the highest standards of availability, performance, and security.

• Exhibit a strong passion for the SRE discipline, embracing engineering principles, continuous learning, and operational excellence.

• Maintain, support, and document all managed infrastructure components in alignment with business requirements, IT security policies, and governance frameworks.

• Communicate effectively with customers, partners, and technical teams using collaboration tools to resolve issues and provide clear guidance.

• Evaluate and research new IT tools and technologies, conducting rapid proofs of concept (POCs) and delivering both architectural insights and tactical recommendations.

• Contribute to innovation, generating ideas for new features and enhancements informed by industry trends and technology advancements.

• Transform managed services by adopting DevOps and automation‑driven approaches to deliver fully automated, highly reliable service operations.

• Collaborate across the organization, participating in cross‑company solution development task forces to support broad technology initiatives.

• Create reusable assets and thought leadership materials, including whitepapers, technical articles, and best‑practice guidance related to IBM tools, technologies, and offerings.

Required technical and professional expertise

• Red Hat Certification:
Possesses either Red Hat Certified Specialist in Cloud Infrastructure (RHCSA‑OpenStack) or Red Hat Certified System Administrator (RHCSA) credentials, demonstrating strong expertise in enterprise Linux environments.

• Monitoring & Automation Expertise:
Hands‑on experience with modern monitoring platforms such as Instana, along with strong scripting capabilities to automate operational tasks and enhance system reliability.

• Service Deployment & Administration:
Proficient in installing, configuring, and managing core infrastructure services including Bind (DNS), Apache, MySQL, and Nginx, ensuring optimal performance and availability.

• Technical & Analytical Strengths:
Strong problem‑solving skills with the ability to diagnose complex issues and communicate solutions clearly across technical and non‑technical teams.

• Networking & Storage Knowledge:
Solid understanding of networking concepts, protocols, and storage systems, enabling efficient troubleshooting and optimal infrastructure design.

• Cross‑Cultural Collaboration:
Demonstrated ability to work effectively with diverse, multicultural, and multi‑ethnic teams, fostering strong collaboration across engineering and operational groups.

• OEM Coordination:
Experience interacting with OEM vendors such as IBM, Lenovo, Supermicro, Cisco, and Palo Alto, driving timely resolution of sophisticated hardware and platform‑level issues.

• Security & Compliance:
Maintains a strong focus on ensuring that all infrastructure components adhere to the latest security standards, compliance requirements, and industry best practices through diligent patching, monitoring, and policy enforcement.

Preferred technical and professional experience

• Red Hat Certification:

Possesses either Red Hat Certified OpenStack Administrator (RHCSA‑OpenStack) or Red Hat Certified System Administrator (RHCSA) credentials, demonstrating strong expertise in enterprise Linux environments.

• Monitoring & Automation Expertise:

Hands‑on experience with modern monitoring platforms such as Instana, along with strong scripting capabilities to automate operational tasks and enhance system reliability.

• Service Deployment & Administration:

Proficient in installing, configuring, and managing core infrastructure services including Bind (DNS), Apache, MySQL, and Nginx, ensuring optimal performance and availability.

• Technical & Analytical Strengths:

Strong problem‑solving skills with the ability to diagnose complex issues and communicate solutions clearly across technical and non‑technical teams.

• Networking & Storage Knowledge:

Solid understanding of networking concepts, protocols, and storage systems, enabling efficient troubleshooting and optimal infrastructure design.

• Cross‑Cultural Collaboration:

Demonstrated ability to work effectively with diverse, multicultural, and multi‑ethnic teams, fostering strong collaboration across engineering and operational groups.

• OEM Coordination:

Experience interacting with OEM vendors such as IBM, Lenovo, Supermicro, Cisco, and Palo Alto, driving timely resolution of sophisticated hardware and platform‑level issues.

• Security & Compliance:

Maintains a strong focus on ensuring that all infrastructure components adhere to the latest security standards, compliance requirements, and industry best practices through diligent patching, monitoring, and policy enforcement.

IBM is committed to creating a diverse environment and is proud to be an equal-opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, gender identity or expression, sexual orientation, national origin, caste, genetics, pregnancy, disability, neurodivergence, age, veteran status, or other characteristics. IBM is also committed to compliance with all fair employment practices regarding citizenship and immigration status.
