Apply Now

About the job

Join our Cloud Platform team as a Site Reliability Engineer at Ajackus and become a part of a leading IT firm where innovation meets expertise. If you are passionate about technology, automation, and continuous improvement, this is the role for you. You'll have the opportunity to design, build, and maintain systems crucial to our operations, while ensuring they are both scalable and reliable.


Responsibility

  • Design, build, and maintain scalable, reliable, and highly available infrastructure and services.
  • Utilize Azure Monitoring and Grafana for proactive issue detection and resolution.
  • Develop automation tools to streamline deployment, configuration, and maintenance tasks.
  • Conduct performance analysis and capacity planning for optimal resource utilization.
  • Collaborate with software engineering teams to implement reliable and scalable solutions.
  • Engage in incident response, root cause analysis, and post-mortem reviews.
  • Stay abreast of industry trends through workshops and conferences, sharing knowledge within the company.


Qualitification

  • Higher education in Computer Science, Engineering, or IT, or equivalent experience.
  • Strong programming skills in languages like Python, Go, Java, and proficiency in shell scripting.
  • Experience with Docker, Kubernetes, and infrastructure as code tools such as Ansible or Terraform.
  • Proven problem-solving skills, with a proactive approach to potential issues.
  • Excellent communication and collaboration skills, able to thrive in a dynamic environment.
  • Fluency in English; German and/or Spanish is a plus.


Please note : This is a perm remote role with overlapping working hours 12pm - 9pm IST


Apply Now

Other Jobs