At INDI, we're passionate about empowering individuals and businesses worldwide. Our cutting-edge recruiters connect leading companies with top talent, fostering a dynamic environment where innovation thrives. Join us in shaping the future of work.

Overview of the role:
We are looking for a Site Reliability Engineer to build and maintain highly reliable, scalable, and secure OpenShift / Kubernetes clusters. You will approach the problem of building and maintaining production systems from a software engineering perspective, with a focus on automation and reliability.

Key responsibilities:
- Build, automate, and maintain OpenShift / Kubernetes clusters.
- Create and enhance tools that automate operational workflows.
- Configure and maintain the supporting infrastructure applications the clusters require.
- Monitor, respond to, and resolve cluster and infrastructure service issues.
- Handle infrastructure and services on-premises and in AWS.
- Diagnose and resolve problems in OpenShift and / or Kubernetes clusters.
- Implement metrics to measure service performance and health.

Requirements:
- 5+ years of experience as a Site Reliability Engineer.
- Deep experience with Linux administration.
- Automation experience with Python, Bash, Salt, or equivalent.
- Knowledge of installing, managing, maintaining, and troubleshooting OpenShift / Kubernetes clusters.
- Advanced English level.

Benefits:
Site Reliability Engineer • Brasil, Brasil, BR