The Company A well-established tech organization building advanced AI products for healthcare and clinical research. The team focuses on secure, reliable platforms that process sensitive medical data and support research and clinical workflows.
Role & Responsibilities As a
Senior SRE , you will : Design and automate infrastructure
(infrastructure-as-code tools) Build and maintain
CI / CD pipelines
and release automation Operate and scale production systems on
major cloud platforms Implement
monitoring, alerting, and incident response
practices Enforce
security and compliance controls
for protected health data Create and test
disaster recovery and continuity plans Produce clear
operational documentation
and runbooks Coach and guide
more junior engineers and on-call teams Work closely with engineering and research teams to enable fast, safe delivery of product features
Job Requirements Must-Haves 5+ years
in an SRE / Infrastructure / Platform role Hands-on with
IaC
(Terraform or equivalent) Production experience with
container orchestration
(Kubernetes) Solid scripting / programming skills (e.g.,
Python ) Proven work with
CI / CD
systems and pipelines Experience running workloads on
cloud providers
(GCP, Azure, or AWS) Familiar with
observability tools
(metrics, logs, tracing) Practical knowledge of
security best practices
and data-protection tooling Strong communication, troubleshooting, and incident response skills
Nice-to-Haves Experience with
healthcare compliance
(HIPAA, SOC 2) Background in
high-performance
or data-intensive environments Prior mentorship or technical leadership experience Deep experience scaling Terraform + Kubernetes at production scale
Site Reliability Engineer • Lavras, Brasil