The Company
A well-established tech organization building advanced AI products for healthcare and clinical research. The team focuses on secure, reliable platforms that process sensitive medical data and support research and clinical workflows.
Role & Responsibilities
As a Senior SRE , you will :
- Design and automate infrastructure (infrastructure-as-code tools)
- Build and maintain CI / CD pipelines and release automation
- Operate and scale production systems on major cloud platforms
- Implement monitoring, alerting, and incident response practices
- Enforce security and compliance controls for protected health data
- Create and test disaster recovery and continuity plans
- Produce clear operational documentation and runbooks
- Coach and guide more junior engineers and on-call teams
- Work closely with engineering and research teams to enable fast, safe delivery of product features
Job Requirements
Must-Haves
5+ years in an SRE / Infrastructure / Platform roleHands-on with IaC (Terraform or equivalent)Production experience with container orchestration (Kubernetes)Solid scripting / programming skills (e.g., Python )Proven work with CI / CD systems and pipelinesExperience running workloads on cloud providers (GCP, Azure, or AWS)Familiar with observability tools (metrics, logs, tracing)Practical knowledge of security best practices and data-protection toolingStrong communication, troubleshooting, and incident response skillsNice-to-Haves
Experience with healthcare compliance (HIPAA, SOC 2)Background in high-performance or data-intensive environmentsPrior mentorship or technical leadership experienceDeep experience scaling Terraform + Kubernetes at production scale