Talent.com
Site Reliability Engineer
Site Reliability EngineerReview All • Limeira, São Paulo, Brasil
Site Reliability Engineer

Site Reliability Engineer

Review All • Limeira, São Paulo, Brasil
Há 1 dia
Descrição da vaga

About the Company

This company operates a global computing platform that enables businesses to programmatically deploy single-tenant Bare Metal instances across multiple regions worldwide.

They are a team of passionate engineers working at the intersection of hardware, software, and network infrastructure, building the fastest, most developer-centric single-tenant cloud infrastructure on the market.

If you share this passion, this role offers the opportunity to help shape the future of internet-scale infrastructure.

This position is being managed in partnership with an external recruitment consultancy supporting the company throughout the hiring process.Summary

The Reliability team is responsible for the health and resilience of the infrastructure powering a global bare metal cloud platform.

As aSenior Site Reliability Engineer (SRE) , you'll focus on building reliable, observable, and self-healing systems at scale.

SREs here operate at the intersection of software engineering and infrastructure — designing tools that automate operations, improve incident response, and enhance observability, ensuring the platform delivers high performance and reliability to customers worldwide.

This role is ideal for engineers passionate about reliability, automation, distributed systems, and bringing cloud-like experiences to bare metal environments.Key Responsibilities

Continuously improve platform reliability and performance.

Design, build, and maintain tools to automate operational workflows and incident response.

Implement and enhance observability systems (monitoring, alerting, tracing).

Collaborate with engineering and platform teams to design scalable and resilient systems.

Participate in on-call rotations and lead post-incident reviews with a learning-focused approach.

Develop and document operational playbooks and processes.

Contribute to defining SLOs / SLIs and driving reliability metrics across teams.Skills & QualificationsRequired :

Fluent verbal and written English communication skills

Advanced experience with Linux / Unix in production environments

Hands-on experience with Kubernetes and container orchestration

Proficiency with IaC tools (e.g., Terraform, Ansible)

Experience with observability stacks (Prometheus, Grafana, Loki, ELK, etc.)

Proficiency with scripting / programming languages such as Bash, Python, Go, or Ruby

Working knowledge of Git and CI / CD pipelines

Experience with incident response and root cause analysis

Knowledge of cloud-native reliability and security best practicesWhat's Offered

Contractor engagement (PJ)

Paid Time Off

Competitive compensation package

Wellness benefit (Wellhub / Gympass equivalent)

Annual performance-based bonus

Flexible working hours

Opportunities for technical and career growth

Criar um alerta de emprego para esta pesquisa

Site Reliability Engineer • Limeira, São Paulo, Brasil

Vagas relacionadas
Senior React Engineer (LATAM | English C1 / C2)

Senior React Engineer (LATAM | English C1 / C2)

Yisrael Technology LLC • Hortolândia, São Paulo, Brazil
We’re looking for a Senior React Engineer to join one of our U.You will work on complex, high-impact applications—often within industries such as finance, insurance, or other data-heavy domains—co...Mostre mais
Última atualização: 7 dias atrás • Promovida
Site Reliability Engineer Sr

Site Reliability Engineer Sr

Mercado Eletrônico • Sumaré, Brasil
O Mercado Eletrônico é líder na América Latina em soluções de gestão de compras B2B.Suas tecnologias e serviços para as áreas de compras ajudam empresas a conquistarem mais economia, agilidade, gov...Mostre mais
Última atualização: 18 dias atrás • Promovida
Site Reliability Engineer (Middle / Senior) ID38916

Site Reliability Engineer (Middle / Senior) ID38916

AgileEngine • Campinas, SP, br
Quick Apply
Fortune 500 brands and trailblazing startups across 17+ industries.We rank among the leaders in areas like application development and AI / ML, and our people-first culture has earned us multiple Bes...Mostre mais
Última atualização: 27 dias atrás
Site Reliability Engineer (Relocation To Portugal)

Site Reliability Engineer (Relocation To Portugal)

Affinity • Paulínia, São Paulo, Brasil
Please note that we're aiming at an expatriation to Portugal • •We are aPortuguese technology consulting companywith a strong outward look to the rest of Europe. We have 12 years of experience in the ...Mostre mais
Última atualização: 26 dias atrás • Promovida
Full Stack Engineer (Remote, $110K USD)

Full Stack Engineer (Remote, $110K USD)

Ascen • Piracicaba, São Paulo, Brazil
Senior Full-Stack Software Engineer 100% Remote | Must have at least 4 hours of overlap with US Eastern Time (EST) About Ascen Ascen (ascen. We empower staffing firms to focus on their clients and ...Mostre mais
Última atualização: há mais de 30 dias • Promovida
Site Reliability Engineer PL

Site Reliability Engineer PL

Turbi • Itatiba, São Paulo, Brazil
E aí, tudo azul por aí? A Turbi é a locadora do futuro : 100% digital, movida a tecnologia, gente boa e paixão por transformar a forma como as pessoas se locomovem. A gente abre o carro pelo app (si...Mostre mais
Última atualização: 2 dias atrás • Promovida
Site Reliability Engineer (Sre)

Site Reliability Engineer (Sre)

Metacto • Itatiba, São Paulo, Brasil
At MetaCTO, we specialize in helping startups and growing companies turn visionary ideas into successful digital products through expert app development and fractional CTO services.As aSite Reliabi...Mostre mais
Última atualização: 2 dias atrás • Promovida
Full Stack Engineer

Full Stack Engineer

AI Stealth • piracicaba, Brasil
Our AI startup has been featured on VICE, New York Times, Futurism, Fox News, and more for disrupting our industry.We still operate in stealth so we aren't able to discuss exactly what we do until ...Mostre mais
Última atualização: 22 horas atrás • Promovida • Nova!
Staff ML Engineer

Staff ML Engineer

TurnKey Tech Staffing • Araras, São Paulo, Brazil
About the Product Niche is the leader in school search.Our mission is to make researching and enrolling in schools easy, transparent, and free. With in-depth profiles on every school and college in ...Mostre mais
Última atualização: 16 dias atrás • Promovida
Site Reliability Engineer (SRE)

Site Reliability Engineer (SRE)

MetaCTO • Paulínia, São Paulo, Brazil
About Us At MetaCTO, we specialize in helping startups and growing companies turn visionary ideas into successful digital products through expert app development and fractional CTO services.As a S...Mostre mais
Última atualização: 4 dias atrás • Promovida
Site Reliability Engineer Pl

Site Reliability Engineer Pl

Turbi • Itupeva, São Paulo, Brasil
A Turbi é a locadora do futuro : 100% digital, movida a tecnologia, gente boa e paixão por transformar a forma como as pessoas se locomovem. A gente abre o carro pelo app (sim, sem chave!) e acredita...Mostre mais
Última atualização: 1 dia atrás • Promovida
System Engineer

System Engineer

InComm Payments • rio claro, Brasil
In this role, you will be critical to the daily operations, maintenance, and optimization of our observability platforms—. Splunk, DynaTrace, and NewRelic.The ideal candidate will be a proactive pro...Mostre mais
Última atualização: 4 dias atrás • Promovida
Full Stack Engineer

Full Stack Engineer

Astra AI • Jaguariúna, São Paulo, Brazil
Location : San Francisco, CA - Remote (LATAM preferred) Work Type : Full-Time We’re partnering with a confidential, high-growth technology company in Silicon Valley that’s building AI-powered platf...Mostre mais
Última atualização: há mais de 30 dias • Promovida
Software Engineer Site Reliability Engineer

Software Engineer Site Reliability Engineer

Scubyt • Itatiba, São Paulo, Brazil
Software Engineer Site Reliability Engineer Location : Brazil REMOTE Duration : Fulltime CLT / REMOTE About the role The Application SRE Team supports several critical components of our foundational...Mostre mais
Última atualização: 4 dias atrás • Promovida
Deployment Reliability Engineer

Deployment Reliability Engineer

HCLTech • Indaiatuba, São Paulo, Brazil
Your role and responsabilities : Manage continuous delivery and configuration of SAP Ariba Cloud products using modern deployment tools. Respond quickly to deployment requests and provide technical ...Mostre mais
Última atualização: 16 dias atrás • Promovida
Site Reliability Engineer

Site Reliability Engineer

HCLTech • Americana, São Paulo, Brazil
Your role and responsabilities : Handling major incidents via CIRS (Critical Issue Response System) and providing frequent updates until resolution. Performing deep-dive application troubleshootin...Mostre mais
Última atualização: 10 dias atrás • Promovida
Site Reliability Engineer (Relocation to Portugal)

Site Reliability Engineer (Relocation to Portugal)

Affinity • Itatiba, São Paulo, Brazil
A Job? Or a Lifetime Experience? Start Yours Here! • •Please note that we're aiming at an expatriation to Portugal • • We are a Portuguese technology consulting company with a strong outward look...Mostre mais
Última atualização: 28 dias atrás • Promovida
Site Reliability Engineer

Site Reliability Engineer

Review All • Itupeva, São Paulo, Brasil
This company operates a global computing platform that enables businesses to programmatically deploy single-tenant Bare Metal instances across multiple regions worldwide. They are a team of passiona...Mostre mais
Última atualização: 1 dia atrás • Promovida