Talent.com
A vaga não está disponível no seu país.
Site Reliability Engineer (Middle) ID38916

Site Reliability Engineer (Middle) ID38916

AgileEngineBrasília, DF, br
Há +30 dias
Tipo de vaga
  • Quick Apply
Descrição da vaga

Job Description

AgileEngine is an Inc. 5000 company that creates award-winning software for Fortune 500 brands and trailblazing startups across 17+ industries. We rank among the leaders in areas like application development and AI / ML, and our people-first culture has earned us multiple Best Place to Work awards.

If you're looking for a place to grow, make an impact, and work with people who care, we'd love to meet you!

WHAT YOU WILL DO

  • Shift : Monday – Thursday 8AM – 7PM PST (11AM – 10PM EST) with rotating on-call;
  • Manage alerts daily, check systems, and escalate issues as needed;
  • Be part of a team that provides 24×7 on-call support for critical SaaS events;
  • Be available in case of emergencies when team members are not available or need help;
  • Document issues and remediation steps;
  • Proactively create appropriate monitors in the EKS / K8S ecosystem;
  • Deploy to EKS / K8s cluster using Terraform and Helm;
  • Learn and maintain existing infrastructure running under Docker Swarm;
  • Improve existing infrastructure health by implementing checks and scripts to correct known issues;
  • Maintain and develop deployment code;
  • Automate manual tasks;
  • Implement / integrate new technologies in our Cloud Infrastructure;
  • Collaborate with other teams and departments to provide the highest level of support and assistance;
  • Apply a real customer focus when planning deployments / updates, having the customer in the forefront of the mind, and considering the impact on them before making changes;
  • Work closely on solutions with Support, Customer Success, Migration, and Professional Services teams to provide the best in class SaaS service to our customers;
  • Perform RCA and take necessary corrective actions to prevent the recurrence of issues;
  • Create and assign alert-related actions to the appropriate team after the investigation;
  • Handle support requests for environment-specific actions;
  • Identify and provide automation requirements to improve RCA.

MUST HAVES

  • 2+ years of professional experience;
  • Experience working with Datadog ;
  • Hands-on experience as an AWS Cloud Engineer;
  • Working knowledge of EKS / Terraform / Helm;
  • Working Experience with Docker and Docker Swarm;
  • Good understanding of AWS IAM roles and policies;
  • Experience logging and monitoring AWS resources using CloudWatch logs;
  • Experience working in a Linux environment;
  • Proficient in Bash and / or Python scripting;
  • A strong understanding of web technologies such as REST APIs;
  • Working Experience with monitoring solutions, such as Grafana and Prometheus;
  • Excellent oral and written communication skills;
  • Customer-facing communication skills to effectively explain issues and RCAs to them;
  • Experience in Product / Application Support for SaaS-based products;
  • Understanding of APIs, Databases, Systems Architecture, and Design;
  • Designing, implementing, and operating in a DevSecOps;
  • Excellent communication skills, both written and verbal;
  • Ability to work independently as well as within a collaborative environment;
  • A technical aptitude with the desire to learn new and evolving technologies;
  • Upper-Intermediate English level.
  • NICE TO HAVES

  • Experience
  • THE BENEFITS OF JOINING US

  • Professional growth : Accelerate your professional journey with mentorship, TechTalks, and personalized growth roadmaps.
  • Competitive compensation : We match your ever-growing skills, talent, and contributions with competitive USD-based compensation and budgets for education, fitness, and team activities.
  • A selection of exciting projects : Join projects with modern solutions development and top-tier clients that include Fortune 500 enterprises and leading product brands.
  • Flextime : Tailor your schedule for an optimal work-life balance, by having the options of working from home and going to the office – whatever makes you the happiest and most productive.
  • Your application doesn't end here! To unlock the next steps, check your email and complete your registration on our Applicant Site . The incomplete registration results in the termination of your process.

    Requirements

    2+ years of professional experience; Experience working with Datadog; Hands-on experience as an AWS Cloud Engineer; Working knowledge of EKS / Terraform / Helm; Working Experience with Docker and Docker Swarm; Good understanding of AWS IAM roles and policies; Experience logging and monitoring AWS resources using CloudWatch logs; Experience working in a Linux environment; Proficient in Bash and / or Python scripting; A strong understanding of web technologies such as REST APIs; Working Experience with monitoring solutions, such as Grafana and Prometheus; Excellent oral and written communication skills; Customer-facing communication skills to effectively explain issues and RCAs to them; Experience in Product / Application Support for SaaS-based products; Understanding of APIs, Databases, Systems Architecture, and Design; Designing, implementing, and operating in a DevSecOps; Excellent communication skills, both written and verbal; Ability to work independently as well as within a collaborative environment; A technical aptitude with the desire to learn new and evolving technologies; Upper-Intermediate English level.

    Criar um alerta de emprego para esta pesquisa

    Site Reliability Engineer • Brasília, DF, br

    Vagas relacionadas
    C# Engineer (Senior / Lead) ID41548

    C# Engineer (Senior / Lead) ID41548

    AgileEngineBrasília, DF, br
    Quick Apply
    Fortune 500 brands and trailblazing startups across 17+ industries.We rank among the leaders in areas like application development and AI / ML, and our people-first culture has earned us multiple Bes...Mostre maisÚltima atualização: 12 dias atrás
    Full Stack Engineer ID40985

    Full Stack Engineer ID40985

    AgileEngineBrasília, DF, br
    Quick Apply
    Fortune 500 brands and trailblazing startups across 17+ industries.We rank among the leaders in areas like application development and AI / ML, and our people-first culture has earned us multiple Bes...Mostre maisÚltima atualização: 19 dias atrás
    [DG] Senior Full Stack Engineer

    [DG] Senior Full Stack Engineer

    LatamCentBrasília, Federal District, Brazil
    Remota
    Quick Apply
    Full-Time | Remote from Latin America | Required Overlap : 9 AM - 3 PM PST (6 hours).We're hiring a Senior Full Stack Engineer to design, build, and scale EkLines AI-powered documentation platform.Y...Mostre maisÚltima atualização: há mais de 30 dias
    Full Stack Engineer ID38918 ($2,500 signing bonus)

    Full Stack Engineer ID38918 ($2,500 signing bonus)

    AgileEngineBrasília, DF, br
    Quick Apply
    Fortune 500 brands and trailblazing startups across 17+ industries.We rank among the leaders in areas like application development and AI / ML, and our people-first culture has earned us multiple Bes...Mostre maisÚltima atualização: há mais de 30 dias
    Backend Engineer ID37418 ($3,000 signing bonus)

    Backend Engineer ID37418 ($3,000 signing bonus)

    AgileEngineBrasília, DF, br
    Quick Apply
    Fortune 500 brands and trailblazing startups across 17+ industries.We rank among the leaders in areas like application development and AI / ML, and our people-first culture has earned us multiple Bes...Mostre maisÚltima atualização: há mais de 30 dias
    Desenvolvedor de Software Pleno

    Desenvolvedor de Software Pleno

    NortelliNúcleo Bandeirante - Brasília, DF, BR
    Desenvolver e manter sistemas web de alta performance e estabilidade;.Criar e evoluir aplicativos desktop com foco em usabilidade e eficiência. .Implementar integrações complexas entre plataformas e...Mostre maisÚltima atualização: 20 dias atrás
    Software Engineer (Middle / Senior) ID42373

    Software Engineer (Middle / Senior) ID42373

    AgileEngineBrasília, DF, br
    Quick Apply
    Fortune 500 brands and trailblazing startups across 17+ industries.We rank among the leaders in areas like application development and AI / ML, and our people-first culture has earned us multiple Bes...Mostre maisÚltima atualização: 1 dia atrás
    Cloud Network Engineer in Security Domain

    Cloud Network Engineer in Security Domain

    CodiLimeBrasília
    Quick Apply
    CodiLime is a software and network engineering industry expert and the first-choice service partner for top global networking hardware providers, software providers and telecoms.We create proofs-of...Mostre maisÚltima atualização: há mais de 30 dias
    Full Stack Engineer (Senior) ID40713

    Full Stack Engineer (Senior) ID40713

    AgileEngineBrasília, DF, br
    Quick Apply
    Fortune 500 brands and trailblazing startups across 17+ industries.We rank among the leaders in areas like application development and AI / ML, and our people-first culture has earned us multiple Bes...Mostre maisÚltima atualização: 19 dias atrás
    Full Stack Engineer ID40916 ($2,500 signing bonus)

    Full Stack Engineer ID40916 ($2,500 signing bonus)

    AgileEngineBrasília, DF, br
    Quick Apply
    Fortune 500 brands and trailblazing startups across 17+ industries.We rank among the leaders in areas like application development and AI / ML, and our people-first culture has earned us multiple Bes...Mostre maisÚltima atualização: 21 dias atrás
    Full Stack Engineer (Senior / Lead) ID41207

    Full Stack Engineer (Senior / Lead) ID41207

    AgileEngineBrasília, DF, br
    Quick Apply
    Fortune 500 brands and trailblazing startups across 17+ industries.We rank among the leaders in areas like application development and AI / ML, and our people-first culture has earned us multiple Bes...Mostre maisÚltima atualização: 18 dias atrás
    Sales Solutions Manager | SaaS Sales & Product Collaboration

    Sales Solutions Manager | SaaS Sales & Product Collaboration

    RoverpassBrasília, Distrito Federal, BR
    Quick Apply
    RoverPass, the ultimate reservation software, makes the reservation process easy to manage by streamlining your day-to-day operations and provides the most comprehensive set of campground managemen...Mostre maisÚltima atualização: 29 dias atrás