As a member of our Cloud Operations organization, you will collaborate with our Development, Product Management, and other groups to design, implement and deliver highly scalable and highly available SaaS solution to our enterprise customers.
Job Skills : -
- 5+ years of hands-on OS and infrastructure support experience.
- Operating Systems - Linux (CentOS, Ubuntu, Amazon), Windows
- Experience with cloud administration (AWS, OCI)
- Knowledge of Jenkins, Terraform, GIT, Ansible
- Should have skills with Bash / Perl, Shell, Python, Groovy.
- Knowledge and understanding of AWS, networks, file shares, SFTP / AS2 / MFT, NFS, backups
- Knowledge of containerization technologies and management tools
- Knowledge & Experience on JIRA, Confluence.
- Monitoring tools – Zabbix, LightRun etc.
Key Responsibilities : -
Analyse and resolve production & non-production issues within set SLAs.Identify the severity level of the customer-impacting issue & react accordingly.Investigate system logs to resolve production issues and restore services.Provide RCA and recommendations for production issues to avoid recurrence.Regularly communicate incident / request status to teams impacted.Deploy, monitor & troubleshoot new software versions in production & non-Production environments.Identify trends in reported issues, report them and suggest possible solutions.Design procedures to eliminate manual processes in the team.Required to have good communication skills - both written and verbal.should be able to work under a high-pressure environment with flexible work timingsBe a part of a 24x7 on-call rotation to provide continuous support.Nice to Have :
AWS cloud platform certification is highly desirable.
Experience with Agile methodologies such as scrum or Kanban
Experience working with Oracle and / or other RDBMS and data load / import / export.
Understanding of Hadoop / Spark clusters and jobs monitoring
IBM WebSphere and WebLogic etc.
Elastic Search, EMR, Kibana
Experience in supporting Informatica and Looker