The Cloud DevOps Engineers in this TalentCloud should be able to work with data and product engineering teams, focusing on Cloud infrastructure and automation. A key responsibility of TalentCloud members is championing and leading infrastructure as code. Additional responsibilities and expectations are:
- Design, deploy and support new and existing Cloud infrastructure in support of current customers and future growth
- Serve as an escalation point for incidents and requests regarding performance and configuration, including off-hours escalations
- Maintain system architecture documentation, run-books, diagrams, and assets in the configuration management database
- Develop and continually improve guidelines and procedures related to system management including process and efficiency improvements
- Monitor and tune system infrastructure to identify and eliminate issues and potential bottlenecks
- Prepare and review periodic reports related to performance and capacity
- Manage private/public cloud deployments (VMware, F5, Azure, AWS)
- At least 5 years of experience in working on cloud infrastructure with automation
- At least 3 years of experience with system and application configuration automation (inclusive of scripting with Python/Bash, and DevOps tools like Chef/Puppet/Ansible/Terraform/Jenkins)
- At least 3 years of experience with Linux
- Experience designing, building, and operating complex distributed software systems in on-prem and public cloud environments
- Strong DevOps fundamentals with experience in building CI/CD pipelines
- Experience with containerization-related technologies like Kubernetes, Docker, Vagrant, or LXC
- Experience with Public Cloud (AWS, Azure, GCP) and on-prem virtualization technologies (vSphere/KVM)
- Demonstrated proficiency with at least one modern programming language like Java, JavaScript, Python, or Go
- Experience with monitoring and alerting tools like New Relic, Nagios, Icinga, or DataDog
- Excellent written and verbal communication skills
- Good organizational and time management skills to work on multiple tasks/projects simultaneously
- Experience in Infrastructure as Code (IaC)
- Ensure configuration compliance using configuration management tools
- Experience in mentoring junior software engineers
- Experience with working in a DevOps culture
- Experience in supporting mission-critical systems
- Certifications related to AWS, Microsoft Azure, or Google Cloud Platform
Required Skills
- 3-5 years of experience building and maintaining Cloud infrastructure (for example, on AWS: VPC, EC2, Security Groups, IAM, ECS, CodeDeploy, CloudFront, S3)
- Strong understanding of how to secure Cloud environments and meet compliance requirements
- Expertise using Chef for configuration management
- Hands-on experience deploying and managing infrastructure with Terraform
- A solid foundation in networking and Linux administration
- Experience with Docker, GitHub, Jenkins, and ELK, and with deploying applications to the Cloud
- Ability to learn/use a wide variety of open source technologies and tools
Preferred Skills
- Programming skills with Java, Python
- Scripting skills with Bash, AWK, Perl, sed, and Unix utilities
- Software Development best practices and skills
- Experience with multiple Cloud platforms (Azure, AWS, GCP) is highly desirable