ROLE SUMMARY
We are looking for Operations Support Engineers to manage and support cloud and on-prem infrastructure.
The role covers AWS, server administration, monitoring, automation, security, and reliability, with the goal of keeping systems highly available.
Senior roles will also handle team leadership, incidents, and operational governance.
KEY RESPONSIBILITIES
- Support and manage AWS cloud infrastructure across Dev/Test/Prod
- Monitor system health, logs, alerts, and performance
- Maintain Linux & Windows servers, virtual machines, and containers
- Implement automation and Infrastructure as Code (Terraform, Ansible, CloudFormation)
- Manage monitoring tools (CloudWatch, Prometheus, Grafana, ELK)
- Ensure security, access control, and compliance standards are followed
- Perform patching, upgrades, backup, and disaster recovery activities
- Troubleshoot incidents and support system availability
- Create documentation, runbooks, and SOPs
- Collaborate with application and engineering teams
REQUIRED SKILLS & EXPERIENCE
- Experience with Linux & Windows Server administration
- Hands-on experience with VMware vSphere / Hyper-V
- AWS services: EC2, ECS, S3, RDS, IAM, VPC, CloudWatch
- Experience with monitoring & observability tools
- Automation & IaC tools (Terraform, Ansible, CloudFormation)
- Containers: Docker / Kubernetes / ECS
- Scripting: Python, PowerShell, Bash
- Networking knowledge, basic to strong (DNS, TCP/IP, VPN)
- Experience with backup, DR, and high availability
- Exposure to GitHub and CI/CD