Search by job, company or skills

Z

Cloud Infra Analyst - Contract

2-4 Years
SGD 6,000 - 7,000 per month
new job description bg glownew job description bg glownew job description bg svg
  • Posted a month ago
  • Be among the first 10 applicants
Early Applicant

Job Description

Role Purpose

The Tech Arch & Cloud Infra Analyst (Operations & Support) is responsible for the day-to-day management, maintenance, and support of the client's enterprise cloud environment on Amazon WebServices (AWS). This role serves as the operational backbone of the cloud infrastructure function, ensuring the reliability, availability, performance, and security of all AWS-hosted systems and services.

The successful candidate will work at the intersection of infrastructure engineering, operations, and security - partnering closely with development, security, and business teams to maintain a high-availability cloud environment that meets regulatory, performance, and cost-efficiency standards.

Key Responsibilities

1. Cloud Infrastructure Management

. Provision, configure, and maintain AWS resources including EC2 instances, S3 storage, RDS databases, VPCs, Lambda functions, and other services as required.

. Manage and automate infrastructure deployments using Infrastructure as Code (IaC) tools such as AWS CloudFormation or Terraform.

. Implement and manage configuration management systems (e.g., Ansible, Chef, Puppet) to ensure consistency and standardization across environments.

. Monitor cloud resource utilization and continuously optimize for cost-efficiency, identifying rightsizing and savings opportunities.

. Maintain environment hygiene through regular audits, patch management, and lifecycle management of cloud resources.

2. System Monitoring & Performance Management

. Implement and configure comprehensive monitoring solutions (e.g., Amazon CloudWatch, Prometheus, Grafana) to track the health, performance, and availability of AWS resources.

. Analyze monitoring data to identify trends, detect anomalies, and proactively address potential issues before they impact operations.

. Perform performance tuning and optimization of AWS resources to ensure consistently optimal throughput and response times.

. Develop and maintain dashboards and operational reports to provide clear visibility into system performance, capacity, and health metrics.

. Establish alerting thresholds and escalation procedures aligned with service-level agreements (SLAs).

3. Incident Management & Support

. Respond to and resolve incidents, service requests, and automated alerts related to AWS infrastructure in a timely and effective manner.

. Troubleshoot complex multi-layer issues, perform root cause analysis (RCA), and implement corrective and preventive actions.

. Provide after-office hours support for critical system activities, maintenance windows, and emergency responses as required.

. Collaborate with development, security, and business teams to coordinate incident resolution and ensure timely closure within agreed SLAs.

Maintain detailed incident logs and post-incident reports to drive continuous learning and prevention.

4. Security & Compliance

. Implement and enforce cloud security best practices to protect AWS resources, data, and workloads from threats and unauthorized access.

. Configure and manage key security services including IAM policies, Security Groups, Network ACLs (NACLs), AWS Security Hub, GuardDuty, and AWS Config.

. Ensure the cloud environment remains compliant with relevant industry standards and regulatory frameworks including GDPR, HIPAA, and PCI DSS.

. Conduct regular security audits including log reviews, account access reviews, and vulnerability assessments to identify and mitigate risks.

Support penetration testing activities and remediate identified vulnerabilities in accordance with risk prioritization.

5. Automation & Continuous Improvement

. Identify, design, and implement automation solutions to reduce manual effort and improve the reliability and repeatability of operational tasks.

. Develop and maintain scripts and tooling (e.g., Python, Bash, PowerShell) to automate infrastructure management, deployment, and monitoring workflows.

. Implement and support CI/CD pipelines in collaboration with development teams, ensuring seamless and secure delivery of infrastructure changes.

. Stay current with emerging AWS services, cloud-native patterns, and industry best practices, and champion their adoption where appropriate.

Lead and contribute to continuous improvement initiatives that enhance platform reliability, reduce operational toil, and improve team efficiency.

6. Documentation & Knowledge Management

. Create and maintain comprehensive, up-to-date documentation of AWS infrastructure architecture, configurations, and standard operating procedures (SOPs).

. Develop and curate a knowledge base of common issues, troubleshooting guides, resolutions, and best practices for use by the operations team.

. Produce clear and actionable runbooks for routine and emergency operational procedures.

Provide training, guidance, and technical support to other teams on AWS infrastructure practices, tools, and operational standards.

Required Qualifications & Experience

Education

. Bachelor's degree in Computer Science, Information Technology, Engineering, or a related field or equivalent practical experience.

Experience

. Minimum 2 years of hands-on experience in cloud infrastructure operations, preferably in an enterprise AWS environment.

. Demonstrated experience managing and supporting production AWS environments at scale.

. Proven experience with Infrastructure as Code tools (Terraform and/or AWS CloudFormation).

. Experience with Linux and/or Windows server administration in a cloud context.

. Hands-on experience with monitoring and observability platforms (CloudWatch, Prometheus, Grafana, or equivalent).

Technical Skills

. AWS Core Services: EC2, S3, RDS, VPC, IAM, Lambda, ELB, Auto Scaling, Route 53, CloudFront

. Security Services: IAM, Security Groups, NACLs, AWS Security Hub, GuardDuty, AWS Config, CloudTrail

. IaC & Config Management: Terraform, CloudFormation, Ansible (or equivalent)

. Monitoring & Observability: Amazon CloudWatch, Prometheus, Grafana, or similar

. Scripting: Python, Bash, and/or PowerShell for automation and tooling

. CI/CD: Experience with Jenkins, AWS CodePipeline, GitLab CI, or equivalent

Compliance Frameworks: Working knowledge of GDPR, HIPAA, and/or PCI DSS requirements

Preferred Qualifications

. AWS Certified Solutions Architect (Associate or Professional)

. AWS Certified SysOps Administrator - Associate or AWS Certified DevOps Engineer - Professional

. Experience with container orchestration platforms (Amazon EKS, ECS, or Kubernetes)

. Exposure to site reliability engineering (SRE) principles and practices

. Experience with cost optimization tools (AWS Cost Explorer, Trusted Advisor, third-party FinOps tools)

. Familiarity with ITIL v4 framework for IT service management

More Info

Job Type:
Industry:
Employment Type:

Job ID: 143006189