Search by job, company or skills

Infinite Computer Solutions Pte Ltd

Site Reliability Engineer (SRE)

5-10 Years
SGD 4,250 - 10,000 per month
Save
  • Posted 4 hours ago
  • Be among the first 10 applicants
Early Applicant
Quick Apply

Job Description

Site Reliability Engineer (SRE)

Location: Singapore

Employment Type: Contract 12 months

Job Summary

We are seeking a Site Reliability Engineer (SRE) to ensure the reliability, scalability, performance, and availability of critical applications and infrastructure. The ideal candidate will have strong experience in system administration, cloud platforms, automation, monitoring, incident management, and DevOps practices.

Key Responsibilities

  • Maintain and improve system reliability, availability, and performance.
  • Monitor production environments and proactively resolve issues.
  • Manage incident response, troubleshooting, RCA, and problem management.
  • Automate operational tasks and deployment processes.
  • Implement and maintain monitoring, alerting, and observability solutions.
  • Collaborate with Development, Infrastructure, Security, and Operations teams.
  • Support CI/CD pipelines and release management activities.
  • Create and maintain operational runbooks and documentation.
  • Drive continuous improvement initiatives to reduce manual effort and increase system resilience.

Required Skills

  • Strong Linux/Unix administration experience.
  • Experience with cloud platforms (AWS, Azure, or GCP).
  • Proficiency in scripting (Python, Shell, Bash, or PowerShell).
  • Experience with Docker and Kubernetes.
  • Knowledge of CI/CD tools such as Jenkins, GitLab CI, or Azure DevOps.
  • Experience with monitoring tools such as Prometheus, Grafana, ELK, Splunk, Dynatrace, or AppDynamics.
  • Understanding of networking, load balancing, and system architecture.
  • Strong troubleshooting and analytical skills.

Preferred Qualifications

  • Experience in Banking, Financial Services, or large enterprise environments.
  • Knowledge of Infrastructure as Code (Terraform, Ansible, CloudFormation).
  • Understanding of DevOps, SRE, and ITIL practices.
  • Relevant certifications in AWS, Azure, Kubernetes, or DevOps are an advantage.

Experience

  • 3–10+ years of experience in SRE, DevOps, Cloud Operations, Production Support, or Infrastructure Engineering roles.

 

 

Interested candidates can connect on +6586533349 (WhatsApp chat only)

More Info

Job Type:
Function:

Job ID: 149046755