Job description
About the Role
Our client is a growing payment platform company looking to hire a DevOps & Site Reliability Engineer to support, maintain, and scale their production systems. This role will work closely with engineering teams to ensure system reliability, performance, and security.
Key Responsibilities
. Manage and maintain cloud infrastructure (AWS / Azure / GCP)
. Build, maintain, and improve CI/CD pipelines
. Ensure high availability, performance, and reliability of systems
. Monitor system health, troubleshoot incidents, and perform root cause analysis
. Automate infrastructure provisioning and deployment using IaC tools
. Support application deployments and production releases
. Work closely with developers to improve system scalability and resilience
. Ensure security best practices across infrastructure and deployments
Requirements
. At least 5 years of experience in DevOps, SRE, or Infrastructure Engineering
. Strong experience with cloud platforms (AWS preferred)
. Hands-on experience with CI/CD tools (e.g. Jenkins, GitHub Actions, GitLab)
. Experience with containerization and orchestration (Docker, Kubernetes)
. Knowledge of monitoring and logging tools (Prometheus, Grafana, ELK, etc.)
. Scripting experience (Bash, Python, or similar)
. Experience supporting production systems in a high-availability environment
. FinTech or payments industry experience is a plus


