Key Responsibilities
Trading/Payment System Operations Support
- System Stability and Performance: Participate in the deployment, monitoring, and daily maintenance of core trading components. Assist with system performance tuning to ensure stable operation.
Cloud-Native and Automation Practices
- Infrastructure as Code (IaC): Familiar with and apply tools like Terraform and Ansible to participate in the automated deployment and management of cloud resources, particularly AWS.
- Containerized Platform: Participate in the daily maintenance and optimization of Kubernetes (K8s) clusters, assisting with service elastic scaling and traffic management.
- CI/CD Processes: Participate in the building and maintenance of CI/CD pipelines, familiar with tools such as GitLab CI and GitHub Actions, supporting rapid code deployment and release.
Monitoring, Alerting, and Security
- Monitoring and Alerting: Participate in the construction and maintenance of monitoring and alerting systems, familiar with tools like Prometheus and Grafana, ensuring timely detection and response to system anomalies.
- Security Practices: Understand DevSecOps principles, participate in the application of security scanning tools, and assist in maintaining production environment security policies.
Qualifications
Experience Requirements
- Years of Experience: 2+ years of experience in IT infrastructure or DevOps/SRE related roles.
- Industry Background: Experience in internet finance, e-commerce, or high-concurrency system operations is preferred.
Hard Skills
- Operating Systems & Scripting: Solid foundation in Linux operating systems, familiar with Shell/Python scripting, and possess basic troubleshooting skills.
- CI/CD Tools: Familiar with at least one CI/CD tool, such as GitLab CI, GitHub Actions, or Jenkins.
- Containers & Orchestration: Familiar with Docker container technology and understand basic Kubernetes (K8s) concepts and operations.
- Infrastructure as Code (IaC): Familiar with IaC tools like Terraform and Ansible, and have experience using the AWS cloud platform.
- Monitoring & Alerting: Familiar with the use of monitoring tools such as Prometheus and Grafana.