Responsibilities
- Deliver an excellent customer experience by conducting regular system reviews and proactively improving system performance and resilience
- Diagnose and resolve incoming system incidents promptly to minimize service disruption
- Analyze technical problems and collaborate with teams to define effective solutions
- Partner with Engineering to develop technical plans, configurations, and solutions based on business and application requirements
- Establish and maintain configuration standards and design guidelines for Linux and system software covering availability, performance, resilience, monitoring, backup, and recovery
- Provide technical support and guidance to development and operations teams to ensure smooth system functioning
- Troubleshoot system issues and lead recovery efforts, escalating major incidents to problem management and IMS management
- Perform root cause analysis and implement preventive actions to avoid recurrence of issues
- Lead and manage outsourced technical teams of system administrators to ensure quality delivery
- Ensure operating system and system software supportability by planning and leading upgrade projects
- Review, approve, and supervise execution of major change requests to maintain system integrity
- Conduct performance analysis and tuning to optimize system efficiency
- Automate system operations to improve reliability and reduce manual effort
- Manage patching and upgrade processes for middleware to maintain continuous supportability
- Oversee backup and recovery operations to safeguard data integrity
- Conduct asset management to track and maintain system resources
- Design and enforce access control plans and matrices for infrastructure services, performing regular access reviews and validations
- Implement monitoring solutions to ensure efficient and reliable service delivery
- Apply new technologies and processes to enhance system supportability, recoverability, availability, and performance
- Provide operational expertise and support across project phases including requirements gathering, design, procurement, implementation, and handover to operations
- Mentor less experienced engineers and provide technical guidance to support their development
- Support audit and compliance activities to meet regulatory and organizational standards
- Conduct capacity planning and performance management to ensure system scalability and stability
Preferred competencies and qualifications
- Minimum 5 years of system support experience on Linux/UNIX platforms
- Regional experience is preferred
- Strong technical knowledge of Linux, Solaris, and UNIX hardware and operating systems, and of system services such as volume management, file systems, NTP, DNS, clustering, SSH, TSM, and ITM
- Experience troubleshooting and tuning Linux/UNIX systems
- Experience implementing and operating Linux server clustering
- Knowledge of information security principles
- Understanding of data communication and network concepts
- Experience with middleware and application execution in Linux/UNIX environments is an advantage