Collaborate with application teams to plan and execute EOS upgrades effectively
Establish configuration standards and design guidelines for UNIX and system software to optimize availability, performance, resilience, monitoring, backup, and recovery
Provide technical support and guidance to development and operations teams to resolve system issues
Write and maintain scripts using Perl, KSH, and Python to automate tasks and improve system operations
Utilize development, testing, and deployment tools such as GIT/GITHUB, MAVEN, JENKINS, DOCKER, KUBERNETES, ANSIBLE, PUPPET, CHEF, and NAGIOS to support software lifecycle and system monitoring
Troubleshoot system issues and lead recovery efforts with a proactive approach and sense of urgency
Escalate critical problems to problem management and IMS management for timely resolution
Perform root cause analysis and implement corrective actions to prevent recurrence of issues
Ensure ongoing supportability of operating systems and system software through planning and execution of upgrades
Lead and supervise major change requests and their execution to maintain system integrity
Conduct performance analysis and tuning to optimize system efficiency
Automate system operations to enhance reliability and reduce manual intervention
Track, plan, and drive OS and software upgrades to maintain continuous supportability
Manage backup and recovery processes to safeguard system data and ensure business continuity