Work as part of the infrastructure team to ensure smooth operations that conform to the agreed Service Level Agreement (SLA) and stay within the budgeted capital and operational expenses.
Roles & Responsibilities:
- Establish ITSM standards, processes and guidelines that are in line with industry standards and best practices.
- Provide technical leadership to team members
- Provide technical consultancy to customers on matters related to system performance, integration and configuration.
- Assist the project Service Delivery Manager (SDM) in managing day-to-day operations.
- Subject Matter Expert (SME) level of hands-on knowledge in shell, Perl, Python or PowerShell
- Analyze system performance and make recommendations for optimization.
- Ensure that the production environment is highly available per the needs of the business and established Service Level Agreements.
- Lead and establish Root Cause Analysis for all High Severity issues. Work with multiple teams for successful resolution of issues and incidents.
- Review incident reports.
- Participate in the upgrade/patching planning and execution of the Middleware software.
- Provide subject matter expertise (SME) on issues, security-related threats and vulnerabilities as they pertain to middleware.
- Participate in establishing and testing disaster recovery policies and procedures across all Middleware environments.
- Ensure adherence to applicable Change and Release Management processes.
- Develop technical documentation and procedures for monitoring and performance.
- Provide regular project status updates to the SDM and perform any other duties as and when assigned.
- Lead problem determination on system errors or malfunctions, and work with the application team/supplier to identify, diagnose and rectify the problem.
- Perform capacity studies to analyze current resource utilization and estimate future requirements.
- Monitor systems to ensure high availability and optimal performance.
- Update documentation of work.
- Support internal and external audit exercises for the maintenance of various certifications and contractual requirements.
- Keep abreast of technological advancements, emerging standards and new software or hardware solutions that may affect decisions on systems building or enhancements.
- Must be able to work in a 24x7 operations standby support environment.
Job Requirements:
- At least 8 years of working experience in end-to-end operations management, including configuration, upgrades and issue resolution for Wintel infrastructure such as Windows Servers, OS clustering, Hyper-V and VMware virtualization, IIS, BizTalk and other related products, and for cloud services such as AWS and MS Azure.
- Specialization in one of the major products: Microsoft, VMware, AWS or MS Azure.
- Hands-on experience with Windows system and OS clustering administration, configuration and troubleshooting.
- Hands-on experience with Microsoft middleware, e.g. IIS administration and configuration.
- Hands-on experience managing large-scale environments running on server virtualization technologies such as VMware Server/ESX or Microsoft Hyper-V/Virtual Server.
- Good understanding of ITIL processes. Familiar with ISO 9001 and ISO 27001.
- Working experience in a cloud environment (AWS, Azure) would be an advantage.
- Microsoft Certification (MCSA/MCSE) or equivalent certification in relevant programs is required.
- VMware Certification (VCP) or equivalent certification in relevant programs is required.
- AWS/Microsoft Azure cloud certification is required.
- ITIL v3 certification is required
- Red Hat Enterprise Linux certification is desired.
- Cloud-related experience with Microsoft Azure, AWS and Google Cloud, and with DevOps tools such as Ansible, Docker, Jenkins and Kubernetes (including scripting experience).
EA Licence No.: 18S9405 / EA Reg. No.: R1330864
Skills & Competencies
- Wintel, Windows Servers, OS clustering, Hyper-V, VMware virtualization