Key Responsibilities
- TWS design and modification: Create new plans, perform impact analysis, forecast workload needs, and validate requests; maintain and evolve TWS configurations across environments (TWS, DWC, WebAdmin, framework, DB2, workstation agents).
- Troubleshooting and incident management: Resolve Severity 1 incidents, perform cross-functional troubleshooting, escalate as needed, restore services, and perform root-cause analysis; participate in crisis management for scheduler tool incidents.
- TWS administration: Install, configure, upgrade, maintain, and decommission TWS components; manage access rights and security; oversee obsolescence planning.
- Service degradation response: Diagnose service degradation; purge and back up scheduler data and logs; recover from degraded service and schedule-related crises.
- Batch server and file maintenance: Manage flat file cleanup, TSM backups, and file system health; ensure data integrity and retention policies.
- Batch planning and system management: Coordinate batch plans during releases, DRP exercises, and environment migrations; oversee TWS and batch-related activities.
- Projects and migrations: Lead onboarding/offboarding, site migrations, and environment creation and migration (TWS, DB2, flat files); coordinate yearly FAN testing and cross-environment TWS database transfers.
- Performance monitoring and optimization: Analyze and optimize Batch S2i and satellite performance; maintain calendars, resources, events, objects, and batch workflows.
- Development and automation: Create shell and Python scripts and Ansible playbooks for automation; develop automated reporting and tooling to detect and resolve TWS and batch issues.
- Documentation and governance: Maintain standard operating procedures, runbooks, and change documentation; ensure auditability and compliance.
Core Competencies
- Deep expertise in IBM Tivoli Workload Scheduler (TWS) and its components and operations (Master Domain Manager, FTA unlinks, etc.)
- Strong Unix shell scripting and Python development for automation
- Experience with TWS planning, installation, configuration, upgrades, maintenance, and troubleshooting
- Proficiency with batch processing concepts, scheduling, dependencies, and performance tuning
- Mastery of incident management, crisis response, and cross-functional collaboration
- Familiarity with monitoring, performance tuning, and capacity planning
- Experience with ticketing and QA tools (Jira, HP Quality Center)
- Knowledge of DB2, batch file management, and environment migrations
- CI/CD and automation tooling experience (Ansible, scripting) is a plus
- Strong problem-solving, analytical, and communication skills
Qualifications
- Degree in Computer Engineering or equivalent
- Minimum 5 years of professional experience in batch processing and TWS administration
- Hands-on, end-to-end expertise with TWS: plan creation, analysis, installation, configuration, upgrades, maintenance, and troubleshooting
- Hands-on Unix shell scripting experience and at least intermediate Python scripting skills
- Experience with Master Domain Manager, FTA unlinks, and batch-related failure resolution
- Demonstrated experience in troubleshooting failed jobs, file system issues, and capacity-related challenges
- Familiarity with Jira and HP Quality Center
- Strong knowledge of system monitoring, performance evaluation, and tuning
- Excellent written and verbal communication; ability to work with cross-functional teams
Preferred Qualifications
- Experience in banking/financial services or other regulated environments
- Experience with disaster recovery planning (DRP) and site migrations
- Experience with TWS in multi-environment setups (development, test, production)
- Knowledge of CI/CD pipelines related to batch deployment
- Exposure to modern data management practices and backup/restore strategies
Language and Communication
- Fluent English; other languages a plus