Bachelor’s degree in Computer Science, IT or a related field or equivalent work experience.
This position will manage all work performed in a 24/7 high-availability operations environment. The candidate must be available to work most weekday, weekends and holidays on an on call basis.
At least 5 years experience in a technology leadership role managing 24 x 7 computer operations in an enterprise multi-platform environment
Experience leading diverse groups, managing global or remote teams, managing workload, schedules, performance reviews, hiring/interviewing, training, and development plans.
Must have a strong dedication to customer service of both Internal and External clients.
Experience with the following on mainframe and modern server technologies: batch processing, scheduling, storage, web services, networking, and transmissions.
Must have the ability to manage multiple tasks, establish priorities, meet deadlines and concentrate on detailed information in a fast-paced, demanding work environment
Must manage and lead incident escalation, problem solving and remediation from start to resolution for all for production issues by coordinating with the multiple teams to resolve these issues.
Responsible for ensuring that all root-cause analysis is researched and documented for high severity incidents
Ensures Production Environment SLAs are met. Monitors all Environment Availability and Troubleshoots issues as required and escalates as necessary to ensure that all incidents are identified, triaged and resolved within SLA guidelines.
Requests software fixes and enhancements from application developers as necessary
Monitors and tracks automated batch processing and file transfers. Creates and maintains documentation. Troubleshoots and corrects problems with existing automation and procedures.
Analyze and Recover failed batch processes and scheduling issues as required.
Ensure successful file copies and/or transfers between systems
Assist less experienced team members, reviews their work, and assists in problem resolution as necessary. Provide mentorship and team development opportunities
Reviews and Implements continuous improvement regarding: Monitoring Strategies, Tools, Triage Procedures, Documentation and other Processes.
Respond to high severity issues and work to restore normal service operations as quickly as possible
Represent Technology Operations to the organization ensuring that high-availability and the ability to identify customer-facing issues is included in the development or deployment of new products and services.
Work with architecture, development and engineering teams to identify root cause for recurring incidents and create an action plan for resolution.
Monitor systems and services for most efficient operation, identifying fault conditions as well as opportunities for further optimization
Maintain escalation and contact lists for mission critical systems and services
Skills/Qualifications/Other Personal Attributes Required:
Excellent verbal and written communication skills
Able to work under pressure to meet tight deadlines
Flexible – able to meet changing requirements and priorities.
Able to manage conflict effectively
Strong Knowledge of Enterprise Application and Business-to-Business Integration.
Working knowledge of OS/390/zOS, TSO/ISPF and JCL.
Working knowledge of CA-7, CA-Autosys, or other similar scheduling software
Effective analytical, communication, interpersonal and problem-solving skills.
Good understanding of Mainframe and Windows platforms
SQL a Plus
Proficiency using Microsoft Office software products.