VP, Team Lead, SRE Engineer (Mainframe Applications Support), Core Banking Technology, Group Technology
Salary undisclosed
Checking job availability...
Original
Simplified
- End to End Handing of Critical Core banking applications production incidents with work ranging from incident analysis, driving recoveries along with interfacing systems, communicate on major incidents,
- Code fix, SIT Testing, UAT Test planning (Scope, duration), review production changes before deployment Source promotion to production.
- Application improvements ranging from performance and operational improvements, identification and remediation of system and automate Toils.
- Automation of manual activities/ processes and System Health checks for Production teams. (Automation experience required) and ensuring SLIs/ SLOs are met.
- Working in a stretch role to take forward production issues from analysis to production fix/deployment
- Follow Production Support Processes and giving input to strengthen time to time
- Providing status to leads, stakeholders and working with vendors to review the design/fix/enabling for production deployment
- Own communication for Incidents (SLA breaches, Application Major Incidents, Logistics issue) and responsible for communications with management
- Coordinate recurring issues and ensure long-term resolution through proper Incident and Problem Management
- Working with various teams like Infrastructure, development team to resolve, analysis of root cause for complex issues and outages
- Strong stakeholder management skills with focus on continuous service improvement, consistent delivery and stability of production.
- Drives Root Cause Analysis with technology partners, post incident resolution and facilitates RCA reviews.
- Work with Risk team to respond timely to Audit & Risk RFIs. Manage Audit walkthroughs
- An undergraduate degree or higher
- 10-15 years of strong experience in the Banking industry with minimum 5+ years in Run-the-Bank (RTB) lead role with a proven track record of managing mission critical applications SRE.
- Implement Site Reliability Engineering principles with regards to performance, reliability, monitoring, alerting and maintenance in Production environment. Pro-active Capacity monitoring & Observability of production Infrastructure, automated alerting, performance monitoring and reporting tools
- Automation of manual tasks in a CORE Banking ecosystem
- Build and maintain Production monitoring and automation solutions
- Build and implement Service improvements. Identify, measure and report performance trends – SLIs/ SLOs/ SLAs periodically and improve systems performance and associated performance KPIs
- Strong Hands-on experience in RDBMS / Unix / Cloud/ Mainframe (COBOL, JCL, VSAM, CICS, OPC) based large banking applications.
- Strong team player, effective at communicating internationally and used to working closely with remote teams.
- Solid understanding of BAU support, incident, problem management processes as well as escalation management across a diversified environment
- Understanding of Risk Management, Disaster Recovery, Business Continuity, IT Security Architecture, and IT Regulatory Compliance.
- Present facts and recommendations effectively in oral and written form
- Pro-active, independent, resourceful, and able to work in a team
- High attention to detail with focus on understanding the issues with finding solutions.
- Good to have: Good working experience in Elasticsearch, Logstash, Grafana/Kibana, Appdynamics etc.
- Good to have: Experience in Machine Learning/ AI for process efficiency improvements and automation is an added advantage.
- Must have Good functional knowledge of CASA (Current Accounts, Savings Accounts), Customer Master, Cheque processing, Signature maintenance, Customer Statement generation systems – Mainframe and Open Systems.
- Must have strong hands-on experience in Mainframe environments and Open system environments.
- Hands-on SRE/ Production support experience majorly in Core Banking platform – Customer master systems, Current Account/Savings Account systems (CASA), Statements generation systems.
- Strong technical skills, e.g. scripting or programming experience, DBA/SA skills etc.
- Strong transformation and change management experience
- Software Configuration Mgmt., Quality Control Mgmt, Version Control Mgmt
- Operating System - Linux/Mainframe
- Cloud platforms. OpenShift/PCF/AWS
- Database – Postgres/EDB/MariaDB, In-memory database – Redis
- Programming Languages – Java, Spring boot, ReactJS / COBOL, JCL, VSAM, CICS
- Application Servers – IBM WebSphere, JBoss
- Middleware technology – MQ, File transfers
- Eventing systems - Kafka
- Scheduling software – Tivoli workload scheduler for Open systems, Mainframe(z/OS)